


| title | date | tags | author | description |
| --- | --- | --- | --- | --- |
| LLM Metacognition is inference about inference | 03.26.24 | notes | Courtland Leer | Defining metacognition in LLMs as running inference on prior inference outputs--a critical architecture for building rich user representations. |

For wetware, metacognition is typically defined as ‘thinking about thinking’, or is often used as a catch-all for any ‘higher-level’ cognition.

(In some more specific domains, it's an introspective process focused exclusively on your own thinking, or a suite of personal learning strategies. These are all valid within their purview, but too constrained for our purposes.)

In large language models, the synthetic analogue of cognition is inference. So we can reasonably define a metacognitive process in an LLM architecture as any process that runs inference on the output of prior inference. That is, inference itself is used as context--inference about inference.
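As a minimal sketch of that definition, the Python below runs one inference and then feeds its output into a second one. The names `metacognitive_pass` and `toy_infer` are hypothetical, and `toy_infer` is a deterministic stub standing in for a real model call; nothing here reflects a specific library or Plastic's actual implementation.

```python
from typing import Callable

# Assumption: a "model" is any function from a prompt string to an output string.
Infer = Callable[[str], str]


def metacognitive_pass(infer: Infer, user_message: str) -> tuple[str, str]:
    """Run inference, then run inference again on that first output."""
    # First-order inference: a claim about the user's message.
    first = infer(f"What is the user trying to do?\nMessage: {user_message}")
    # Second-order inference: the prior output is now the context.
    second = infer(f"What follows from this earlier inference?\nInference: {first}")
    return first, second


def toy_infer(prompt: str) -> str:
    """Hypothetical stub: echoes the last prompt line so the data flow is visible."""
    return "INFERRED[" + prompt.splitlines()[-1] + "]"


first, second = metacognitive_pass(toy_infer, "Help me plan a study schedule")
print(first)
print(second)
```

With the stub, the second output visibly wraps the first, which is the whole point: the second call never sees the user's message directly, only the inference made about it.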

The output of that prior inference might be instantly injected into the next prompt, stored for later use, or leveraged by another model. This kind of architecture is critical when dealing with user context, since LLMs can run inference about user behavior, then use that synthetic context in the future. Experiments here will be critical to overcome the way machine learning is fixated on task performance. For us at Plastic, one of the most interesting species of metacognition is theory of mind: loose theory of mind imputations are superior to verbatim response predictions when it comes to forming high-fidelity representations of users.
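One way to picture the "stored for later use" path is a small per-user store of prior inferences that gets prepended to later prompts. This is only a sketch under stated assumptions: `UserContext`, `observe`, `respond`, and `toy_infer` are hypothetical names, and the stub model simply records the prompts it receives.

```python
from dataclasses import dataclass, field
from typing import Callable

# Assumption: a "model" is any function from a prompt string to an output string.
Infer = Callable[[str], str]


@dataclass
class UserContext:
    """Hypothetical per-user store of synthetic context (prior inferences)."""
    notes: list[str] = field(default_factory=list)

    def remember(self, inference: str) -> None:
        self.notes.append(inference)

    def as_prompt_prefix(self) -> str:
        return "\n".join(f"- {note}" for note in self.notes)


def observe(infer: Infer, ctx: UserContext, message: str) -> str:
    """Run inference about user behavior and store the result for later."""
    inference = infer(f"What does this message suggest about the user?\n{message}")
    ctx.remember(inference)
    return inference


def respond(infer: Infer, ctx: UserContext, message: str) -> str:
    """Answer a new message with the stored inferences injected as context."""
    prompt = f"Known about the user:\n{ctx.as_prompt_prefix()}\n\nUser: {message}"
    return infer(prompt)


seen_prompts: list[str] = []


def toy_infer(prompt: str) -> str:
    """Hypothetical stub: records each prompt and returns a numbered note."""
    seen_prompts.append(prompt)
    return f"note #{len(seen_prompts)}"


ctx = UserContext()
note = observe(toy_infer, ctx, "I keep forgetting my deadlines")
reply = respond(toy_infer, ctx, "Can you help me plan next week?")
```

Passing a different `infer` function to `respond` than to `observe` would cover the case where the stored inference is leveraged by another model.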
