quartz/Human-AI chat paradigm hamstrings the space of possibility.md at 80a902ee2b9433adf56bb54d68d4aeedca5e1f1a

mirror of https://github.com/jackyzha0/quartz.git synced 2025-12-19 10:54:06 -06:00

Vineeth Voruganti 80a902ee2b

chore: Delete old job postings and fix dates on notes (#81 )

2024-11-20 12:15:29 -05:00

1.9 KiB

Raw Blame History

title	date
Human-AI chat paradigm hamstrings the space of possibility	02.21.24

The human-AI chat paradigm assumes only two participants in a given interaction. While this is sufficient for conversations directly with un-augmented foundation models, it creates many obstacles when designing more sophisticated cognitive architectures. When you train/fine-tune a language model, you begin to reinforce token distributions that are appropriate to come in between the special tokens denoting human vs AI messages.

Here's a limited list of things besides a direct response we routinely want to generate:

A 'thought' about how to respond to the user
A Loose theory of mind imputations are superior to verbatim response predictions about the user's internal mental state
A list of ways to improve prediction
A list of items to search over storage
A 'plan' for how to approach a problem
A mock user response
A LLM Metacognition is inference about inference to consider the product of prior inference

In contrast, the current state of inference is akin to immediately blurting out the first thing that comes into your mind--something that humans with practiced aptitude in social cognition rarely do. But this is very hard given the fact that those types of responses don't ever come after the special AI message token. Not very flexible.

We're already anecdotally seeing well-trained completion models follow instructions impressively likely because of incorporation into pretraining. Is chat the next thing to be subsumed by general completion models? Because if so, flexibility in the types of inferences you can make would be very beneficial.

Metacognition then becomes something you can do at any step in a conversation. Same with instruction following & chat. Maybe this helps push LLMs in a much more general direction.

1.9 KiB Raw Blame History

1.9 KiB

Raw Blame History