mirror of
https://github.com/jackyzha0/quartz.git
synced 2025-12-19 10:54:06 -06:00
fill in rest of outline
This commit is contained in:
parent
9b93ab24ce
commit
156aad32e5
@ -5,6 +5,8 @@ date: Dec 19, 2023
|
||||
(meme)
|
||||
## TL;DR
|
||||
|
||||
## Defining Terms
|
||||
|
||||
## Background and Related Work
|
||||
|
||||
(Def wanna give this a more creative name)
|
||||
@ -15,4 +17,10 @@ date: Dec 19, 2023
|
||||
- what they all have in common is they're trying to explore a problem space as exhaustively as possible, providing a large number of diverse examples to evaluate on (MMLU - language understanding, HumanEval - coding, HellaSwag - reasoning)
|
||||
- high performance on these datasets demonstrates incredible *general* abilities
|
||||
- and in fact their performance on these diverse datasets proves their capabilities are probably much more vast than we think they are
|
||||
- but they're not given the opportunity to query these diverse capabilities in current user-facing systems
|
||||
- but they're not given the opportunity to query these diverse capabilities in current user-facing systems
|
||||
|
||||
## How We've Explored It
|
||||
|
||||
## Selective Metacog Taxonomy
|
||||
|
||||
## The Future/Potential/Importance
|
||||
|
||||
Loading…
Reference in New Issue
Block a user