mirror of
https://github.com/jackyzha0/quartz.git
synced 2025-12-20 03:14:06 -06:00
add cyborgism
This commit is contained in:
parent
9ee225dbf2
commit
a6a6faab17
@ -57,8 +57,9 @@ And it can't hurt to [join Discord](https://discord.gg/plasticlabs) and introduc
|
|||||||
[Language Models Represent Space and Time](https://arxiv.org/pdf/2310.02207)
|
[Language Models Represent Space and Time](https://arxiv.org/pdf/2310.02207)
|
||||||
[Generative Agents: Interactive Simulacra of Human Behavior](https://arxiv.org/abs/2304.03442)
|
[Generative Agents: Interactive Simulacra of Human Behavior](https://arxiv.org/abs/2304.03442)
|
||||||
[Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge](https://arxiv.org/abs/2407.19594)
|
[Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge](https://arxiv.org/abs/2407.19594)
|
||||||
|
[Cyborgism](https://www.lesswrong.com/posts/bxt7uCiHam4QXrQAA/cyborgism)
|
||||||
[Spontaneous Reward Hacking in Iterative Self-Refinement](https://arxiv.org/abs/2407.04549)
|
[Spontaneous Reward Hacking in Iterative Self-Refinement](https://arxiv.org/abs/2407.04549)
|
||||||
[... accompanying twitter thread](https://x.com/JanePan_/status/1813208688343052639)
|
[... accompanying twitter thread](https://x.com/JanePan_/status/1813208688343052639)
|
||||||
|
|
||||||
|
|
||||||
(Back to [[Work at Plastic]])
|
(Back to [[Work at Plastic]])
|
||||||
|
|||||||
@ -50,8 +50,9 @@ And it can't hurt to [join Discord](https://discord.gg/plasticlabs) and introduc
|
|||||||
[Language Models Represent Space and Time](https://arxiv.org/pdf/2310.02207)
|
[Language Models Represent Space and Time](https://arxiv.org/pdf/2310.02207)
|
||||||
[Generative Agents: Interactive Simulacra of Human Behavior](https://arxiv.org/abs/2304.03442)
|
[Generative Agents: Interactive Simulacra of Human Behavior](https://arxiv.org/abs/2304.03442)
|
||||||
[Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge](https://arxiv.org/abs/2407.19594)
|
[Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge](https://arxiv.org/abs/2407.19594)
|
||||||
|
[Cyborgism](https://www.lesswrong.com/posts/bxt7uCiHam4QXrQAA/cyborgism)
|
||||||
[Spontaneous Reward Hacking in Iterative Self-Refinement](https://arxiv.org/abs/2407.04549)
|
[Spontaneous Reward Hacking in Iterative Self-Refinement](https://arxiv.org/abs/2407.04549)
|
||||||
[... accompanying twitter thread](https://x.com/JanePan_/status/1813208688343052639)
|
[... accompanying twitter thread](https://x.com/JanePan_/status/1813208688343052639)
|
||||||
|
|
||||||
|
|
||||||
(Back to [[Work at Plastic]])
|
(Back to [[Work at Plastic]])
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user