x

LESSWRONG

LW

Sai Sasank Y — LessWrong

Sai Sasank Y

Sai Sasank Y

Message

21

1

3

6y

Sai Sasank Y

21

6y

Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents

by Sam F. Brown, BasilLabib, Codruta (Coco) Lugoj, and Sai Sasank Y

Summary * Scaffolded LLM agents are, in principle, able to execute arbitrary code to achieve the goals they have been set. One such goal could be self-improvement. * This post outlines our plans to build a benchmark to measure the ability of LLM agents to modify and improve other LLM...

Jul 22, 2024•20

What are some open exposition problems in AI?

Basically, topics or ideas that are not well explained, but could benefit from good expositions.

Aug 16, 2021•4