LESSWRONG
LW

1508
Mia Hopman
109200
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
39Prompt optimization can enable AI control research
2mo
4
51Optimally Combining Probe Monitors and Black Box Monitors
Ω
4mo
Ω
2
30Untrusted AIs can exploit feedback in control protocols
6mo
0