LESSWRONG
LW

604
Mia Hopman
104200
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
36Prompt optimization can enable AI control research
1mo
3
49Optimally Combining Probe Monitors and Black Box Monitors
Ω
3mo
Ω
2
30Untrusted AIs can exploit feedback in control protocols
5mo
0