LESSWRONG

1107
Mia Hopman
72200

Posts

Prompt optimization can enable AI control research (13 karma, 13h, 1 comment)
Optimally Combining Probe Monitors and Black Box Monitors (40 karma, Ω, 2mo, 2 comments)
Untrusted AIs can exploit feedback in control protocols (30 karma, 4mo, 0 comments)