LESSWRONG

1672
Mary Phuong

Posts

Comments
AI companies aren't really using external evaluators
Mary Phuong, 1y

The latest Gemini tech report has some more info on GDM external safety testing: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf 
(section 9.6, p. 71)

Evaluating and monitoring for AI scheming (52 karma, 3mo, 9 comments)
Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring (76 karma, 4mo, 17 comments)
Threat Model Literature Review (79 karma, 3y, 4 comments)
Clarifying AI X-risk (127 karma, 3y, 24 comments)