AI Safety — LessWrong
AI Safety
This page is a stub.
Posts tagged AI Safety
1
47
Synthetic Persona Pretraining: Alignment from Token Zero
Julian Minder, Raghav Singhal, Viktor Moskvoretskii, Stefan Krsteski, ashtonanderson, rolandaydin, Robert West
12h
2
1
34
The case for fine-grained tracking of compute for AI
Farhan, Katherine Biewer
7d
10
1
15
From 8B to Frontier: How System Prompts Control Whether AI Agents Blackmail, Leak, and Kill
Chijioke Ugwuanyi
18h
2