LLM Personas — LessWrong
LLM Personas
This page is a stub.
Posts tagged LLM Personas
Most Relevant
10
201
Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment
Ω
Cam, Puria, Kyle O’Brien, David Africa, Samuel Ratnam, andyk
5mo
Ω
25
8
712
Simulators
Ω
janus
4y
Ω
169
8
417
the void
Ω
nostalgebraist
1y
Ω
108
7
25
Experimental Evidence for Simulator Theory— Part 1: Emergent Misalignment and Weird Generalizations
RogerDearnaley
2mo
0
7
21
Experimental Evidence for Simulator Theory— Part 2: The Scalers Strike Back
RogerDearnaley
2mo
0
6
764
The Rise of Parasitic AI
Adele Lopez
8mo
191
6
260
A Three-Layer Model of LLM Psychology
Ω
Jan_Kulveit
1y
Ω
17
6
176
Persona Parasitology
Raymond Douglas
3mo
38
6
106
Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training
Ω
RogerDearnaley
4mo
Ω
12
6
78
Shaping the exploration of the motivation-space matters for AI safety
Maxime Riché, Victor Gillioz, nielsrolf, Kajetan Dymkiewicz, Filip Sondej, RogerDearnaley, Daniel Tan, dillonkn
3mo
15
5
120
A Case for Model Persona Research
nielsrolf, Maxime Riché, Daniel Tan
5mo
11
4
68
The Bleeding Mind
Ω
Adele Lopez
5mo
Ω
9
4
40
Selection Pressures on LM Personas
Ω
Raymond Douglas
1y
Ω
0
3
68
Concrete research ideas on AI personas
nielsrolf, Maxime Riché, Daniel Tan
4mo
10
2
442
Claude 4.5 Opus' Soul Document
Richard Weiss
6mo
44