SERI MATS

Contributors
Multicore

SERI MATS is the Stanford Existential Risks Initiative ML Alignment Theory Scholars program.

https://www.serimats.org/

Posts tagged SERI MATS
Relevance | Karma | Title | Authors | Age | Comments
7 | 654 | SolidGoldMagikarp (plus, prompt generation) Ω | Jessica Rumbelow, mwatkins | 4mo | 199
7 | 303 | Understanding and controlling a maze-solving policy network Ω | TurnTrout, peligrietzer, Ulisse Mini, Monte M, David Udell | 3mo | 22
7 | 71 | SERI MATS Program - Winter 2022 Cohort Ω | Ryan Kidd, Victor Warlop, Christian Smith | 8mo | 12
6 | 21 | Project proposal: Testing the IBP definition of agent | Jeremy Gillen, Thomas Larsen, JamesH | 10mo | 4
5 | 106 | Soft optimization makes the value target bigger Ω | Jeremy Gillen | 5mo | 20
4 | 119 | Taking the parameters which seem to matter and rotating them until they don't | Garrett Baker | 10mo | 48
3 | 104 | Predictions for shard theory mechanistic interpretability results Ω | TurnTrout, Ulisse Mini, peligrietzer | 4mo | 9
3 | 74 | Neural Tangent Kernel Distillation | Thomas Larsen, Jeremy Gillen | 8mo | 20
3 | 26 | Normative vs Descriptive Models of Agency Ω | mattmacdermott | 4mo | 5
2 | 82 | Qualities that alignment mentors value in junior researchers | Akash | 4mo | 13
2 | 53 | Consequentialists: One-Way Pattern Traps | David Udell | 5mo | 3
2 | 53 | Conditioning Generative Models for Alignment Ω | Jozdien | 1y | 8
2 | 51 | More findings on Memorization and double descent Ω | Marius Hobbhahn | 4mo | 2
2 | 48 | Race Along Rashomon Ridge Ω | Stephen Fowler, Peter S. Park, MichaelEinhorn | 1y | 15
2 | 44 | Behavioural statistics for a maze-solving agent Ω | peligrietzer, TurnTrout | 2mo | 11