LESSWRONGTags
LW

SERI MATS

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
SERI MATS
Random Tag
Contributors
1Multicore

The Stanford Existential Risks Initiative ML Alignment Theory Scholars program.

https://www.serimats.org/

Posts tagged SERI MATS
Most Relevant
7
71SERI MATS Program - Winter 2022 CohortΩ
Ryan Kidd, Victor Warlop, Christian Smith
6mo
Ω
12
6
21Project proposal: Testing the IBP definition of agent
Jeremy Gillen, Thomas Larsen, JamesH
8mo
4
4
119Taking the parameters which seem to matter and rotating them until they don't
Garrett Baker
7mo
48
3
646SolidGoldMagikarp (plus, prompt generation)Ω
Jessica Rumbelow, mwatkins
2mo
Ω
194
3
286Understanding and controlling a maze-solving policy networkΩ
TurnTrout, peligrietzer, Ulisse Mini, montemac, David Udell
15d
Ω
15
3
22Normative vs Descriptive Models of AgencyΩ
mattmacdermott
2mo
Ω
5
2
100Soft optimization makes the value target biggerΩ
Jeremy Gillen
3mo
Ω
18
2
94Predictions for shard theory mechanistic interpretability resultsΩ
TurnTrout, Ulisse Mini, peligrietzer
1mo
Ω
9
2
76Qualities that alignment mentors value in junior researchers
Akash
1mo
13
2
72Neural Tangent Kernel Distillation
Thomas Larsen, Jeremy Gillen
6mo
20
2
52Conditioning Generative Models for AlignmentΩ
Jozdien
8mo
Ω
8
2
49More findings on Memorization and double descentΩ
Marius Hobbhahn
2mo
Ω
2
2
48Race Along Rashomon RidgeΩ
Stephen Fowler, Peter S. Park, MichaelEinhorn
9mo
Ω
15
2
47Consequentialists: One-Way Pattern Traps
David Udell
2mo
3
2
29Broad Basins and Data Compression
Jeremy Gillen, Stephen Fowler, Thomas Larsen
8mo
6
Load More (15/100)
Add Posts