MATS Program — LessWrong

MATS Program

Edited by Ryan Kidd et al. Last updated 18th Mar 2026.

The Machine Alignment, Transparency, and Security (MATS) Program is an independent research and educational seminar program that provides emerging researchers with mentorship, talks & workshops, research support, and connections with the SF Bay Area and London AI safety research communities.

Posts tagged MATS Program

SolidGoldMagikarp (plus, prompt generation)
Jessica Rumbelow, mwatkins
3y, 676 karma, 208 comments, relevance 8

SERI MATS Program - Winter 2022 Cohort
Ryan Kidd, Victor Warlop, Christian Smith
4y, 72 karma, 12 comments, relevance 8

Understanding and controlling a maze-solving policy network
TurnTrout, peligrietzer, Ulisse Mini, Monte M, David Udell
3y, 335 karma, 28 comments, relevance 7

Project proposal: Testing the IBP definition of agent
Jeremy Gillen, Thomas Larsen, JamesH
4y, 21 karma, 4 comments, relevance 6

Soft optimization makes the value target bigger
3y, 123 karma, 20 comments, relevance 5

SERI MATS - Summer 2023 Cohort
Aris, Ryan Kidd, Christian Smith
3y, 71 karma, 25 comments, relevance 5

SERI ML Alignment Theory Scholars Program 2022
Ryan Kidd, Victor Warlop, ozhang
4y, 69 karma, 6 comments, relevance 5

How MATS addresses “mass movement building” concerns
3y, 63 karma, 9 comments, relevance 5

Finite Factored Sets in Pictures
Magdalena Wache
3y, 186 karma, 35 comments, relevance 4

Taking the parameters which seem to matter and rotating them until they don't
4y, 120 karma, 48 comments, relevance 4

Talk: AI safety fieldbuilding at MATS
2y, 26 karma, 2 comments, relevance 4

Sycophancy Towards Researchers Drives Performative Misalignment
Taywon Min, rustem17, David Vella Zarb, Shi
2mo, 0 karma, 1 comment, relevance 4

Recontextualization Mitigates Specification Gaming Without Modifying the Specification
ariana_azarbal, Victor Gillioz, TurnTrout, cloud
7mo, 144 karma, 15 comments, relevance 3

Efficient Dictionary Learning with Switch Sparse Autoencoders
2y, 118 karma, 20 comments, relevance 3

I found >800 orthogonal "write code" steering vectors
Jacob G-W, TurnTrout
2y, 112 karma, 20 comments, relevance 3
