x

LESSWRONG
LW

Nelson Gardner-Challis

Nelson Gardner-Challis

Message

36

1

2y

Nelson Gardner-Challis

36

2y

Nelson Gardner-Challis — LessWrong

[Paper] When can we trust untrusted monitoring? A safety case sketch across collusion strategies

This research was completed for LASR Labs 2025 by Nelson Gardner-Challis, Jonathan Bostock, Georgiy Kozhevnikov and Morgan Sinclaire. The team was supervised by Joan Velja and Charlie Griffin (University of Oxford, UK AI Security Institute). The full paper can be found here. Tl;dr We did a deep dive into untrusted...

Widening AI Safety's talent pipeline by meeting people where they are

Summary The AI safety field has a pipeline problem: many skilled engineers and researchers are locked out of full‑time overseas fellowships. Our answer is the Technical Alignment Research Accelerator (TARA) — a 14‑week, part‑time program designed for talented professionals and students who can’t put their careers or studies on hold...

Sep 25, 2025•33