Jesse Hoogland — LessWrong

Cofounder at Resolution. Previously, executive director at Timaeus, where I worked on applications of singular learning theory and developmental interpretability.

Website: jessehoogland.com

Twitter: @jesse_hoogland

3480

Ω

877

28

91

6y

Top posts

Resolution (fka Sequent): scale and automation for higher confidence in alignment

EDIT: We originally launched under the name Sequent. Read why we renamed to Resolution.

Alignment is not on track

Artificial superintelligence (ASI) may be developed in the next few years. It is unclear whether alignment is on track to be ready on the same timeframe. At a minimum, the empirical programs at AI labs are unlikely to deliver a priori confidence, before training ASI, that things will go well.

We are starting a large nonprofit research organization, Resolution, that aims to clear a higher bar:

1. We are aiming at higher confidence via a portfolio of theory and empirics bets, all of which could fail, such that if any succeed, they would give us more a priori confidence in aligned outcomes.
2. We are investing heavily in automation to accelerate progress on these bets.
3. We believe that theory unlocks higher automation. Taking a more principled approach offers better filters for deciding which directions of automated research are promising (a proof is worth a thousand experiments, and even a pseudo-proof is worth hundreds).

Who[1]: researchers from the UK AISI’s Alignment Team and Timaeus, with more to come. We’re aiming at 40-80 FTE two years from now. The Alignment Team ran the £30m Alignment Project, and Timaeus has pioneered applying singular learning theory (SLT) to alignment.

Founding team:

* Geoffrey Irving: Chief Scientist at UK AISI; ex-DeepMind, OpenAI, and Google Brain.
* Daniel Murfet: Head of Research at Timaeus; left tenure to pioneer SLT for alignment.
* AISI Alignment: Alex Holness-Tofts and Jacob Pfau.
* Timaeus: Jesse Hoogland, Stan van Wingerden, and Marco Cozzi.
* Joined by researchers from Timaeus and more researchers from the UK AISI’s Alignment Team.

Where: a large in-person presence in the Bay Area (Berkeley), as well as researchers working remotely from London, Melbourne, and elsewhere.

In this post, we discuss:

* What it means to aim at higher confidence
* Why start a new big organization
* Whether sufficien

Neural networks generalize because of this one weird trick

215 • Jan 18, 2023

Towards Developmental Interpretability

195 • Jul 12, 2023

Announcing Timaeus

188 • Oct 22, 2023

Announcing our $160M grant from Coefficient Giving

by Geoffrey Irving, Jesse Hoogland, Alex HT, Jacob Pfau, Daniel Murfet, Marco Cozzi, and Stan van Wingerden

We are excited to announce that Resolution (fka Sequent) has a $160M grant from Coefficient Giving (cG) to put rigorous alignment research on a (closer to) even footing with the frontier labs. We will use it to accelerate progress towards higher-confidence alignment, or to find evidence and obstacles showing why...

Resolution (fka Sequent): scale and automation for higher confidence in alignment

by Geoffrey Irving, Alex HT, Jesse Hoogland, Daniel Murfet, Jacob Pfau, Marco Cozzi, and Stan van Wingerden

EDIT: We originally launched under the name Sequent. Read why we renamed to Resolution. Alignment is not on track Artificial superintelligence (ASI) may be developed in the next few years. It is unclear whether alignment is on track to be ready on the same timeframe. At a minimum, the empirical...

SLT for AI Safety

> This sequence draws from a position paper co-written with Simon Pepin Lehalleur, Jesse Hoogland, Matthew Farrugia-Roberts, Susan Wei, Alexander Gietelink Oldenziel, Stan van Wingerden, George Wang, Zach Furman, Liam Carroll, Daniel Murfet. Thank you to Stan, Dan, and Simon for providing feedback on this post. Alignment ⊆ Capabilities. As...

Jul 1, 2025•78

The Sweet Lesson: AI Safety Should Scale With Compute

A corollary of Sutton's Bitter Lesson is that solutions to AI safety should scale with compute.[1] Let's consider a few examples of research directions that are aiming at this property:

* Deliberative Alignment: Combine chain-of-thought with Constitutional AI to improve safety with inference-time compute (see Guan et al. 2025, Figure...

May 5, 2025•98

Timaeus in 2024

> TLDR: We made substantial progress in 2024:
>
> * We published a series of papers that verify key predictions of Singular Learning Theory (SLT) [1, 2, 3, 4, 5, 6].
> * We scaled key SLT-derived techniques to models with billions of parameters, eliminating our main concerns around...

Feb 20, 2025•100

The Simplest Good

Common Law AI worked better than anyone expected. Dr. Sarah Chen was skeptical from the start. "You're essentially training them to be moral judges," she warned during the initial architecture review. "What if they overfit on ethics?" The room laughed. "Better than the alternative," someone quipped. The idea was simple...

Feb 2, 2025•76

Kessler's Second Syndrome

It started as so many dooms do, with a flash in the night sky over the South China Sea. Testing a new ASAT weapon, the Chinese military shattered a derelict spy satellite into 40,000 shards of shrapnel. The debris pattern suggested a fragmentation warhead optimized for lethal scatter. Within 48...

Jan 26, 2025•70
