Safe ASI Is Achievable: The Finite Game Argument
A few days ago, Anthropic abandoned the central pledge of its Responsible Scaling Policy: the promise that it would never train an AI system unless it could guarantee in advance that its safety measures were adequate. The stated reason: unilateral safety commitments don't make sense when competitors are racing ahead...