The Goddess of Everything Else - The Animation
This is an animation of The Goddess of Everything Else, by Scott Alexander. I hope you enjoy it :)
In this video, we explain how Anthropic trained "sleeper agent" AIs to study deception. A "sleeper agent" is an AI model that behaves normally until it encounters a specific trigger in the prompt, at which point it awakens and carries out a harmful behavior. Anthropic found that they couldn't undo the...
In this video, we walk you through a plausible scenario in which AI could lead to humanity’s extinction. There are many alternative possibilities, but this time we focus on superhuman AIs developing misaligned personas, similar to how Microsoft’s Bing Chat developed the misaligned “Sydney” persona shortly after its release. This...
In the future, AIs will likely be much smarter than we are. They'll produce outputs that may be difficult for humans to evaluate, either because evaluation is too labor-intensive, or because it's qualitatively hard to judge the actions of machines smarter than us. This is the problem of “scalable oversight.”...
Rational Animations takes a look at Tom Davidson's Takeoff Speeds model (https://takeoffspeeds.com). The model uses formulas from economics to answer two questions: how long do we have until AI automates 100% of human cognitive labor, and how fast will that transition happen? The primary scriptwriter was Allen Liu (the first...
This video extrapolates the future of AI progress, following a timeline that runs from today’s chatbots to future AI vastly smarter than all of humanity combined, with God-like capabilities. We argue that such AIs will pose a significant extinction risk to humanity. This video came out of a...
In this Rational Animations video, we look at dangerous knowledge: information hazards (infohazards) and external information hazards (exfohazards). We talk about one way they can be classified, what kinds of dangers they pose, and the dangers that come from too much secrecy. The primary scriptwriter was Allen Liu (the first...
Below is Rational Animations' new video about Goal Misgeneralization. It explores the topic through three lenses:
* How humans are an example of goal misgeneralization with respect to evolution's implicit goals.
* An example of goal misgeneralization in a very simple AI setting.
* How deceptive alignment shares key features...