x

LESSWRONG

LW

Austin McCaffrey — LessWrong

Austin McCaffrey

Austin McCaffrey

Message

2

3

9mo

Austin McCaffrey

9mo

Aurelius: Proposing Alignment as an Emergent Property

We've published a new whitepaper for Aurelius, a decentralized protocol designed to generate alignment training data under a completely new data paradigm, one that proposes alignment as an emergent property of multi-agent systems. The protocol operates as Subnet 37 on Bittensor, where independent participants compete to evolve alignment environments and...

Aurelius: A Peer-to-Peer Alignment Protocol

We’ve just published the first whitepaper for Aurelius, a decentralized protocol designed to generate verifiable alignment data through adversarial prompting, reasoning transparency, and contestable evaluation. The system draws from cryptoeconomic design principles (inspired by Bittensor and Bitcoin) but applies them to the AI alignment problem, with interpretability and reasoning coherence...

Jul 17, 2025•3