LESSWRONG

Hypothesis Subspace

Sep 12, 2022 by Paul Bricman

A living collection of alignment proposals I'm exploring at Refine, a program hosted by Conjecture.

Oversight Leagues: The Training Game as a Feature
Ideological Inference Engines: Making Deontology Differentiable*
Representational Tethers: Tying AI Latents To Human Ones
Interlude: But Who Optimizes The Optimizer?
(Structural) Stability of Coupled Optimizers
Cataloguing Priors in Theory and Practice