LESSWRONGHypothesis Subspace
LW

Hypothesis Subspace

Sep 12, 2022 by Paul Bricman

A living collection of alignment proposals I'm exploring at Refine, a program hosted by Conjecture.

20Oversight Leagues: The Training Game as a FeatureΩ
Paul Bricman
9mo
Ω
6
6Ideological Inference Engines: Making Deontology Differentiable*Ω
Paul Bricman
9mo
Ω
0
30Representational Tethers: Tying AI Latents To Human OnesΩ
Paul Bricman
9mo
Ω
0
15Interlude: But Who Optimizes The Optimizer?Ω
Paul Bricman
9mo
Ω
0
25(Structural) Stability of Coupled OptimizersΩ
Paul Bricman
8mo
Ω
0
9Boolean Primitives for Coupled Optimizers
Paul Bricman
8mo
0
13Cataloguing Priors in Theory and PracticeΩ
Paul Bricman
8mo
Ω
8