Appendices to the live agendas

Stag

Honestly this isn't that long, I might say to re-merge it with the main post. Normally I'm a huge proponent of breaking posts up smaller, but yours is literally trying to be an index, so breaking a piece off makes it harder to use.

[-]technicalities2y10

yeah you're right

[-]Steven Byrnes2y20

For what it’s worth, I am not doing (and have never done) any research remotely similar to your text “maybe we can get really high-quality alignment labels from brain data, maybe we can steer models by training humans to do activation engineering fast and intuitively”.

I have a concise and self-contained summary of my main research project here (Section 2).

[-]technicalities2y30

I care a lot! Will probably make a section for this in the main post under "Getting the model to learn what we want", thanks for the correction.

LESSWRONG
LW

LESSWRONG
LW

16

Appendices to the live agendas

16

16

Appendix: Prior enumerations

Appendix: Graveyard

Appendix: Biology for AI alignment

Human enhancement

Merging

As alignment aid

Appendix: Research support orgs

Appendix: Meta, mysteries, more