x

LESSWRONG

LW

lisunshiny — LessWrong

lisunshiny

lisunshiny

Message

8

1

16d

lisunshiny

8

16d

AI emotions and aligned behavior

I participated in the BlueDot Technical AI Safety Project Sprint (April-May 2026) to better understand the field of AI Safety Research. This blog post summarizes my findings. Introduction Most AI safety research concentrates on two levers: alignment (building models that genuinely share human values) and control (layering in security and...