Alignment Gaps
Misaligned agendas and terminology among academic, industrial and independent AI alignment research
This post aims to fill some gaps between technical AI Alignment topics and academic AI research. It summarises a quick but informed scouting of academic research papers that are closely connected to four Alignment topics: mechanistic interpretability, safety...
Jun 8, 2024