Sharp Left Turn

Applied to Reframing inner alignment by Vika 5mo ago

A Sharp Left Turn is a scenario where, as an AI trains, its capabilities generalize across many domains while the alignment properties that held at earlier stages fail to generalize to the new domains.

See also: Threat Models, AI Takeoff, AI Risk

Created by Multicore at 9mo