Published a post about integrity as a frame for trustworthiness in AI alignment, and how it relates to Claude's new constitution.
Cross-posting as I think this might interest folk here. All feedback welcome!
In October 1962, Soviet submarine B-59 was trapped underwater by American destroyers dropping depth charges. The captain, cut off from Moscow and believing war had begun, wanted to launch a nuclear torpedo. Launch required three officers. One of them, Vasily Arkhipov, refused, and nuclear war was averted.
A few months earlier in Jerusalem, Adolf Eichmann was put on trial for organizing the logistics of the Holocaust. His defense, famously documented in Hannah Arendt’s “Eichmann in Jerusalem: A Report on the Banality of... (read 1624 more words →)