Model Integrity and Character
Published a post about integrity as a frame for trustworthiness in AI alignment, and how it relates to Claude's new constitution. Cross-posting as I think this might interest folk here. All feedback welcome! In October 1962, Soviet submarine B-59 was trapped underwater by American destroyers dropping depth charges. The captain,...
Feb 912