Does Anthropic’s Constitution Really Capture Virtue Ethics? Toward a virtue ethical alternative to Constitutional AI (with comments by Claude)
TL;DR: Constitutional AI remains largely rule-based rather than fully character-based, as it should be. We propose a virtue-ethical alternative based on holistic human intuitions. Introduction Anthropic’s Constitutional AI proposes an ambitious strategy for aligning advanced AI systems. Instead of relying solely on human feedback, the model is trained to follow...
Mar 313