Why could current AI alignment strategies fail against unchained AIs?
Currently, AI alignment is more focused on a multi-layer safety strategy to refuse risky requests from users. This approach works well for LLMs and today’s chatbots. However, this ignores a more dangerous threat: The possibility that someone, at some point, runs an unchained AI without these limits. By unchained AI,...