Alignment & Capabilities: what's the difference?
In the AI safety literature, AI alignment is often presented as conceptually distinct from capabilities. However, (1) the distinction seems somewhat fuzzy and (2) many techniques that are supposed to improve alignment also improve capabilities. (1) The distinction is fuzzy because one common way of defining alignment is getting an...
Sep 13, 2023