x
Naturally learned behaviors in deep MLPs resist detection by both human and learned algorithms — LessWrong