x
On the discordance between AI systems' internal states and their outputs — LessWrong