[Research Note] Optimizing The Final Output Can Obfuscate CoT — LessWrong