x
Can activation verbalizers surface an internal chain of thought? — LessWrong