x
Introspection or entropy? Re-examining concept-injection “introspection” in open models — LessWrong