Anomalous Concept Detection for Detecting Hidden Cognition — LessWrong