Comparative Analysis of Black Box Methods for Detecting Evaluation Awareness in LLMs — LessWrong