Can Models be Evaluation Aware Without Explicit Verbalization? — LessWrong