Notes on countermeasures for exploration hacking (aka sandbagging) — LessWrong