AI Safety in secret — LessWrong