When does capability elicitation bound risk? — LessWrong