Trustworthy and untrustworthy models — LessWrong