x
Critiquing Risks From Learned Optimization, and Avoiding Cached Theories — LessWrong