Predicting LLM Safety Before Release by Simulating Deployment — LessWrong