x
Test your interpretability techniques by de-censoring Chinese models — LessWrong