x
A Tractarian Filter for Safer Language Models — LessWrong