A Tractarian Filter for Safer Language Models — LessWrong