Towards a Rigorous Model of Virtue-Signalling — LessWrong