[linkpost] Self-Rewarding Language Models — LessWrong