Generating the Funniest Joke with RL (according to GPT-4.1) — LessWrong