New Paper: It is time to move on from MCQs for LLM Evaluations — LessWrong