FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI — LessWrong