x
A benchmark for vericoding: formally verified program synthesis — LessWrong