AI Is Bad at Physics
There’s a new preprint from Peking University in China that assesses LLM capabilities in reproducing results from experimental physics papers. Their finding? All the agents had a 0% “end-to-end callback rate,” i.e. they were incapable of reproducing any full, numerical results from any of the papers. Other tests showed that...
Apr 2710