AI to Assist Mathematical Reasoning: A Workshop

2 June, 2023 in advertising | Tags: Artificial Intelligence | by Terence Tao

The National Academies of Science, Engineering, and Mathematics are hosting a virtual workshop on the topic of “AI to Assist Mathematical Reasoning” from June 12-14. The tentative program can be found here. I am one of the members of the organizing committee for this workshop, together with Petros Koumoutsakos, Jordan Ellenberg, Melvin Greer, Brendan Hassett, Yann A. LeCun, Heather Macbeth, Talia Ringer, Kavitha Srinivas, and Michelle Schwalbe. There is some thematic overlap (and a few speakers in common) with the recent IPAM program on machine assisted proof, though with more of a focus on the current and projected technical capabilities of machine learning algorithms for mathematics. Registration for the event is currently open at the web page for the workshop.

21 comments

Comments feed for this article

2 June, 2023 at 11:15 am

Guilherme Rocha de Rezende

Thanks for the tips

2 June, 2023 at 11:20 am

Nikita Sidorov

Sorry for the offtopic, Terry, but what is your opinion regarding Per Enflo’s recent theorem on the existence of proper closed invariant subspaces for any bounded linear operator on a separable Hilbert space?

2 June, 2023 at 11:21 am

Ashok Khanna

Would you be able to record these sessions as timezone (Australia) and work commitments make it difficult to watch live? Really excited to listen to these workshops!

2 June, 2023 at 4:42 pm

Amadi

Common get out of here.

3 June, 2023 at 10:05 pm

Anonymous

pretty sure it will be on Youtube.

2 June, 2023 at 1:42 pm

Anonymous

A possible drawbackof such AI assistance is to reduce human creativity (by relying on AI inteligence – therby becoming more “lazy” and degenarating creative thinking!)

2 June, 2023 at 8:42 pm

mathematicalsilence

Anonymous u could well do some creative thinking and learn how to spell … thereby generating a smidgin of credence for ur comments on ‘intelligence’ … note ‘ur’ is an AI approved … perhaps lazy … spelling of ‘your’

2 June, 2023 at 3:15 pm

AI辅助数学推理：研讨会 - 偏执的码农

[…] 详情参考 […]

2 June, 2023 at 4:53 pm

Mathur Maynard Amadi

Good day, Terence Tao.

What is the difference between Laplace Geometry and Cosmic Geometry?

What is the difference between Gravitational Fields and Magnetic fields?

3 June, 2023 at 4:37 am

Arman

Thanks PROF

7 June, 2023 at 4:13 am

Arnie Bebita-Dris

Thank you for this, Professor.

7 June, 2023 at 5:38 am

Anonymous

Regarding proof verification.. do we need ML when we have PCP?

9 June, 2023 at 9:35 am

Terence Tao

Probabilistically checkable proofs are practically feasible in a few limited cases (for instance, in checking that a given large integer is a (strong) probable prime), but for most types of mathematical statements one would like to know to be true, it would be impractical to set up a probabilistically checkable proof (the size of the probabilistically checkable proof, while in principle of “only” polynomial size in the length of the original proof, could be well beyond the capacity of current technology to create or store).

9 June, 2023 at 10:17 am

Anonymous

As I understand, in many cases proofs can be broken up. In many cases, it only takes a critical statement to fail and they fail for a certain reason which (should be at least in many non-trivial cases) can be dissected into smaller chunks and verified. Of course the dissection might need human intervention.

It is extremely plausible with ‘LLM type’ techniques and with such a strategy at least tedious hw grading could be automated to find flaws and correctness in arguments or jumps in arguments.

11 June, 2023 at 2:31 pm

Anonymous

https://people.csail.mit.edu/dmoshkov/courses/pcp/dinur_pcp_overview.pdf states on page 4 (top) “Classical proofs of length n are converted to PCPs of
length n · (logn)^(O(1)) in the work of Dinur”.

Why is this too long compared to the standard latex typed proof?

12 June, 2023 at 10:57 am

Anonymous

Up on reading the paper a bit it appears (??) PCPs of length just n (log(n))^2 suffices. A proof of length 10MBis going to be blown up to 10GB or so which can fit in the RAM of macbook air! Why go through this ML business?

12 June, 2023 at 8:07 pm

Terence Tao

The absolute constant in the O() notation may be non-negligible currently. But even disregarding the issue of the constant, PCPs aren’t really addressing the main issue here. The problem that needs solving isn’t that of converting a 10MB proof that could already be formally verified in a standard proof verifier such as Lean, into a 10GB PCP that can be probabilistically verified by some zero-knowledge protocol; the bottleneck is in producing the 10MB formally verifiable proof in the first place. Note that one cannot simply apply Dinur’s algorithm to, say, a 200-page proof written by a human mathematician, because at present such proofs are usually written in semi-formal mathematical English (or perhaps some other human language) and are not in a form that is easily converted into a PCP (with some rare exceptions, such as primality testing, as I mentioned previously.)

14 June, 2023 at 11:16 am

Anonymous

“The absolute constant in the O() notation may be non-negligible currently”.. I think the length of the proof is <= n (log n)^2. There is no O() notation involved here.

But agree converting regular human proofs to a standard computer proof (SCP) before conversion to PCPs is the bottleneck. But with current status of progress in AI, is it possible this bottleneck might be progressed eventually to a non-issue (in say a decade or two)? Might it involve converting standard tomes such as classification of simple groups, EGA & SGA etc into SCPs to start with?

[I severely doubt that there is no implied constant here, since the very notion of “length of proof” depends on the encoding (e.g., binary or ASCII), up to constants. -T]

12 June, 2023 at 8:37 am

Anonymous

This is a very topical workshop, since ChatGPT does reasonably well on math (GHOSTS datasets for mathematics: https://arxiv.org/abs/2301.13867 )

14 June, 2023 at 2:38 pm

Lars Ericson

Thanks for organizing this workshop. A few comments:
* Funding sources identified: Microsoft, Google, DARPA, Simons Foundation. Why does this matter? A key worker in the field complained recently that funding had dried up for interactive theorem proving, that AMS prioritized automated theorem proving over interactive theorem proving, and that mathematicians who originate new proofs in Math departments didn’t respect people who “just” worked on mechanically verifying those proofs. The people who do Lean and Coq need support or the projects could die out.
* Key use case (and maybe opportunity for funding): AMS and similar should encourage and incentivize mathematicians to use Lean or Coq to formally check their new proofs. Why? (a) Wiles’ proof of FLT took several years for mathematicians to have faith in it. A formally verified proof would take 0 years and 0 faith to accept. (b) It was noted that Wiles’ proof has a lot of “weasel words” like “analogous to” for constructions. With an interactive theorem proof, Wiles’ would have been forced to make these mappings explicit. (c) Some claim that no one mathematician, including Wile, has the whole proof of FLT in their heads. Is this true? Making mathematicians “program” their proofs in Coq or Lean would help suss this out. Mathematicians already use LaTeX all the time. There is no reason they should add Coq or Lean to their toolkit for daily use.
* So Terry, please learn Lean and start doing some proofs in Lean. It will be a great experience for you, and that will help you guide the AI/Math field. Maybe you can do FLT? It’s been waiting for 28 years for fully automated verification. You could do it in 2 weeks. You will be modelling this for other mathematicians. There is really a huge social gap between “mathematician’s mathematicians” doing pure math for math’s sake, and people who do automated theorem proving.
* AI? Meh. GPT doesn’t have the necessary structure to learn this material. People didn’t really talk about what it would take to extend it. Formally verifying proofs is computationally hard, depending on the number of variables in a goal state of the proof, even when you do it directly and as efficiently as you know how.

3 July, 2023 at 8:06 am

Lars Ericson

A NY Times article came out on the workshop:

The article mentions Heather MacBeth’s introductory Lean course:

https://github.com/hrmacbeth/math2001

Not to mention the classic Natural Number Game by Kevin Buzzard and Mohammad Pedramfar:

https://www.ma.imperial.ac.uk/~buzzard/xena/natural_number_game/

The article also mentions concerns by Michael Harris at Columbia, who vets the conference in multiple posts and gives low marks for computer scientists imposing methodology and value judgments about approach to work on mathematicians, and expresses concerns about access to models, the source code of models, and sufficient resources to reproduce training for models; and concern that the money for this work comes mostly and ultimately from DoD and IC:

https://siliconreckoner.substack.com/p/right-on-cue-the-military-industrial
https://siliconreckoner.substack.com/p/my-shallow-thoughts-about-deep-learning
https://siliconreckoner.substack.com/p/my-shallow-thoughts-about-deep-learning-917
https://siliconreckoner.substack.com/p/my-shallow-thoughts-about-deep-learning-186

	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on On writing
	Anonymous on 275A, Notes 3: The weak and st…
	Vahid on What is a gauge?
	AI颠覆数学研究！陶哲轩借AI破解数学猜… on Formalizing the proof of PFR i…
	Anonymous on Course announcement: Math 246A…
	Anonymous on Erratum for “An inverse…
	Anonymous on 245C, Notes 3: Distributi…
	Anonymous on Analysis I
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…

AI to Assist Mathematical Reasoning: A Workshop

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

21 comments

Leave a comment Cancel reply

For commenters

AI to Assist Mathematical Reasoning: A Workshop

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

21 comments

Leave a comment Cancel reply

For commenters