Automatic Theorem Proving

Automatic Theorem Proving

Overviews

Jordan Meadows & André Freitas 2022 - A Survey in Mathematical Language Processing
Lu et al 2022 - A Survey of Deep Learning for Mathematical Reasoning
Li et al 2024 - Survey on Deep Learning for Theorem Proving
Formalizing mathematics
- Kaliszyk & Rabe 2020 - A Survey of Languages for Formalizing Mathematics
Autoformalization
- Weng et al 2025 - Autoformalization in the Era of Large Language Models: A Survey

Papers

Cramer et al 2009 - The Naproche Project Controlled Natural Language Proof Checking of Mathematical Texts
Kaliszyk et al 2013 - Learning-Assisted Automated Reasoning with Flyspeck
Alemi et al 2016 - DeepMath - Deep Sequence Models for Premise Selection
Kaliszyk et al 2017 - HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving
Müller & Kaliszyk 2021 - Disambiguating Symbolic Expressions in Informal Documents
Welleck et al 2021 - NaturalProofs: Mathematical Theorem Proving in Natural Language Dataset
Welleck et al 2022 - NaturalProver: Grounded Mathematical Proof Generation with Language Models

Neural Theorem Proving

Overviews
- Li et al 2024 - A Survey on Deep Learning for Theorem Proving
Bibliographies
- Deep Learning for Theorem Proving (DL4TP)
Step-by-step Proof Generation
- GPT-f: Polu & Sutskever 2020 - Generative Language Modeling for Automated Theorem Proving Maintains a proof tree with a queue of open goals, and tries to expand the goal with the highest LLM logprob to reach it. Metamath formalization language. Introduces WebMath pre-training.
- Han et al 2021 - Proof Artifact Co-training for Theorem Proving with Language Models Generates the next tactic using an LLM. Uses a best-first search algorithm: maintains a priority queue of search nodes, of tactic state (partial proof) and metadata, sorted by LLM heuristic score. For Lean. Introduces the LeanStep dataset to to predict the next tactic give a goal state, built from mathlib. Uses an interesting PACT pretraining dataset (sec 3.2).
- Jiang et al 2021 - LISA: Language models of ISAbelle proofs
- Polu et al 2022 - Formal Mathematics Statement Curriculum Learning
- Jiang et al 2022 - Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers Uses automated theorem prover premise selection (hammers) to retrieve the relevant premises, and then uses LLMs as in GPT-f (Polu & Sutskever 2020). Increases success rate on PISA from 39.0% to 57.0%.
- Lample et al 2022 - HyperTree Proof Search for Neural Theorem Proving
- Yang et al 2023 - LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
- Wang et al 2023 - DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function
- Wang et al 2023 - LEGO-Prover: Neural Theorem Proving with Growing Libraries
- Thakur et al 2023 - A Language-Agent Approach to Formal Theorem-Proving
- Wang et al 2024 - Proving Theorems Recursively
Construction All-In-One Go
Welleck & Saha 2023 - LLMSTEP: LLM proofstep suggestions in Lean
Li et al 2024 - HunyuanProver: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
Dong & Ma 2025 - STP: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving
Shen et al 2025 - REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

Formal Automated Theorem Proving

Systems

Vampire
- Kovacs & Voronkov 2013 - First-Order Theorem Proving and VAMPIRE Intro to VAMPIRE
- Reger 2016 - Better Proof Output for Vampire
For Mizar dataset
- Kaliszyk et al 2013 - MizAR 40 for Mizar 40 Uses Vampire, Epar, and Z3 solvers.
- Suda 2021 - Vampire With a Brain Is a Good ITP Hammer github
For Lean
- Lean-Step: Han et al 2021 - Proof Artifact Co-training for Theorem Proving with Language Models
HOL Light
- HOList and DeepHOL: Bansal et al 2019 - HOList: An Environment for Machine Learning of Higher-Order Theorem Proving

Datasets

TPTP (Thousands of Problems for Theorem Provers QuickGuide TPTP Format
Mizar
- MPTP
  - Paper: Urban 2008 - Automated Reasoning for Mizar: Artificial Intelligence through Knowledge Exchange
  - Dataset: v2 github v1 github
- MizarTPTP
  - Paper: Urban et al 2007 - Combining Mizar and TPTP Semantic Presentation Tools pdf
  - Dataset: website
Tsoukalas et al 2024 - PUTNAMBENCH: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Formalizing Mathematics

Overviews
- Kaliszyk & Rabe 2020 - A Survey of Languages for Formalizing Mathematics
Mizar
- Rudnicki 1992 - An Overview of the Mizar Project
- Grabowski et al 2010 - Mizar in a Nutshell pdf
- Urban 2006 - XML-izing Mizar: Making Semantic Processing and Presentation of MML Easy See also Byliński 2021. Cited by Urban 2008.
- Byliński et al 2021 - Syntactic-Semantic Form of Mizar Articles Gives an overview and a good historical introduction in the intro.
- Grzegorz Bancerek 2006 - Automatic Translation in Formalized Mathematics

Autoformalization

Overviews
- Szegedy 2019 - A Promising Path Towards Autoformalization and General Artificial Intelligence Great
- Avigad et al 2023 - Mathematics and the Formal Turn Great overview of the reasons to autoformalize mathematics
- Weng et al 2025 - Autoformalization in the Era of Large Language Models: A Survey

Informal Math to Formal Math Parsing

For Mizar dataset
- Kaliszyk et al 2014 - Developing Corpus-based Translation Methods between Informal and Formal Mathematics: Project Description
- Wang et al 2018 - First Experiments with Neural Translation of Informal to Formal Mathematics Automatic translation of informal mathematical statements in latex to formal Mizar statements. Uses a synthetically constructed dataset.
- Wang et al 2020 - Exploration of Neural Machine Translation in Autoformalization of Mathematics in Mizar Automatic translation of informal mathematical statements in latex to formal Mizar statements. Uses a synthetically constructed dataset.
For Isabelle
- Li et al 2021 - IsarStep: a Benchmark for High-level Mathematical Reasoning github Fill in a missing intermediate proposition given surrounding proofs.
- Wu et al 2022 - Autoformalization with Large Language Models

Automatic Premise Selection

Software

VAMPIRE website github
Mizar proof checker Open-source reimplementation of the Mizar proof checker
- Tutorial paper: Kovacs & Voronkov 2013 - First-Order Theorem Proving and VAMPIRE
- CMU slides: Vampire
- Lecture: slides
Mizar (miscellaneous software)
- tptp4mizar: Generating Mizar texts from TPTP problems and solutions
- Generate Latex files from Mizar articles:

Courses and Tutorials

Automated Theorem Proving (in general)
- Interactive Theorem Proving @ Innsbruck
Mizar
- Tutorial Good tutorial, use along with Mizar in a Nutshell and Mizar: An Impression (see also recommended reading at the end of the tutorial)
- Writing a Mizar Article in Nine Easy Steps
- A Brief Overview of Mizar (2009 slides)
- Bonarska 1991 - An Introduction to PC Mizar
- Mizar Bibliography
Lean
- Natural Number Game Intro to Lean (great). Youtube video
- Lean for the Curious Mathematician 2022 2023 (Workshop) Has videos
- Learning Lean
  - Learning Lean 4 - List of places to learn
  - A glimpse of Lean - Fast paced, for mathematicians
  - Mathematics in Lean: pdf online Good place to start for a mathematician
  - Theorem Proving in Lean 4 Lean 3 version - Jeff can't stand this book

Workshops and Conferences

See Lean-related conferences and events

Table of Contents

Automatic Theorem Proving

Overviews

Papers

Neural Theorem Proving

Formal Automated Theorem Proving

Systems

Datasets

Formalizing Mathematics

Autoformalization

Informal Math to Formal Math Parsing

Automatic Premise Selection

Software

Courses and Tutorials

Workshops and Conferences

People

Related Pages