Towards Finding Longer Proofs
Abstract
We present a reinforcement learning (RL) based guidance system for automated theorem proving geared towards Finding Longer Proofs (FLoP). Unlike most learning based approaches, we focus on generalising from very little training data and achieving near complete confidence. We use several simple, structured datasets with very long proofs to show that FLoP can successfully generalise a single training proof to a large class of related problems. On these benchmarks, FLoP is competitive with strong theorem provers despite using very limited search, due to its ability to solve problems that are prohibitively long for other systems.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2019
- DOI:
- 10.48550/arXiv.1905.13100
- arXiv:
- arXiv:1905.13100
- Bibcode:
- 2019arXiv190513100Z
- Keywords:
-
- Computer Science - Logic in Computer Science;
- Computer Science - Artificial Intelligence;
- Computer Science - Machine Learning
- E-Print:
- 16 pages, 3 figures, published at TABLEAUX2021