Learning definable hypotheses on trees

doi:10.48550/arXiv.1909.10994

Learning definable hypotheses on trees

We study the problem of learning properties of nodes in tree structures. Those properties are specified by logical formulas, such as formulas from first-order or monadic second-order logic. We think of the tree as a database encoding a large dataset and therefore aim for learning algorithms which depend at most sublinearly on the size of the tree. We present a learning algorithm for quantifier-free formulas where the running time only depends polynomially on the number of training examples, but not on the size of the background structure. By a previous result on strings we know that for general first-order or monadic second-order (MSO) formulas a sublinear running time cannot be achieved. However, we show that by building an index on the tree in a linear time preprocessing phase, we can achieve a learning algorithm for MSO formulas with a logarithmic learning phase.

Publication:

arXiv e-prints

Pub Date:

September 2019

DOI:

10.48550/arXiv.1909.10994

arXiv:

arXiv:1909.10994

Bibcode:

2019arXiv190910994G

Keywords:

Computer Science - Logic in Computer Science;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

Full version of ICDT 2019 paper

NASA/ADS

Learning definable hypotheses on trees

Abstract