Dynamics and Reachability of Learning Tasks

We compute the transition probability between two learning tasks, and show that it decomposes into two factors. The first depends on the geometry of the loss landscape of a model trained on each task, independent of any particular model used. This is related to an information-theoretic distance function, but is insufficient to predict success in transfer learning, as nearby tasks can be unreachable via fine-tuning. The second factor depends on the ease of traversing the path between two tasks. With this dynamic component, we derive strict lower bounds on the complexity necessary to learn a task starting from the solution to another, which is one of the most common forms of transfer learning.

Publication: arXiv e-prints
Pub Date: October 2018
DOI: 10.48550/arXiv.1810.02440
arXiv: arXiv:1810.02440
Bibcode: 2018arXiv181002440A
Keywords: Computer Science - Machine Learning; Computer Science - Artificial Intelligence; Statistics - Machine Learning
