Active One-shot Learning
Abstract
Recent advances in one-shot learning have produced models that can learn from a handful of labeled examples, for passive classification and regression tasks. This paper combines reinforcement learning with one-shot learning, allowing the model to decide, during classification, which examples are worth labeling. We introduce a classification task in which a stream of images are presented and, on each time step, a decision must be made to either predict a label or pay to receive the correct label. We present a recurrent neural network based action-value function, and demonstrate its ability to learn how and when to request labels. Through the choice of reward function, the model can achieve a higher prediction accuracy than a similar model on a purely supervised task, or trade prediction accuracy for fewer label requests.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2017
- DOI:
- arXiv:
- arXiv:1702.06559
- Bibcode:
- 2017arXiv170206559W
- Keywords:
-
- Computer Science - Machine Learning
- E-Print:
- NIPS 2016, Deep Reinforcement Learning Workshop, Barcelona, Spain. See https://cs.stanford.edu/~woodward/ for the poster and a short video description of the paper