Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale

doi:10.48550/arXiv.1612.06519

Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale

Iandola, Forrest

In recent years, the research community has discovered that deep neural networks (DNNs) and convolutional neural networks (CNNs) can yield higher accuracy than all previous solutions to a broad array of machine learning problems. To our knowledge, there is no single CNN/DNN architecture that solves all problems optimally. Instead, the "right" CNN/DNN architecture varies depending on the application at hand. CNN/DNNs comprise an enormous design space. Quantitatively, we find that a small region of the CNN design space contains 30 billion different CNN architectures. In this dissertation, we develop a methodology that enables systematic exploration of the design space of CNNs. Our methodology is comprised of the following four themes. 1. Judiciously choosing benchmarks and metrics. 2. Rapidly training CNN models. 3. Defining and describing the CNN design space. 4. Exploring the design space of CNN architectures. Taken together, these four themes comprise an effective methodology for discovering the "right" CNN architectures to meet the needs of practical applications.

Publication:

arXiv e-prints

Pub Date:

December 2016

DOI:

10.48550/arXiv.1612.06519

arXiv:

arXiv:1612.06519

Bibcode:

2016arXiv161206519I

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning;
Computer Science - Neural and Evolutionary Computing

E-Print:

thesis, UC Berkeley (2016)

ADS

Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale

Abstract