Binary neural networks: A survey
Abstract
Binary neural networks greatly reduce storage and computation costs, making them a promising technique for deploying deep models on resource-limited devices. However, binarization inevitably causes severe information loss, and, even worse, its discontinuity makes the deep network difficult to optimize. To address these issues, a variety of algorithms have been proposed and have achieved satisfactory progress in recent years. In this paper, we present a comprehensive survey of these algorithms, categorized mainly into native solutions that conduct binarization directly and optimized ones that use techniques such as minimizing the quantization error, improving the network loss function, and reducing the gradient error. We also investigate other practical aspects of binary neural networks, such as hardware-friendly design and training tricks. We then evaluate and discuss the algorithms on different tasks, including image classification, object detection, and semantic segmentation. Finally, we look ahead to the challenges that future research may face.
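As a rough illustration of two ideas the abstract names (quantization error and gradient error), the following minimal sketch binarizes a weight vector with the sign function. It is not taken from the survey. The scaling factor `alpha` (the mean absolute weight) and the clipped straight-through gradient are assumptions chosen as common examples of such techniques, and all function names are hypothetical.

```python
def sign(x):
    # Map a real value to {-1, +1}; zero is sent to +1 by convention here.
    return 1.0 if x >= 0 else -1.0


def binarize(weights):
    """Return (alpha, codes) so that alpha * codes approximates weights.

    Assumption: alpha is the mean absolute weight, which minimizes the
    squared error sum((w - alpha * sign(w)) ** 2) over alpha.
    """
    codes = [sign(w) for w in weights]
    alpha = sum(abs(w) for w in weights) / len(weights)
    return alpha, codes


def quantization_error(weights, alpha, codes):
    # Squared L2 distance between the real weights and their binary proxy.
    return sum((w - alpha * b) ** 2 for w, b in zip(weights, codes))


def ste_grad(x, upstream):
    # sign() has zero gradient almost everywhere, so training needs a
    # surrogate. Assumption: a clipped straight-through estimator that
    # passes the upstream gradient where |x| <= 1 and blocks it elsewhere.
    return upstream if abs(x) <= 1.0 else 0.0


weights = [0.5, -1.5, 0.25, -0.75]
alpha, codes = binarize(weights)
scaled_error = quantization_error(weights, alpha, codes)
unscaled_error = quantization_error(weights, 1.0, codes)
```

In this toy case the scaled binary weights sit closer to the originals than plain ±1 codes do, which is the kind of information loss the surveyed quantization-error methods try to reduce. The surrogate gradient stands in for the discontinuity that makes optimization difficult.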
- Publication:
- Pattern Recognition
- Pub Date:
- September 2020
- DOI:
- 10.1016/j.patcog.2020.107281
- arXiv:
- arXiv:2004.03333
- Bibcode:
- 2020PatRe.10507281Q
- Keywords:
- Binary neural network;
- Deep learning;
- Model compression;
- Network quantization;
- Model acceleration;
- Computer Science - Neural and Evolutionary Computing;
- Computer Science - Computer Vision and Pattern Recognition;
- Computer Science - Machine Learning
- E-Print:
- Pattern Recognition (2020) 107281