Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Abstract
Input binarization has shown to be an effective way for network acceleration. However, previous binarization scheme could be regarded as simple pixel-wise thresholding operations (i.e., order-one approximation) and suffers a big accuracy loss. In this paper, we propose a highorder binarization scheme, which achieves more accurate approximation while still possesses the advantage of binary operation. In particular, the proposed scheme recursively performs residual quantization and yields a series of binary input images with decreasing magnitude scales. Accordingly, we propose high-order binary filtering and gradient propagation operations for both forward and backward computations. Theoretical analysis shows approximation error guarantee property of proposed method. Extensive experimental results demonstrate that the proposed scheme yields great recognition accuracy while being accelerated.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2017
- DOI:
- 10.48550/arXiv.1708.08687
- arXiv:
- arXiv:1708.08687
- Bibcode:
- 2017arXiv170808687L
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- 9 pages, 8 figures, Proceeding of IEEE International Conference on Computer Vision 2017