An Integer Programming Approach to Deep Neural Networks with Binary Activation Functions
Abstract
We study deep neural networks with binary activation functions (BDNN), i.e. the activation function only has two states. We show that the BDNN can be reformulated as a mixed-integer linear program which can be solved to global optimality by classical integer programming solvers. Additionally, a heuristic solution algorithm is presented and we study the model under data uncertainty, applying a two-stage robust optimization approach. We implemented our methods on random and real datasets and show that the heuristic version of the BDNN outperforms classical deep neural networks on the Breast Cancer Wisconsin dataset while performing worse on random data.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2020
- DOI:
- 10.48550/arXiv.2007.03326
- arXiv:
- arXiv:2007.03326
- Bibcode:
- 2020arXiv200703326B
- Keywords:
-
- Mathematics - Optimization and Control;
- Computer Science - Machine Learning;
- Computer Science - Neural and Evolutionary Computing
- E-Print:
- Workshop on Beyond first-order methods in ML systems at the 37th International Conference on Machine Learning, Vienna, Austria, 2020