Deep Networks with Adaptive Nyström Approximation

doi:10.48550/arXiv.1911.13036

Deep Networks with Adaptive Nyström Approximation

Recent work has focused on combining kernel methods and deep learning to exploit the best of the two approaches. Here, we introduce a new architecture of neural networks in which we replace the top dense layers of standard convolutional architectures with an approximation of a kernel function by relying on the Nystr{ö}m approximation. Our approach is easy and highly flexible. It is compatible with any kernel function and it allows exploiting multiple kernels. We show that our architecture has the same performance than standard architecture on datasets like SVHN and CIFAR100. One benefit of the method lies in its limited number of learnable parameters which makes it particularly suited for small training set sizes, e.g. from 5 to 20 samples per class.

Publication:

arXiv e-prints

Pub Date:

November 2019

DOI:

10.48550/arXiv.1911.13036

arXiv:

arXiv:1911.13036

Bibcode:

2019arXiv191113036G

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

IJCNN 2019 - International Joint Conference on Neural Networks, Jul 2019, Budapest, Hungary

NASA/ADS

Deep Networks with Adaptive Nyström Approximation

Abstract