ZOQO: Zero-Order Quantized Optimization

doi:10.48550/arXiv.2501.06736

ZOQO: Zero-Order Quantized Optimization

The increasing computational and memory demands in deep learning present significant challenges, especially in resource-constrained environments. We introduce a zero-order quantized optimization (ZOQO) method designed for training models with quantized parameters and operations. Our approach leverages zero-order approximations of the gradient sign and adapts the learning process to maintain the parameters' quantization without the need for full-precision gradient calculations. We demonstrate the effectiveness of ZOQO through experiments in fine-tuning of large language models and black-box adversarial attacks. Despite the limitations of zero-order and quantized operations training, our method achieves competitive performance compared to full-precision methods, highlighting its potential for low-resource environments.

Publication:

arXiv e-prints

Pub Date:

January 2025

DOI:

10.48550/arXiv.2501.06736

arXiv:

arXiv:2501.06736

Bibcode:

2025arXiv250106736B

Keywords:

Computer Science - Machine Learning;
Computer Science - Computation and Language

E-Print:

Accepted to ICASSP 2025

ADS

ZOQO: Zero-Order Quantized Optimization

Abstract