Optimizing CNN Using HPC Tools
Abstract
This paper optimizes the training of Convolutional Neural Network (CNN) models using high-performance computing (HPC) technologies. It uses multi-core processors, GPUs, and parallel computing frameworks such as OpenMPI and CUDA to speed up CNN model training. The authors report that the approach reduces training time and outperforms alternative strategies. The study demonstrates how HPC technologies can refine the CNN method, resulting in faster and more accurate training of large-scale CNN models.
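The abstract does not say how the work is divided among cores or GPU threads. As a rough illustration only, and not the paper's method, the sketch below splits the output rows of a single 2D convolution across a pool of workers, which is the kind of decomposition that multi-core and GPU implementations commonly exploit. The function names `conv2d_rows` and `conv2d_parallel` and the `workers` parameter are hypothetical. It uses Python threads for portability, so it shows the partitioning rather than a real speedup; an actual implementation would hand each chunk to a process, an MPI rank, or a CUDA thread block.

```python
from concurrent.futures import ThreadPoolExecutor


def conv2d_rows(image, kernel, row_start, row_end):
    """Compute output rows [row_start, row_end) of a 'valid' 2D
    cross-correlation (the operation CNN layers call convolution)."""
    kh, kw = len(kernel), len(kernel[0])
    out_w = len(image[0]) - kw + 1
    rows = []
    for i in range(row_start, row_end):
        row = []
        for j in range(out_w):
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        rows.append(row)
    return rows


def conv2d_parallel(image, kernel, workers=4):
    """Hypothetical helper: split the output rows into contiguous chunks
    and compute each chunk on a separate worker. Chunks are independent,
    so no synchronization is needed until the results are concatenated."""
    out_h = len(image) - len(kernel) + 1
    chunk = max(1, (out_h + workers - 1) // workers)
    ranges = [(s, min(s + chunk, out_h)) for s in range(0, out_h, chunk)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda r: conv2d_rows(image, kernel, *r), ranges)
        # map() preserves input order, so rows come back in sequence.
        return [row for part in parts for row in part]
```

Because each output row depends only on the input image and the kernel, the chunks can be computed in any order, which is what makes this operation a natural fit for the parallel hardware the abstract mentions.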
- Publication: arXiv e-prints
- Pub Date: March 2024
- DOI: 10.48550/arXiv.2403.04870
- arXiv: arXiv:2403.04870
- Bibcode: 2024arXiv240304870R
- Keywords: Computer Science - Distributed, Parallel, and Cluster Computing
- E-Print: 5 pages