Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation
Abstract
In recent times, there has been definitive progress in the field of NLP, with its applications growing as the utility of our language models increases with advances in their performance. However, these models require a large amount of computational power and data to train, consequently leading to large carbon footprints. Therefore, it is imperative that we study the carbon efficiency and look for alternatives to reduce the overall environmental impact of training models, in particular large language models. In our work, we assess the performance of models for machine translation, across multiple language pairs to assess the difference in computational power required to train these models for each of these language pairs and examine the various components of these models to analyze aspects of our pipeline that can be optimized to reduce these carbon emissions.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2021
- DOI:
- 10.48550/arXiv.2109.12584
- arXiv:
- arXiv:2109.12584
- Bibcode:
- 2021arXiv210912584Y
- Keywords:
-
- Computer Science - Computation and Language;
- Computer Science - Artificial Intelligence;
- Computer Science - Machine Learning
- E-Print:
- The authors find these results limited as there are no clear, identifiable trends that provide useful information. We need/intend to make the experiments more robust as currently, they are not. The authors do not have access to the required computational sources for this at the moment. We will revisit the optimization of machine translation models using a single language pair at a later point