What to Prioritize? Natural Language Processing for the Development of a Modern Bug Tracking Solution in Hardware Development
Abstract
Managing large numbers of incoming bug reports and finding the most critical issues in hardware development is time consuming, but crucial in order to reduce development costs. In this paper, we present an approach to predict the time to fix, the risk and the complexity of debugging and resolution of a bug report using different supervised machine learning algorithms, namely Random Forest, Naive Bayes, SVM, MLP and XGBoost. Further, we investigate the effect of the application of active learning and we evaluate the impact of different text representation techniques, namely TF-IDF, Word2Vec, Universal Sentence Encoder and XLNet on the model's performance. The evaluation shows that a combination of text embeddings generated through the Universal Sentence Encoder and MLP as classifier outperforms all other methods, and is well suited to predict the risk and complexity of bug tickets.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2021
- DOI:
- 10.48550/arXiv.2109.13825
- arXiv:
- arXiv:2109.13825
- Bibcode:
- 2021arXiv210913825D
- Keywords:
-
- Computer Science - Machine Learning
- E-Print:
- Hawaii International Conference on System Sciences 2022 (HICSS-55)