Adaptable and Reliable Text Classification using Large Language Models

doi:10.48550/arXiv.2405.10523

Text classification is a fundamental task in Natural Language Processing (NLP), and the advent of Large Language Models (LLMs) has revolutionized the field. This paper introduces an adaptable and reliable text classification paradigm that uses LLMs as its core component. The system simplifies traditional text classification workflows by reducing the need for extensive preprocessing and domain-specific expertise. We evaluated several LLMs, machine learning algorithms, and neural network-based architectures on four diverse datasets. The results show that certain LLMs surpass traditional methods in sentiment analysis, spam SMS detection, and multi-label classification. The system's performance can be further improved through few-shot or fine-tuning strategies, and the fine-tuned model is the top performer across all datasets. Source code and datasets are available in this GitHub repository: https://github.com/yeyimilk/llm-zero-shot-classifiers.
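As a rough illustration of what prompt-based classification with an LLM at the core can look like, the sketch below builds a zero-shot prompt (or a few-shot one when labelled examples are supplied) and maps the model's free-form reply back onto a known label. It is not the authors' implementation; the repository linked above may work quite differently. The names `build_prompt`, `parse_label`, and `classify`, the prompt wording, and the `llm` callable (any function that takes a prompt string and returns the model's reply) are all assumptions made for this example.

```python
from typing import Callable, Optional, Sequence, Tuple


def build_prompt(
    text: str,
    labels: Sequence[str],
    examples: Sequence[Tuple[str, str]] = (),
) -> str:
    """Assemble a zero-shot prompt, or a few-shot one if examples are given.

    The wording is hypothetical, not taken from the paper.
    """
    lines = [
        f"Classify the text into exactly one of: {', '.join(labels)}.",
        "Answer with the label only.",
    ]
    # Few-shot: prepend labelled demonstrations in the same Text/Label format.
    for ex_text, ex_label in examples:
        lines += [f"Text: {ex_text}", f"Label: {ex_label}"]
    lines += [f"Text: {text}", "Label:"]
    return "\n".join(lines)


def parse_label(reply: str, labels: Sequence[str]) -> Optional[str]:
    """Map a free-form model reply onto a known label, or None if none matches."""
    cleaned = reply.strip().lower()
    # Prefer an exact match on the whole reply.
    for label in labels:
        if cleaned == label.lower():
            return label
    # Otherwise take the label mentioned earliest in the reply.
    hits = [(cleaned.find(l.lower()), l) for l in labels if l.lower() in cleaned]
    return min(hits)[1] if hits else None


def classify(
    text: str,
    labels: Sequence[str],
    llm: Callable[[str], str],
    examples: Sequence[Tuple[str, str]] = (),
) -> Optional[str]:
    """Run one classification; `llm` is any prompt -> reply function (assumed)."""
    return parse_label(llm(build_prompt(text, labels, examples)), labels)
```

Because the model is passed in as a plain callable, the same code can be exercised with a stub during testing and with a hosted or local LLM in practice. A reply that matches no label returns `None`, so the caller decides how to handle it.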

Publication: arXiv e-prints

Pub Date: May 2024

DOI: 10.48550/arXiv.2405.10523

arXiv: arXiv:2405.10523

Bibcode: 2024arXiv240510523W

Keywords: Computer Science - Computation and Language

E-Print: ICDM Workshop ARRL 2024
