Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation

doi:10.48550/arXiv.2305.13785

Training or finetuning large-scale language models (LLMs) such as GPT-3 requires substantial computational resources, motivating recent efforts to explore parameter-efficient adaptation to downstream tasks. One practical line of research treats these models as black boxes and interacts with them only through their inference APIs. In this paper, we investigate how to optimize few-shot text classification without accessing the gradients of the LLMs. To achieve this, we treat the black-box model as a feature extractor and train a classifier on augmented text data. Data augmentation is performed using prompt-based finetuning on an auxiliary language model with far fewer parameters than the black-box model. Through extensive experiments on eight text classification datasets, we show that our approach, dubbed BT-Classifier, significantly outperforms state-of-the-art black-box few-shot learners and performs on par with methods that rely on full-model tuning.
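
The pipeline described in the abstract (augment the few-shot data, extract features from a black-box model, train a classifier on those features) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the BT-Classifier implementation: `black_box_features` is a hypothetical stand-in for an inference-API call and returns a bag-of-words vector instead of LLM features, `augment` is a trivial leave-one-word-out substitute for the paper's prompt-based augmentation with an auxiliary language model, and a nearest-centroid classifier is swapped in for whatever classifier the authors train.

```python
import math
from collections import Counter, defaultdict


def black_box_features(text):
    """Hypothetical stand-in for an inference-API call to the black-box LLM.

    Returns an L2-normalised sparse bag-of-words vector; a real system would
    receive embeddings or hidden states from the API instead.
    """
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {tok: c / norm for tok, c in counts.items()}


def augment(text):
    """Trivial stand-in augmenter: the original plus leave-one-word-out copies.

    This is NOT the paper's method, which generates data with a
    prompt-finetuned auxiliary language model.
    """
    words = text.split()
    variants = [text]
    if len(words) > 1:
        variants += [" ".join(words[:i] + words[i + 1:]) for i in range(len(words))]
    return variants


def train_centroids(examples):
    """Fit a nearest-centroid classifier on features of the augmented data."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = Counter()
    for text, label in examples:
        for variant in augment(text):
            # Only forward passes through the "black box" are needed here;
            # no gradients of the feature extractor are ever accessed.
            for tok, weight in black_box_features(variant).items():
                sums[label][tok] += weight
            counts[label] += 1
    return {
        label: {tok: w / counts[label] for tok, w in vec.items()}
        for label, vec in sums.items()
    }


def predict(centroids, text):
    """Return the label whose centroid has the largest dot product with the text."""
    feats = black_box_features(text)

    def score(label):
        return sum(w * centroids[label].get(tok, 0.0) for tok, w in feats.items())

    return max(centroids, key=score)


# Toy few-shot training set (made-up examples, one per class).
few_shot = [("great movie loved it", "pos"), ("terrible film hated it", "neg")]
centroids = train_centroids(few_shot)
prediction = predict(centroids, "loved it great")
```

The point of the sketch is the division of labour: the large model is only queried for features, while all trainable parts (the augmenter and the classifier) live outside it.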

Publication:

arXiv e-prints

Pub Date:

May 2023

DOI:

10.48550/arXiv.2305.13785

arXiv:

arXiv:2305.13785

Bibcode:

2023arXiv230513785L

Keywords:

Computer Science - Computation and Language
