Analyzing Text Representations by Measuring Task Alignment

doi:10.48550/arXiv.2305.19747

Analyzing Text Representations by Measuring Task Alignment

Textual representations based on pre-trained language models are key, especially in few-shot learning scenarios. What makes a representation good for text classification? Is it due to the geometric properties of the space or because it is well aligned with the task? We hypothesize the second claim. To test it, we develop a task alignment score based on hierarchical clustering that measures alignment at different levels of granularity. Our experiments on text classification validate our hypothesis by showing that task alignment can explain the classification performance of a given representation.

Publication:

arXiv e-prints

Pub Date:

May 2023

DOI:

10.48550/arXiv.2305.19747

arXiv:

arXiv:2305.19747

Bibcode:

2023arXiv230519747G

Keywords:

Computer Science - Computation and Language

E-Print:

arXiv admin note: text overlap with arXiv:2210.05721

NASA/ADS

Analyzing Text Representations by Measuring Task Alignment

Abstract