Text Chunking using Transformation-Based Learning

doi:10.48550/arXiv.cmp-lg/9505040

Text Chunking using Transformation-Based Learning

Eric Brill introduced transformation-based learning and showed that it can do part-of-speech tagging with fairly high accuracy. The same method can be applied at a higher level of textual interpretation for locating chunks in the tagged text, including non-recursive ``baseNP'' chunks. For this purpose, it is convenient to view chunking as a tagging problem by encoding the chunk structure in new tags attached to each word. In automatic tests using Treebank-derived data, this technique achieved recall and precision rates of roughly 92% for baseNP chunks and 88% for somewhat more complex chunks that partition the sentence. Some interesting adaptations to the transformation-based learning approach are also suggested by this application.

Publication:

arXiv e-prints

Pub Date:

May 1995

DOI:

10.48550/arXiv.cmp-lg/9505040

arXiv:

arXiv:cmp-lg/9505040

Bibcode:

1995cmp.lg....5040R

Keywords:

Computer Science - Computation and Language

E-Print:

13 pages, LaTeX2e, 1 included figure

NASA/ADS

Text Chunking using Transformation-Based Learning

Abstract