Vietnamese Named Entity Recognition using Token Regular Expressions and Bidirectional Inference
Abstract
This paper describes an efficient approach to improve the accuracy of a named entity recognition system for Vietnamese. The approach combines regular expressions over tokens and a bidirectional inference method in a sequence labelling model. The proposed method achieves an overall $F_1$ score of 89.66% on a test set of an evaluation campaign, organized in late 2016 by the Vietnamese Language and Speech Processing (VLSP) community.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2016
- DOI:
- 10.48550/arXiv.1610.05652
- arXiv:
- arXiv:1610.05652
- Bibcode:
- 2016arXiv161005652L
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- Submitted to the VLSP Workshop 2016