Three Generative, Lexicalised Models for Statistical Parsing
Abstract
In this paper we first propose a new statistical parsing model, which is a generative model of lexicalised context-free grammar. We then extend the model to include a probabilistic treatment of both subcategorisation and wh-movement. Results on Wall Street Journal text show that the parser performs at 88.1/87.5% constituent precision/recall, an average improvement of 2.3% over (Collins 96).
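The precision/recall figures quoted above are constituent-level scores. As a minimal sketch of how such scores can be computed, assuming the usual labelled-bracket definition (a predicted constituent is correct when its label and span both match a gold constituent), the following may help. It is not the paper's evaluation code; the function name `labelled_precision_recall` and the toy trees are hypothetical.

```python
from collections import Counter


def labelled_precision_recall(gold, predicted):
    """Compute labelled constituent precision and recall.

    Both arguments are iterables of (label, start, end) tuples,
    one tuple per constituent in the tree.
    """
    gold_counts = Counter(gold)
    pred_counts = Counter(predicted)
    # Multiset intersection: a predicted constituent is counted as correct
    # at most as many times as it occurs in the gold tree.
    matched = sum((gold_counts & pred_counts).values())
    n_pred = sum(pred_counts.values())
    n_gold = sum(gold_counts.values())
    precision = matched / n_pred if n_pred else 0.0
    recall = matched / n_gold if n_gold else 0.0
    return precision, recall


# Toy example (invented data): the parser mislabels one constituent.
gold = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("NP", 3, 5)]
pred = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("PP", 3, 5)]
precision, recall = labelled_precision_recall(gold, pred)
```

Here three of the four predicted constituents match the gold tree, so both scores come out at 0.75. Corpus-level figures would normally pool the matched, predicted, and gold counts over all sentences before dividing.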
- Publication: arXiv e-prints
- Pub Date: June 1997
- DOI: 10.48550/arXiv.cmp-lg/9706022
- arXiv: arXiv:cmp-lg/9706022
- Bibcode: 1997cmp.lg....6022C
- Keywords: Computer Science - Computation and Language
- E-Print: 8 pages, to appear in Proceedings of ACL/EACL 97.