Three Generative, Lexicalised Models for Statistical Parsing

doi:10.48550/arXiv.cmp-lg/9706022

Three Generative, Lexicalised Models for Statistical Parsing

Collins, Michael

In this paper we first propose a new statistical parsing model, which is a generative model of lexicalised context-free grammar. We then extend the model to include a probabilistic treatment of both subcategorisation and wh-movement. Results on Wall Street Journal text show that the parser performs at 88.1/87.5% constituent precision/recall, an average improvement of 2.3% over (Collins 96).

Publication:

arXiv e-prints

Pub Date:

June 1997

DOI:

10.48550/arXiv.cmp-lg/9706022

arXiv:

arXiv:cmp-lg/9706022

Bibcode:

1997cmp.lg....6022C

Keywords:

Computer Science - Computation and Language

E-Print:

8 pages, to appear in Proceedings of ACL/EACL 97.

NASA/ADS

Three Generative, Lexicalised Models for Statistical Parsing

Abstract