Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources: the EDR and WordNet
Abstract
Lexical resources such as WordNet and the EDR electronic dictionary have been used in several NLP tasks. Probably, partly due to the fact that the EDR is not freely available, WordNet has been used far more often than the EDR. We have used both resources on the same task in order to make a comparison possible. The task is automatic assignment of keywords to multi-party dialogue episodes (i.e. thematically coherent stretches of spoken text). We show that the use of lexical resources in such a task results in slightly higher performances than the use of a purely statistically based method.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2004
- DOI:
- 10.48550/arXiv.cs/0410062
- arXiv:
- arXiv:cs/0410062
- Bibcode:
- 2004cs.......10062V
- Keywords:
-
- Computer Science - Computation and Language;
- Computer Science - Digital Libraries;
- Computer Science - Information Retrieval;
- H.3.1;
- H.3.3;
- I.5.3;
- I.7.3
- E-Print:
- 4 pages