NLTK: The Natural Language Toolkit
Abstract
NLTK, the Natural Language Toolkit, is a suite of open source program modules, tutorials and problem sets, providing ready-to-use computational linguistics courseware. NLTK covers symbolic and statistical natural language processing, and is interfaced to annotated corpora. Students augment and replace existing components, learn structured programming by example, and manipulate sophisticated models from the outset.
- Publication: arXiv e-prints
- Pub Date: May 2002
- DOI:
- arXiv: arXiv:cs/0205028
- Bibcode: 2002cs........5028L
- Keywords: Computer Science - Computation and Language; D.2.6; I.2.7; J.5; K.3.2
- E-Print: 8 pages, 1 figure, Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics, Philadelphia, July 2002, Association for Computational Linguistics