A new ANEW: Evaluation of a word list for sentiment analysis in microblogs
Abstract
Sentiment analysis of microblogs such as Twitter has recently gained a fair amount of attention. One of the simplest sentiment analysis approaches compares the words of a posting against a labeled word list, where each word has been scored for valence, -- a 'sentiment lexicon' or 'affective word lists'. There exist several affective word lists, e.g., ANEW (Affective Norms for English Words) developed before the advent of microblogging and sentiment analysis. I wanted to examine how well ANEW and other word lists performs for the detection of sentiment strength in microblog posts in comparison with a new word list specifically constructed for microblogs. I used manually labeled postings from Twitter scored for sentiment. Using a simple word matching I show that the new word list may perform better than ANEW, though not as good as the more elaborate approach found in SentiStrength.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2011
- DOI:
- 10.48550/arXiv.1103.2903
- arXiv:
- arXiv:1103.2903
- Bibcode:
- 2011arXiv1103.2903A
- Keywords:
-
- Computer Science - Information Retrieval;
- Computer Science - Computation and Language;
- 68M11;
- H.4.3;
- J.4
- E-Print:
- 6 pages, 4 figures, 1 table, Submitted to "Making Sense of Microposts (#MSM2011)"