AFINN-96
Usage
var afinn96 = require( '@stdlib/datasets/afinn-96' );
afinn96()
Returns a list of 1468
unique English words (and phrases) rated for valence. Negative words have a negative valence ([-5,0)
). Positive words have a positive valence ((0,5]
). Neutral words have a valence of 0
.
var words = afinn96();
/* returns
[
['abandon',-2],
['abandons',-2],
['abandoned',-2],
['absentee',-1],
['absentees',-1],
['aboard',1],
['abducted',-2],
['abduction',-2],
['abductions',-2],
['abuse',-3],
['abused',-3],
...
]
*/
Notes
- The list is an earlier version of AFINN-111.
- The list contains duplicated words.
- The list includes misspelled words. Their presence is intentional, as such misspellings frequently occur in social media content.
- All words are lowercase.
- Some "words" are phrases; e.g.,
cashing in
,cool stuff
. - Words may contain apostrophes; e.g.,
can't stand
. - Words may contain dashes; e.g.,
cover-up
,made-up
.
Examples
var afinn96 = require( '@stdlib/datasets/afinn-96' );
var words;
var dict;
var len;
var i;
words = afinn96();
// Convert to a dictionary...
len = words.length;
dict = {};
for ( i = 0; i < len; i++ ) {
dict[ words[i][0] ] = words[i][1];
}
console.dir( dict );
CLI
Usage
Usage: afinn-96 [options]
Options:
-h, --help Print this message.
-V, --version Print the package version.
--format fmt Output format: 'csv' or 'ndjson'.
Notes
Examples
$ afinn-96
word,valence
abandon,-2
abandons,-2
abandoned,-2
...
References
- Nielsen, Finn Årup. 2011. "A new ANEW: Evaluation of a word list for sentiment analysis in microblogs." In Proceedings of the ESWC2011 Workshop on 'Making Sense of Microposts': Big things come in small packages., 718:93–98. CEUR Workshop Proceedings. http://ceur-ws.org/Vol-718/paper_16.pdf.
- Hansen, Lars Kai, Adam Arvidsson, Finn Årup Nielsen, Elanor Colleoni, and Michael Etter. 2011. "Good Friends, Bad News - Affect and Virality in Twitter." In Future Information Technology: 6th International Conference, FutureTech 2011, Loutraki, Greece, June 28-30, 2011, Proceedings, Part II, edited by James J. Park, Laurence T. Yang, and Changhoon Lee, 34–43. Berlin, Heidelberg: Springer Berlin Heidelberg. doi:10.1007/978-3-642-22309-9_5.
License
The data files (databases) are licensed under an Open Data Commons Attribution 1.0 License and their contents are licensed under a Creative Commons Attribution 4.0 International Public License. The original dataset is attributed to Finn Årup Nielsen and can be found here. The software is licensed under Apache License, Version 2.0.