AFINN-96

A list of English words rated for valence.

Usage

var afinn96 = require( '@stdlib/datasets/afinn-96' );

afinn96()

Returns a list of 1468 unique English words (and phrases) rated for valence. Negative words have a negative valence ([-5,0)). Positive words have a positive valence ((0,5]). Neutral words have a valence of 0.

var words = afinn96();
/* returns
    [
        ['abandon',-2],
        ['abandons',-2],
        ['abandoned',-2],
        ['absentee',-1],
        ['absentees',-1],
        ['aboard',1],
        ['abducted',-2],
        ['abduction',-2],
        ['abductions',-2],
        ['abuse',-3],
        ['abused',-3],
        ...
    ]
*/

Notes

  • The list is an earlier version of AFINN-111.
  • The list contains duplicated words.
  • The list includes misspelled words. Their presence is intentional, as such misspellings frequently occur in social media content.
  • All words are lowercase.
  • Some "words" are phrases; e.g., cashing in, cool stuff.
  • Words may contain apostrophes; e.g., can't stand.
  • Words may contain dashes; e.g., cover-up, made-up.

Examples

var afinn96 = require( '@stdlib/datasets/afinn-96' );

var words;
var dict;
var len;
var i;

words = afinn96();

// Convert to a dictionary...
len = words.length;
dict = {};
for ( i = 0; i < len; i++ ) {
    dict[ words[i][0] ] = words[i][1];
}
console.dir( dict );

CLI

Usage

Usage: afinn-96 [options]

Options:

  -h,    --help                Print this message.
  -V,    --version             Print the package version.
         --format fmt          Output format: 'csv' or 'ndjson'.

Notes

  • The CLI supports two output formats: comma-separated values (CSV) and newline-delimited JSON (NDJSON). The default output format is CSV.

Examples

$ afinn-96
word,valence
abandon,-2
abandons,-2
abandoned,-2
...

References

  • Nielsen, Finn Årup. 2011. "A new ANEW: Evaluation of a word list for sentiment analysis in microblogs." In Proceedings of the ESWC2011 Workshop on 'Making Sense of Microposts': Big things come in small packages., 718:93–98. CEUR Workshop Proceedings. http://ceur-ws.org/Vol-718/paper_16.pdf.
  • Hansen, Lars Kai, Adam Arvidsson, Finn Årup Nielsen, Elanor Colleoni, and Michael Etter. 2011. "Good Friends, Bad News - Affect and Virality in Twitter." In Future Information Technology: 6th International Conference, FutureTech 2011, Loutraki, Greece, June 28-30, 2011, Proceedings, Part II, edited by James J. Park, Laurence T. Yang, and Changhoon Lee, 34–43. Berlin, Heidelberg: Springer Berlin Heidelberg. doi:10.1007/978-3-642-22309-9_5.

License

The data files (databases) are licensed under an Open Data Commons Attribution 1.0 License and their contents are licensed under a Creative Commons Attribution 4.0 International Public License. The original dataset is attributed to Finn Årup Nielsen and can be found here. The software is licensed under Apache License, Version 2.0.