Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

Suspended accounts: A source of Tweets with disgust and anger emotions for augmenting hate speech data sample

Alorainy, Wafa, Burnap, Pete, Liu, Han, Javed, Amir and Williams, Matthew 2018. Suspended accounts: A source of Tweets with disgust and anger emotions for augmenting hate speech data sample. Presented at: International Conference on Machine Learning and Cybernetics, Chengdu, China, 15-18 July 2018.

PDF - Accepted Post-Print Version

Abstract

In this paper we present a proposal to address the problem of costly and unreliable human annotation, on which the detection of hate speech in web content depends. In particular, we propose using the text produced by suspended accounts in the aftermath of a hateful event as a subtle and reliable source for hate speech prediction. The proposal was motivated by an emotion analysis of three data sets: suspended, active and neutral. The first two contain hateful tweets from suspended accounts and active accounts, respectively, whereas the third contains neutral tweets only. The emotion analysis indicated that tweets from suspended accounts show more disgust, negative, fear and sadness emotions than those from active accounts, although tweets from both types of accounts might be annotated as hateful by human annotators. We train two Random Forest classifiers based on the semantic meaning of tweets, one on tweets from suspended accounts and one on tweets from active accounts, and evaluate the prediction accuracy of both on unseen data. The results show that the classifier trained on tweets from suspended accounts outperformed the one trained on tweets from active accounts by 16% in overall F-score.
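
The comparison reported in the abstract rests on scoring two classifiers against the same unseen data with an F-score. The following is a minimal sketch of that scoring step only, in plain Python. The labels and predictions are hypothetical values made up for illustration, not data or results from the paper, and the names `f1_score`, `pred_suspended` and `pred_active` are assumptions. The abstract does not say how the "overall" F-score is aggregated across classes, so this sketch computes F1 for the hateful class alone.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> float:
    """F1 for the `positive` class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels for a shared unseen test set (1 = hateful, 0 = neutral).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]

# Hypothetical predictions from the two classifiers; NOT results from the paper.
pred_suspended = [1, 1, 1, 0, 0, 0, 0, 1]  # model trained on suspended-account tweets
pred_active = [1, 1, 0, 0, 0, 0, 1, 1]     # model trained on active-account tweets

f1_suspended = f1_score(y_true, pred_suspended)
f1_active = f1_score(y_true, pred_active)
gap = f1_suspended - f1_active

print(f"suspended F1={f1_suspended:.2f}, active F1={f1_active:.2f}, gap={gap:.2f}")
```

Both models are scored on the same held-out labels, so the gap reflects the training source rather than a difference in test data.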

Item Type: Conference or Workshop Item (Paper)
Status: In Press
Schools: Social Sciences (Includes Criminology and Education); Computer Science & Informatics
Last Modified: 03 Jul 2018 13:30
URI: http://orca.cf.ac.uk/id/eprint/112920
