Best feature performance in codeswitched hate speech texts
2019
Abstract
How well can hate speech concept be abstracted in order to inform automatic classification in codeswitched texts by machine learning classifiers? We explore different representations and empirically evaluate their predictiveness using both conventional and deep learning algorithms in identifying hate speech in a ~48k human-annotated dataset that contain mixed languages, a phenomenon common among multilingual speakers. This paper espouses a novel approach to handle this challenge by introducing a hierarchical approach that employs Latent Dirichlet Allocation to generate topic models that feed into another high-level feature set that we acronym PDC. PDC groups similar meaning words in word families during the preprocessing stage for supervised learning models. The high-level PDC features generated are based on Ombui et al, (2019) hate speech annotation framework that is informed by the triangular theory of hate (Stanberg,2003). Results obtained from frequency-based models using the PDC feature on the annotated dataset of ~48k short messages comprising of tweets generated during the 2012 and 2017 Kenyan presidential elections indicate an improvement on classification accuracy in identifying hate speech as compared to the baseline
References (28)
- Al-Hassan, A., & Al-Dosari, H. (2019). Detection of Hate Speech in Social Networks: A Survey on Multilingual Corpus. 6th International Conference on Computer Science and Information Technology.
- Ansaldo, A. I., Marcotte, K., Scherer, L., & Raboyeau, G. (2018). Language therapy and bilingual aphasia: Clinical Implications of psycholinguistic and neuroimaging research. Journal of Neurolinguistics, 21, 539-55. Retrieved from http://ilanguages.org/bilingual.php
- Badjatiya, P., Gupta, S., Gupta, M., & Varma, V. (2017). Deep Learning for Hate Speech Detection in Tweets. 2017 International World Wide Web Conference Committee.
- Burnap, P., & Williams, L. M. (2016). Us and them: identifying cyber hate on twitter across multiple protected characteristics. EPJ Data Science, 1(5).
- Burnap, P., & Williams, M. L. (2015). Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making. Policy & Internet, 2(7), 223-242.
- Chen, Y., Zhou, Y., Zhu, S., & Xu, H. (2012). Detecting offensive language in social media to protect adolescent online safety. The Fourth ASE/IEEE International Conference on Social Computing (SocialCom 2012). Amstadam.
- Coupland, N. (2010). "Other" representation, Society and Language. John Benjamins Publishing.
- Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). AutomatedHateSpeechDetectionandtheProblemofOffensiveLanguage. ICWSM.
- Djuric, N., Zhou, J., Morris, Robin Grbovic, M., Radosavljevic, V., & Bhamidipati, N. (2015). Hate speech detection with comment embeddings. In Proceedings of the 24th InternationalConferenceonWorldWideWeb(WWW2015), 29-30.
- Fortuna, P. (2017). Automatic detection of hate speech in text: an overview of the topic and dataset annotation with hierarchical classes. University of Porto.
- Gitari, D. N., Zuping, Z., Damien, H., & Long, J. (2015). A lexicon-based approach for hate speech detection. Journal of Multimedia and Ubiquitous Engineering, 4(10), 215-230.
- Hasanuzzaman, M., Dias, G., & Way, A. (2017). DemographicWordEmbeddingsforRacismDetectiononTwitter. Proceedings of the The 8th International Joint Conference on Natural Language Processing, 926-936.
- Haslam, N. (2006). Dehumanization: An integrative review. Personality and Social Psychology Review, 10, 252-64.
- Hatzipanagos, R. (2018, November 30). How online hate turns into real-life violence. The Washington Post. Retrieved from https://www.washingtonpost.com/nation/2018/11/30/how- online-hate-speech-is-fueling-real-life-violence/?noredirect=on
- Kenet. (2018). Kenya Education Network. Retrieved September 16, 2016, from https://cert.kenet.or.ke/node/4
- King, R. ., & Sutton, G. M. (2013). High Times for Hate Crime: Explaining the Temporal Clustering of Hate Motivated Offending. Criminology, 51(4), 71-94.
- Kwok, I., & Wang, Y. (2013). Locate the hate: Detecting tweets against blacks. AAAI. Mossie, Z., & Wang, J.-H. (2018). SOCIAL NETWORK HATE SPEECH DETECTION FOR AMHARIC LANGUAGE. COMIT, 41-55.
- Muaka, L. (2011). Language Perceptions and Identity among Kenyan Speakers. Proceedings of the 40th Annual Conference on African Linguistics.
- NCIC. (2019). Functions of the Commission. Retrieved September 16, 2019, from https://cohesion.or.ke/index.php/about-us/functions-of-the-commission
- Razavi, A., Inkpen, D., Uritsky, S., & Matwin, S. (2010). Offensive Language Detection Using Multi-level Classification. Springer, 1627.
- Sandler, R. (2018, July). Facebook has apologized for flagging parts of the Declaration of Independence as hate speech. Business Insider. Retrieved from https://www.businessinsider.com/facebook-declaration-of-independence-hate-speech-2018- 7?r=US&IR=T
- Semin, G. R. (2009). Linguistic Markers of Social Distance and Proximity.
- Sternberg, R., & Sternberg, K. (2008). The Duplex Theory of Hate I: The Triangular Theory of the Structure of Hate. In The Nature of Hate. Cambridge University Press, 51-77. https://doi.org/0.1017/CBO9780511818707.004
- Tausczik, Y. R., & Pennebaker, J. W. (2010). The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods. Journal of Language and Social Psychology, 1(29). https://doi.org/10.1177/0261927X09351676
- Tulkens, S., Hilte, L., Lodewyckx, E., Verhoeven, B., & Daelemans, W. (2016). The Automated Detection of Racist Discourse in Dutch Social Media. CoRR, Abs/1608.08738.
- Warner, W., & Hirschberg, J. (2012). Detecting Hate Speech on the World Wide Web. Language in Social Media (LSM 2012).
- Waseem, Z. (2016). Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter. EMNLP Workshop on NLP and CSS, 138-142.
- Waseem, Z., & Hovy, D. (2016). Hateful symbols or hateful people? predictive features for hate speech detection on twitter. In Proceedings of NAACL-HLT, 88-93.