Academia.eduAcademia.edu

Outline

Machine Learning and Affect Analysis Against Cyber-Bullying

2010, Proc. of the Ling. and Cogn. Approaches to Dialog Agents Symposium at AISB

Abstract

Online security has been an important issue for several years. One of the burning online security problems lately in Japan has been online slandering and bullying, which appear on unofficial Web sites. The problem has been becoming especially urgent on unofficial Web sites of Japanese schools. School personnel and members of Parent-Teacher Association (PTA) have started Online Patrol to spot Web sites and blogs containing such inappropriate contents. However, countless number of such data makes the job an uphill task. This paper presents a research aiming to develop a systematic approach to Online Patrol by automatically spotting suspicious entries and reporting them to PTA members and therefore help them do their job. We present some of the first results of analysis of the inappropriate data collected from unofficial school Web sites. The analysis is performed firstly with an SVM based machine learning method to detect the inappropriate entries. After analysis of the results we perform another analysis of the data, using an affect analysis system to find out how the machine learning model could be improved.

References (27)

  1. L. Robinson. Debating the events of September 11th: Discursive and in- teractional dynamics in three online fora. Journal of Computer-Mediated Communication, 10(4), article 4 (2005).
  2. L. Leets. Responses to Internet hate sites: Is speech too free in cy- berspace?", Comm. Law and Policy, vol. 6(2), pp. 287-317 (2001).
  3. H. Chen, J. Qin, E. Reid,W. Chung,Y. Zhou,W. Xi, G. Lai, T. Elhourani, A. Bonillas, F.-Y. Wang, and M. Sageman. The dark web portal: Collect- ing and analyzing the presence of domestic and international terrorist groups on the web,In: Proc. 7th IEEE Int. Conf. Intelligent Transporta- tion Systems, Washington, DC, pp. 106-111 (2004).
  4. A. Abbasi, and H. Chen. Affect Intensity Analysis of Dark Web Forums,In: IEEE Intelligence and Security Informatics, pp. 282-288 (2007).
  5. G. A. Gerstenfeld, D. R.Grant, C. P. Chiang. Hate online: A content anal- ysis of extremist Internet sites", Anal. of Soc. Issues and Pub. Policy, vol. 3(1), pp. 29-44 (2003).
  6. Ministry of Education, Culture, Sports, Science and Technology. 'Netto jou no ijime' ni kansuru taiou manyuaru jirei shuu (gakkou, kyouin muke) ["Bullying on the net" Manual for handling and the collection of cases (directed to school teachers)] (in Japanese). Ministry of Education, Culture, Sports, Science and Technology (2008).
  7. B. Belsey. Cyberbullying: An Emerging Threat for the "Always On" Generation, http://www.cyberbullying.ca/pdf/Cyberbullying Presentati on Description.pdf
  8. J. W. Patchin & S. Hinduja. Bullies move beyond the schoolyard: A pre- liminary look at cyberbullying. Youth Violence and Juvenile Justice, 4(2), 148-169 (2006).
  9. S. Hinduja, & J. W. Patchin. Bullying beyond the schoolyard: Prevent- ing and responding to cyberbullying. Thousand Oaks, CA: Corwin Press (2009).
  10. H. Watanabe, W. Sunayama. Denshi keijiban ni okeru yuuza no seishitsu no hyouka [User's nature evalution on Bulletin Board System] (in Japanese). IEICE Technical Report, 105(652), 2006-KBSE, pp. 25-30 (2006).
  11. V. I. Levenshtein. Binary Code Capable of Correcting Deletions, Inser- tions and Reversals. Doklady Akademii Nauk SSSR, Vol. 163, No. 4, pp. 845-848 (1965).
  12. H. Minoru, E. Hirohide. Nihongo OCR bun ni okeru eiji, katakana no superu ayamari teisei-hou [Spelling Correction Method for English and Katakana in Japanese OCR Text] (in Japanese). Transactions of Infor- mation Processing Society of Japan, 38(7), pp. 1317-1327 (1997).
  13. K. Ryozo, H. Koudai, S. Tatsuya. ProductionRule wo mochiita shisutemu hyougen to koshou shindan e no ouyou [Modeling and Fault Diagnosis of Controlled Plant based on Production Rule] (in Japanese). The Robotics and Mechatronics Conference 2005, p. 16 (2005).
  14. V. Vapnik. Statistical Learning Theory, Springer (1998).
  15. T. Hirotoshi, M. Takafumi, H. Masahiko. Support Vector Machine ni yoru tekisuto bunrui [Text Categorization Using Support Vector Ma- chines] (in Japanese), IPSJ SIG Notes, 98(99), pp.173-180 (1998).
  16. M. Ptaszynski, P. Dybala, R. Rzepka and K. Araki. Affecting Corpora: Experiments with Automatic Affect Annotation System -A Case Study of the 2channel Forum -', In Proceedings of The Conference of the Pa- cific Association for Computational Linguistics 2009 (PACLING-09), pp. 223-228 (2009).
  17. T. Kudo, 'MeCab: Yet Another Part-of-Speech and Morphological Ana- lyzer', 2001. http://mecab.sourceforge.net/
  18. M. Ptaszynski, P. Dybala, R. Rzepka and K. Araki. Towards Fully Auto- matic Emoticon Analysis System (ˆoˆ), In Proceedings of The Fifteenth Annual Meeting of The Association for Natural Language Processing (NLP-2010), Tokyo (2010).
  19. R. L. Birdwhistell, Introduction to kinesics: an annotation system for analysis of body motion and gesture, Univ. of Kentucky Press (1952).
  20. R. L. Birdwhistell, Kinesics and Context, University of Pennsylvania Press, Philadelphia (1970).
  21. M. Ptaszynski. Boisterous language. Analysis of structures and semi- otic functions of emotive expressions in conversation on Japanese Inter- net bulletin board forum -2channel -. (in Japanese). M.A. Dissertation, UAM, Poznan (2006).
  22. A. Nakamura. Kanjo hyogen jiten [Dictionary of Emotive Expressions] (in Japanese). Tokyodo Publishing, Tokyo (1993).
  23. A. Zaenen and L. Polanyi. Contextual Valence Shifters. In Computing At- titude and Affect in Text, J. G. Shanahan, Y. Qu, J. Wiebe (eds.), Springer Verlag, Dordrecht, The Netherlands, pp. 1-10 (2006).
  24. J. A. Russell. A circumplex model of affect. J. of Personality and Social Psychology, 39(6):1161-1178 (1980).
  25. M. Ptaszynski, P. Dybala, W. Shi, R. Rzepka and K. Araki. Towards Context Aware Emotional Intelligence in Machines: Computing Con- textual Appropriateness of Affective States. In Proceedings of Twenty- first International Joint Conference on Artificial Intelligence (IJCAI-09), Pasadena, California, USA, pp. 1469-1474 (2009).
  26. M. Ptaszynski, P. Dybala, W. Shi, R. Rzepka and K. Araki. Contextual Affect Analysis: A System for Verification of Emotion Appropriateness Supported with Contextual Valence Shifters. International Journal of Biometrics, 2(2), pp. 134-154 (2010).
  27. Proceedings of the Linguistic And Cognitive Approaches To Dialog Agents Symposium, Rafal Rzepka (Ed.), at the AISB 2010 convention, 29 March -1 April 2010, De Montfort University, Leicester, UK