Abstract
On the topic of journalistic integrity, the current state of accurate, impartial news reporting has garnered much debate in context to the 2016 US Presidential Election. In pursuit of computational evaluation of news text, the statements (attributions) ascribed by media outlets to sources provide a common category of evidence on which to operate. In this paper, we develop an approach to compare partisan traits of news text attributions and apply it to characterize differences in statements ascribed to candidate, Hilary Clinton, and incumbent President, Donald Trump. In doing so, we present a model trained on over 600 in-house annotated attributions to identify each candidate with accuracy > 88%. Finally, we discuss insights from its performance for future research.
References (35)
- Agha, A. (1998). Stereotypes and registers of honorific language. Language in Society, 27(2), 151-193.
- Agresti, A. (2013). Categorical data analysis: John Wiley & Sons.
- Baym, G. (2005). The Daily Show: Discursive integration and the reinvention of political journalism. Political communication, 22(3), 259-276.
- Covert, T. J. A., & Wasburn, P. C. (2007). Measuring media bias: A content analysis of Time and Newsweek coverage of domestic social issues, 1975-2000. Social science quarterly, 88(3), 690-706.
- D'Alessio, D., & Allen, M. (2000). Media bias in presidential elections: a meta-analysis. Journal of communication, 50(4), 133-156.
- Esser, F., & Umbricht, A. (2014). The Evolution of Objective and Interpretative Journalism in the Western Press Comparing Six News Systems since the 1960s. Journalism & Mass Communication Quarterly, 91(2), 229-249. doi:10.1177/1077699014527459
- Hovy, E., & Lavid, J. (2010). Towards a 'science'of corpus annotation: a new methodological challenge for corpus linguistics. International journal of translation, 22(1), 13-36.
- Johnson-Cartee, K. S. (2004). News narratives and news framing: Constructing political reality: Rowman & Littlefield Publishers.
- Jullian, P. M. (2011). Appraising through someone else's words: The evaluative power of quotations in news reports. Discourse & Society, 22(6), 766- 780.
- Lauf, A., Valette, M., & Khouas, L. (2013). Analyzing Variation Patterns In Quotes Over Time. Research in Computing Science, 70, 223-232.
- Lee, H., Chang, A., Peirsman, Y., Chambers, N., Surdeanu, M., & Jurafsky, D. (2013). Deterministic coreference resolution based on entity-centric, precision-ranked rules. Computational Linguistics, 39(4), 885-916.
- Mamede, N., & Chaleira, P. (2015). Character identification in children stories. In J. Hirschberg & C. D. Manning (Eds.), Advances in natural language processing (Vol. 349, pp. 82-90).
- Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S., & McClosky, D. (2014). The Stanford CoreNLP natural language processing toolkit. Paper presented at the Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations.
- Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A. (1993). Building a Large Annotated Corpus of English: The Penn Treebank. COMPUTATIONAL LINGUISTICS - ROCHESTER-, 19(2), 313.
- Miltsakaki, E., Prasad, R., Joshi, A. K., & Webber, B. L. (2004). The Penn Discourse Treebank. Paper presented at the LREC.
- Mitra, T., & Gilbert, E. (2015). CREDBANK: A Large- Scale Social Media Corpus With Associated Credibility Annotations. Paper presented at the ICWSM.
- Mohammad, S. (2017). Challenges in Sentiment Analysis. In E. Cambria, D. Das, S. Bandyopadhyay, & A. Feraco (Eds.), A practical guide to sentiment analysis (Vol. 5, pp. 65-66): Springer.
- Montoyo, A., MartíNez-Barco, P., & Balahur, A. (2012). Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments. In: Elsevier.
- Newell, E., Schang, A., Margolin, D., & Ruths, D. (2017). Assessing the Verifiability of Attributions in News Text. Paper presented at the IJCNLP.
- O'Keefe, T., Pareti, S., Curran, J. R., Koprinska, I., & Honnibal, M. (2012). A sequence labelling approach to quote attribution. Paper presented at the Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.
- Pareti, S. (2012). A Database of Attribution Relations. Lrec 2012 -Eighth International Conference on Language Resources and Evaluation, 3213- 3217.
- Pareti, S. (2015). Attribution: a computational approach.
- Pareti, S. (2016). PARC 3.0: A Corpus of Attribution Relations. Paper presented at the LREC.
- Piazza, R. (2009). News is Reporting What was Said. Techniques and Patterns of Attribution. In L. Haarman & L. Lombardo (Eds.), Evaluation and stance in war news : a linguistic analysis of American, British and Italian television news reporting of the 2003 Iraqi war (pp. 170-194). London; New York: Continuum.
- Pouliquen, B., Steinberger, R., & Best, C. (2007). Automatic detection of quotations in multilingual news. Paper presented at the Proceedings of Recent Advances in Natural Language Processing.
- Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., & Webber, B. (2008). The Penn Discourse TreeBank 2.0. Sixth International Conference on Language Resources and Evaluation, Lrec 2008, 2961-2968.
- Qiu, J., Wu, Q., Ding, G., Xu, Y., & Feng, S. (2016). A survey of machine learning for big data processing. EURASIP Journal on Advances in Signal Processing, 2016(1), 67.
- Reich, Z. (2010). Source Credibility as a Journalistic Work Tool. In Journalists, sources, and credibility: New perspectives (pp. 19-36): Routledge.
- Ryan, M. (2001). Journalistic ethics, objectivity, existential journalism, standpoint epistemology, and public journalism. Journal of Mass Media Ethics, 16(1), 3-22.
- Sarmento, L., & Nunes, S. (2009). Automatic extraction of quotes and topics from news feeds. Paper presented at the DSIE'09-4th Doctoral Symposium on Informatics Engineering.
- Soni, S., Mitra, T., Gilbert, E., & Eisenstein, J. (2014). Modeling factuality judgments in social media text. Paper presented at the Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).
- Sundar, S. S. (1998). Effect of source attribution on perception of online news stories. Journalism & Mass Communication Quarterly, 75(1), 55-68.
- Tankard, J. W. (2001). The empirical approach to the study of media framing. Framing public life: Perspectives on media and our understanding of the social world, 95-106.
- Wodak, R., & Fairclough, N. (2013). Critical discourse analysis: Sage London.
- Zhang, J. Y., Black, A. W., & Sproat, R. (2003). Identifying speakers in children's stories for speech synthesis. Paper presented at the INTERSPEECH.