Academia.eduAcademia.edu

Outline

VERA: A Platform for Veracity Estimation over Web Data

Proc. of the 25th International WWW16 Conference

https://doi.org/10.1145/2872518.2890536

Abstract

Social networks and the Web in general are characterized by multiple information sources often claiming conflicting data values. Data veracity is hard to estimate, especially when there is no prior knowledge about the sources or the claims in time-dependent scenarios (e.g., crisis situation) where initially very few observers can report first information. Despite the wide set of recently proposed truth discovery approaches, " no-one-fits-all " solution emerges for estimating the veracity of on-line information in open contexts. However, analyzing the space of conflicting information and disagreeing sources might be relevant, as well as ensembling multiple truth discovery methods. This demonstration presents VERA, a Web-based platform that supports information extraction from Web textual data and micro-texts from Twitter and estimates data veracity. Given a user query, VERA systematically extracts entities and relations from Web content, structures them as claims relevant to the query and gathers more conflicting/corroborating information. VERA combines multiple truth discovery algorithms through ensembling and returns the veracity label and score of each data value as well as the trustworthiness scores of the sources. VERA will be demonstrated through several real-world scenarios to show its potential value for fact-checking from Web data.

References (16)

  1. REFERENCES
  2. L. Berti-Equille. Data Veracity Estimation with Ensembling Truth Discovery Methods. In IEEE Big Data Workshop on Data Quality Issues in Big Data, 2015.
  3. L. Berti-Equille and J. Borge-Holthoefer. Veracity of Big Data: From Truth Discovery Computation Algorithms to Models of Misinformation Dynamics. Morgan & Claypool, 2015.
  4. S. Cohen, J. T. Hamilton, and F. Turner. Computational Journalism. CACM, 54(10):66-71, 2011.
  5. X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating Conflicting Data: The Role of Source Dependence. PVLDB, 2(1):550-561, 2009.
  6. X. L. Dong, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, K. Murphy, T. Strohmann, S. Sun, and W. Zhang. Knowledge Vault: A Web-scale Approach to Probabilistic Knowledge Fusion. In KDD'14, pages 601-610, 2014.
  7. A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating Information from Disagreeing Views. In WSDM'10, pages 131-140, 2010.
  8. N. Hassan, C. Li, and M. Tremayne. Detecting Check-worthy Factual Claims in Presidential Debates. In CIKM'15, pages 1835-1838, 2015.
  9. M. Imran, C. Castillo, J. Lucas, P. Meier, and S. Vieweg. AIDR: Artificial Intelligence for Disaster Response. In WWW'14, 2014.
  10. J. Pasternack and D. Roth. Latent Credibility Analysis. In WWW'13, pages 1009-1020, 2013.
  11. D. A. Waguih and L. Berti-Equille. Truth Discovery Algorithms: An Experimental Evaluation. CoRR, 1409.6428, 2014.
  12. D. A. Waguih, N. Goel, H. M. Hammady, and L. Berti-Equille. AllegatorTrack: Combining and Reporting Results of Truth Discovery from Multi-Source Data. In ICDE'15, pages 1440-1443, 2015.
  13. D. Wang, L. M. Kaplan, H. K. Le, and T. F. Abdelzaher. On Truth Discovery in Social Sensing: a Maximum Likelihood Estimation Approach. In IPSN'12, pages 233-244, 2012.
  14. X. Yin, J. Han, and P. S. Yu. Truth Discovery with Multiple Conflicting Information Providers on the Web. TKDE, 20(6):796-808, 2008.
  15. D. Yu, H. Huang, T. Cassidy, H. Ji, C. Wang, S. Zhi, J. Han, C. R. Voss, and M. Magdon-Ismail. The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding. In COLING 2014, pages 1567-1578, 2014.
  16. B. Zhao, B. I. P. Rubinstein, J. Gemmell, and J. Han. A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration. PVLDB, 5(6):550-561, 2012.