From Business Intelligence to semantic data stream management

2016, Future Generation Computer Systems

https://doi.org/10.1016/J.FUTURE.2015.11.015

Abstract

Highlights

• Evolution of Business Intelligence with the emergence of Big Data technologies.
• New technologies and approaches for the 3Vs (Volume, Velocity and Variety) of Big Data.
• Stream reasoning over Big Data.
• Summarizing data streams (semantic and classic data).
• Semantic data matching in a stream context.
