From Business Intelligence to semantic data stream management
2016, Future Generation Computer Systems
https://doi.org/10.1016/J.FUTURE.2015.11.015Abstract
h i g h l i g h t s • Evolution of Business Intelligence with emergence of Big Data technologies. • New technologies and approaches the 3Vs (Volume, Velocity and Variety) of Big data. • Stream reasoning over Big Data. • Summarizing data streams (semantic and classic data). • Semantic data matching in stream context.
References (52)
- Juan Trujillo, Alejandro Maté, Business intelligence 2.0: A general overview, in: Marie-Aude Aufaure, Esteban Zimányi (Eds.), Business Intelligence, in: Lecture Notes in Business Information Processing, vol. 96, Springer, Berlin, Heidelberg, 2012, pp. 98-116.
- Alfred Kobsa, Generic user modeling systems, in: Peter Brusilovsky, Al- fred Kobsa, Wolfgang Nejdl (Eds.), The Adaptive Web, in: Lecture Notes in Computer Science, vol. 4321, Springer, 2007.
- Micheline Elias, Marie-Aude Aufaure, Anastasia Bezerianos, Storytelling in visual analytics tools for business intelligence, in: INTERACT 2013-14th IFIP TC13 Conference on Human-Computer Interaction, in: Lecture Notes in Computer Science, vol. 8119, Springer, Cape Town, South Africa, 2013, pp. 280-297.
- Tim Berners-Lee, James Hendler, Ora Lassila, The semantic web, Sci. Am. 284 (5) (2001) 34-43.
- Pascal Hitzler, Markus Krtzsch, Sebastian Rudolph, Foundations of Semantic Web Technologies, first ed., Chapman & Hall/CRC, 2009.
- Cássio A. Melo, Alexander Mikheev, Bénédicte Le Grand, Marie-Aude Aufaure, Cubix: A visual analytics tool for conceptual and semantic data, in: Jilles Vreeken, Charles Ling, Mohammed Javeed Zaki, Arno Siebes, Jeffrey Xu Yu, Bart Goethals, Geoffrey I. Webb, Xindong Wu (Eds.), ICDM Workshops, IEEE Computer Society, 2012, pp. 894-897.
- Charu Aggarwal (Ed.), Data Streams-Models and Algorithms, Springer, 2007.
- Nesime Tatbul, Uğur Çetintemel, Stan Zdonik, Mitch Cherniack, Michael Stonebraker, Load shedding in a data stream manager, in: Proceedings of the 29th International Conference on Very Large Data Bases-Volume 29, VLDB'03, VLDB Endowment, 2003, pp. 309-320.
- Lukasz Golab, M. Tamer Özsu, Issues in data stream management, SIGMOD Rec. 32 (2) (2003) 5-14.
- Arvind Arasu, Brian Babcock, Shivnath Babu, Mayur Datar, Keith Ito, Rajeev Motwani, Itaru Nishizawa, Utkarsh Srivastava, Dilys Thomas, Rohit Varma, Jennifer Widom, Stream: The stanford stream data manager, IEEE Data Eng. Bull. 26 (1) (2003) 19-26.
- Sirish Chandrasekaran, Owen Cooper, Amol Deshpande, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong, Sailesh Krishnamurthy, Samuel R. Madden, Fred Reiss, Mehul A. Shah, Telegraphcq: Continuous dataflow processing, in: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD'03, ACM, New York, NY, USA, 2003, p. 668.
- Amit Sheth, Cory Henson, Satya S. Sahoo, Semantic sensor web, IEEE Internet Comput. 12 (4) (2008) 78-83.
- Danh Le-Phuoc, Minh Dao-Tran, Josiane Xavier Parreira, Manfred Hauswirth, A native and adaptive approach for unified processing of linked streams and linked data, in: Proceedings of the 10th International Conference on the Semantic Web-Volume Part I, PISWC'11, Springer-Verlag, Berlin, Heidelberg, 2011, pp. 370-388.
- Srdjan Komazec, Davide Cerri, Dieter Fensel, Sparkwave: continuous schema- enhanced pattern matching over RDF data streams, in: Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems, DEBS'12, ACM, New York, NY, USA, 2012, pp. 58-68.
- Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Michael Grossniklaus, C-sparql: Sparql for continuous querying, in: Proceedings of the 18th International Conference on World Wide Web, ACM, 2009, pp. 1061-1062.
- Edith Cohen, Graham Cormode, Nick Duffield, Structure-aware sampling on data streams, in: Proceedings of the ACM SIGMETRICS Joint International Conference on Measurement and Modeling of Computer Systems, ACM, 2011, pp. 197-208.
- Paul G. Brown, Peter J. Haas, Techniques for warehousing of sample data, in: Ling Liu, Andreas Reuter, Kyu-Young Whang, Jianjun Zhang (Eds.), ICDE, IEEE Computer Society, 2006, p. 6.
- Jeffrey S. Vitter, Random sampling with a reservoir, ACM Trans. Math. Software 11 (1) (1985) 37-57.
- Brian Babcock, Mayur Datar, Rajeev Motwani, Sampling from a moving win- dow over streaming data, in: Proceedings of the Thirteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA'02, Society for Industrial and Ap- plied Mathematics, Philadelphia, PA, USA, 2002, pp. 633-634.
- Phillip B. Gibbons, Yossi Matias, New sampling-based summary statistics for improving approximate query answers, in: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, SIGMOD'98, ACM, New York, NY, USA, 1998, pp. 331-342.
- Ankur Jain, Edward Y. Chang, Adaptive sampling for sensor networks, in: Proceeedings of the 1st International Workshop on Data Management for Sensor Networks: In Conjunction with VLDB 2004, DMSN'04, ACM, New York, NY, USA, 2004, pp. 10-16.
- A.D. Marbini, L.E. Sacks, Adaptive sampling mechanisms in sensor networks. 2003.
- Chong Liu, Kui Wu, Min Tsao, Energy efficient information collection with the arima model in wireless sensor networks, in: GLOBECOM, IEEE, 2005, p. 5.
- Graham Cormode, Minos N. Garofalakis, Approximate continuous querying over distributed streams, ACM Trans. Database Syst. 33 (2) (2008).
- Rebecca Willett, Aline Martin, Robert Nowak, Backcasting: Adaptive sampling for sensor networks, in: Proceedings of the 3rd International Symposium on Information Processing in Sensor Networks, IPSN'04, ACM, New York, NY, USA, 2004, pp. 124-133.
- Naman Jain, Manuel Pozo, Raja Chiky, Zakia Kazi-Aoul, Sampling semantic data stream: Resolving overload and limited storage issues, in: DaEng, 2013, pp. 41-48.
- Norberto Fernández, Jesús Arias, Luis Sánchez, Damaris Fuentes-Lorenzo, Óscar Corcho, RDSZ: An approach for lossless RDF stream compression, in: The Semantic Web: Trends and Challenges, Springer, 2014, pp. 52-67.
- Peter Deutsch, Jean-Loup Gailly, Zlib compressed data format specification version 3.3. Technical report, 1996.
- Javier D. Fernández, Alejandro Llaves, Oscar Corcho, Efficient RDF interchange (eri) format for RDF data streams, in: The Semantic Web-ISWC 2014, Springer, 2014, pp. 244-259.
- John Schneider, Takuki Kamiya, D. Peintner, R. Kyusakov, Efficient xml interchange (exi) format 1.0. W3C Proposed Recommendation, 20, 2011.
- Michael Hayes, Miriam A.M. Capretz, Contextual anomaly detection frame- work for big sensor data, J. Big Data 2 (1) (2015).
- Felix Naumann, Melanie Herschel, An Introduction to Duplicate Detection, Morgan and Claypool Publishers, 2010.
- Khai Nguyen, Ryutaro Ichise, Bac Le, SLINT: a schema-independent linked data interlinking system, in: 7th International Workshop on Ontology Matching, Boston, USA, 2012.
- Juanzi Li, Zhichun Wang, Xiao Zhang, Jie Tang, Large scale instance matching via multiple indexes and candidate selection, J. Knowl.-Based Syst. 50 (2013) 112-120.
- Rohan Baxter, Peter Christen, Tim Churches, A comparison of fast blocking methods for record linkage, in: ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation, 2003.
- Robert Isele, Anja Jentzsch, Christian Bizer, Efficient multidimensional blocking for link discovery without losing recall, in: 14th International Workshop on the Web and Databases, Athens, Greece, 2011.
- Axel-Cyrille Ngonga Ngomo, Lars Kolb, Norman Heino, Michael Hartung, Sören Auer, Erhard Rahm, When to reach for the cloud: Using parallel hardware for link discovery, in: 10th Extended Semantic Web Conference, ESWC, Montpellier, France, 2013.
- Zhengrong Yao, Like Gao, Xiaoyang Sean Wang, Using triangle inequality to efficiently process continuous queries on high-dimensional streaming time series, in: 15th International Conference on Scientific and Statistical Database Management, Cambridge, MA, USA, 2003.
- Xing Niu, Shu Rong, Yunlong Zhang, Haofen Wang, Zhishi.links results for OAEI 2011, in: 6th International Workshop on Ontology Matching, Bonn, Germany, 2011.
- Stanley Hillner, Axel-Cyrille Ngonga Ngomo, Parallelizing LIMES for large- scale link discovery, in: 7th International Conference on Semantic Systems, Graz, Austria, 2011.
- Houda Khrouf, Vuk Milicic, Raphaël Troncy, Mining events connections on the social web: Real-time instance matching and data analysis in eventmedia, J. Web Sem. 24 (2014) 3-10.
- Jennifer Sleeman, Online unsupervised coreference resolution for semi- structured heterogeneous data, in: Doctoral Consortium, 11th International Semantic Web Conference, Boston, USA, 2012.
- Olivier Curé, Blin Guillaume (Eds.), RDF Database Systems: Triples Storage and SPARQL Query Processing, first ed., Morgan Kaufmann, Boston, MA, USA, 2015.
- Michael Stonebraker, Ugur Çetintemel, Stanley B. Zdonik, The 8 requirements of real-time stream processing, SIGMOD Rec. 34 (4) (2005) 42-47.
- Franz Baader, Diego Calvanese, Deborah L. McGuinness, Daniele Nardi, Peter F. Patel-Schneider (Eds.), The Description Logic Handbook: Theory, Implementation, and Applications, Cambridge University Press, New York, NY, USA, 2003.
- Emanuele Della Valle, Stefan Schlobach, Markus Krötzsch, Alessandro Bozzon, Stefano Ceri, Ian Horrocks, Order matters! harnessing a world of orderings for reasoning over massive data, Semant. Web 4 (2) (2013) 219-231.
- Davide Francesco Barbieri, Daniele Braga, Stefano Ceri, Emanuele Della Valle, Michael Grossniklaus, Incremental reasoning on streams and rich background knowledge, in: The Semantic Web: Research and Applications, 7th Extended Semantic Web Conference, ESWC 2010, Heraklion, Crete, Greece, May 30-June 3, 2010, Proceedings, Part I, 2010, pp. 1-15.
- Darko Anicic, Paul Fodor, Sebastian Rudolph, Nenad Stojanovic, EP-SPARQL: a unified language for event processing and stream reasoning, in: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, Hyderabad, India, March 28-April 1, 2011, 2011, pp. 635-644.
- Jacopo Urbani, Spyros Kotoulas, Jason Maassen, Frank van Harmelen, Henri E. Bal, Owl reasoning with webpie: Calculating the closure of 100 billion triples, in: ESWC (1), 2010, pp. 213-227.
- Aidan Hogan, Jeff Z. Pan, Axel Polleres, Stefan Decker, Saor: Template rule optimisations for distributed reasoning over 1 billion linked data triples, in: International Semantic Web Conference (1), 2010, pp. 337-353.
- Jeffrey Dean, Sanjay Ghemawat, Mapreduce: Simplified data processing on large clusters, in: OSDI, 2004, pp. 137-150.
- Jesper Hoeksema, Spyros Kotoulas, High-performance distributed stream reasoning using s4, in: Ordring Workshop at ISWC, 2011.