Incremental Consistent Updating of Incomplete Databases
2023, arXiv (Cornell University)
https://doi.org/10.48550/ARXIV.2302.06246
Abstract
Efficient consistency maintenance of incomplete and dynamic real-life databases is a mark of quality for subsequent data analysis. In prior work, we tackled the generic problem of database updating in the presence of tuple-generating constraints from a theoretical viewpoint. The current paper addresses the usability of our approach by (a) introducing incremental update routines (instead of the previous from-scratch versions) and (b) removing the restriction that the contents of the database must fit in main memory. In doing so, this paper offers new algorithms and proposes queries and data models, inviting discussion on the representation of incompleteness in databases. We also propose implementations under a graph database model and the traditional relational database model. Our experiments show that computation times are similar overall but differ in some individual steps.
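The abstract does not spell out the update routines, so the following is only a rough illustration of what "incremental" can mean for insertions under tuple-generating constraints, not the paper's algorithm. It is a minimal sketch assuming single-atom constraint bodies, atoms that contain only variables, and a set of constraints for which chasing terminates. The names `Null`, `match`, `satisfied`, and `insert`, as well as the `Prof`/`TeachesAt`/`Univ` example schema, are hypothetical.

```python
from itertools import count


class Null:
    """A marked null: an unknown value that still has an identity."""
    _ids = count(1)

    def __init__(self):
        self.id = next(Null._ids)

    def __repr__(self):
        return f"N{self.id}"


def match(atom, fact, binding):
    """Extend `binding` so that `atom` maps onto `fact`; None if impossible."""
    pred, variables = atom
    fpred, values = fact
    if pred != fpred or len(variables) != len(values):
        return None
    extended = dict(binding)
    for var, val in zip(variables, values):
        if var in extended and extended[var] != val:
            return None
        extended[var] = val
    return extended


def satisfied(head, binding, db):
    """True if some stored fact already witnesses `head` under `binding`."""
    return any(match(head, fact, binding) is not None for fact in db)


def insert(db, fact, constraints):
    """Insert `fact`, then repair only what the new facts trigger."""
    todo = [fact]
    while todo:
        current = todo.pop()
        if current in db:
            continue
        db.add(current)
        for body, head in constraints:
            binding = match(body, current, {})
            if binding is None or satisfied(head, binding, db):
                continue
            # Existentially quantified variables receive fresh marked nulls.
            full = dict(binding)
            for var in head[1]:
                if var not in full:
                    full[var] = Null()
            todo.append((head[0], tuple(full[var] for var in head[1])))
    return db


# Hypothetical constraints:
#   Prof(x)         -> exists u. TeachesAt(x, u)
#   TeachesAt(x, u) -> Univ(u)
constraints = [
    (("Prof", ("x",)), ("TeachesAt", ("x", "u"))),
    (("TeachesAt", ("x", "u")), ("Univ", ("u",))),
]

db = set()
insert(db, ("Prof", ("ann",)), constraints)
insert(db, ("TeachesAt", ("bob", "MIT")), constraints)
```

Each call touches only the facts derived from the inserted tuple, rather than re-checking every constraint against the whole database. Inserting `Prof(ann)` adds a `TeachesAt` fact whose second component is a marked null, since the university is unknown. The sketch ignores deletions, simplification of redundant nulls, and out-of-memory storage, all of which a full approach would have to handle.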