Academia.eduAcademia.edu

Outline

Put a Tree Pattern in Your Algebra

2007, 2007 IEEE 23rd International Conference on Data Engineering

https://doi.org/10.1109/ICDE.2007.367870

Abstract

To address the needs of data intensive XML applications, a number of efficient tree pattern algorithms have been proposed. Still, most XQuery compilers do not support those algorithms. This is due in part to the lack of support for tree patterns in XML algebras, but also because deciding which part of a query plan should be evaluated as a tree pattern is a hard problem. In this paper, we extend a tuple algebra for XQuery with a tree pattern operator, and present rewritings suitable to introduce that operator in query plans. We demonstrate the robustness of the proposed rewritings under syntactic variations commonly found in queries. The proposed tree pattern operator can be implemented using popular algorithms such as Twig joins and Staircase joins. Our experiments yield useful information to decide which algorithm should be used in a given plan.

References (30)

  1. $input/desc::t01[child::t02[child::t03[child::
  2. $input/desc::t01[desc::t02[desc::t03[desc::
  3. K. Beyer, R. Cochrane, V. Josifovski, J. Kleewein, G. Lapis, G. Lohman, B. Lyle, F. Özcan, H. Pirahesh, N. Seemann, T. Truong, B. V. der Linden, B. Vickery, and C. Zhang. Sys- tem RX: one part relational, one part XML. In SIGMOD, pages 347-358, 2005.
  4. S. Boag, D. Chamberlin, M. F. Fernandez, D. Florescu, J. Robie, and J. Simeon. XQuery 1.0: An XML query lan- guage. W3C Candidate Recommendation, June 2006.
  5. M. Brantner, S. Helmer, C.-C. Kanne, and G. Moerkotte. Full-fledged algebraic xpath processing in natix. In ICDE, pages 705-716, 2005.
  6. N. Bruno, N. Koudas, and D. Srivastava. Holistic twig joins: optimal xml pattern matching. In SIGMOD, pages 310-321, 2002.
  7. T. Chen, T. W. Ling, and C. Y. Chan. Prefix path stream- ing: A new clustering method for optimal holistic XML twig pattern matching. In DEXA, pages 801-810, 2004.
  8. Z. Chen, H. V. Jagadish, L. V. S. Lakshmanan, and S. Pa- parizos. From tree patterns to generalized tree patterns: On efficient evaluation of XQuery. In VLDB, pages 237-248, 2003.
  9. S.-Y. Chien, Z. Vagena, D. Zhang, V. J. Tsotras, and C. Zan- iolo. Efficient structural joins on indexed xml documents. In VLDB, pages 263-274, 2002.
  10. B. Choi, M. Mahoui, and D. Wood. On the optimality of holistic algorithms for twig queries. In DEXA, pages 28-37, 2003.
  11. C.-W. Chung, J.-K. Min, and K. Shim. Apex: an adaptive path index for xml data. In SIGMOD, pages 121-132, 2002.
  12. D. Colazzo, G. Ghelli, P. Manghi, and C. Sartiani. Types for path correctness of xml queries. In ICFP, pages 126-137, 2004.
  13. A. Deutsch, Y. Papakonstantinou, and Y. Xu. The NEXT logical framework for XQuery. In VLDB, pages 168-179, 2004.
  14. D. Draper, P. Fankhauser, M. Fernandez, A. Malhotra, K. Rose, M. Rys, J. Simeon, and P. Wadler. XQuery 1.0 and XPath 2.0 formal semantics, W3C working draft. Candidate Recommendation, Nov. 2005.
  15. M. Fernández, J. Siméon, B. Choi, A. Marian, and G. Sur. Implementing XQuery 1.0: The Galax Experience. In VLDB, Sept. 2003.
  16. M. F. Fernández, A. Malhorta, J. Marsh, M. Nagy, and N. Walsh. XQuery 1.0 and XPath 2.0 data model (XDM), July 2006. .
  17. M. Fontoura, V. Josifovski, E. Shekita, and B. Yang. Op- timizing cursor movement in holistic twig joins. In CIKM, pages 784-791, 2005.
  18. G. Gottlob, C. Koch, and R. Pichler. Efficient algorithms for processing XPath queries. In VLDB, pages 95-106, 2002.
  19. T. Grust and M. V. Keulen. Tree awareness for relational DBMS kernels: Staircase join. In Intelligent Search on XML Data, pages 231-245, 2003.
  20. T. Grust, M. V. Keulen, and J. Teubner. Accelerating XPath evaluation in any RDBMS. ACM Trans. Database Syst., 29(1):91-131, 2004.
  21. J. Hidders, P. Michiels, J. Siméon, and R. Vercam- men. How to recognize different kinds of tree patterns from quite a long way away. Technical Report TR UA 13-2006, Univ. of Antwerp and IBM Research, 2006. http://www.adrem.ua.ac.be.
  22. H. Jiang, H. Lu, and W. Wang. Efficient processing of twig queries with or-predicates. In SIGMOD, pages 59-70, 2004.
  23. H. Jiang, W. Wang, H. Lu, and J. X. Yu. Holistic twig joins on indexed XML documents. In VLDB, pages 273- 284, 2003.
  24. Q. Li and B. Moon. Indexing and querying xml data for regular path expressions. In VLDB, pages 361-370, 2001.
  25. J. Lu, T. W. Ling, C. Y. Chan, and T. Chen. From region en- coding to extended Dewey: On efficient processing of XML twig pattern matching. In VLDB, pages 193-204, 2005.
  26. J. Lu, T. W. Ling, T. Yu, C. Li, and W. Ni. Efficient process- ing of ordered XML twig pattern. In DEXA, pages 300-309, 2005.
  27. N. May, S. Helmer, and G. Moerkotte. Nested queries and quantifiers in an ordered context. In ICDE, pages 239-250, 2004.
  28. P. Michiels, G. A. Mihȃilȃ, and J. Siméon. Put a tree pat- tern in your tuple algebra. Technical Report TR UA 09- 2006, Univ. of Antwerp and IBM Research, Belgium, 2006. http://www.adrem.ua.ac.be.
  29. S. Paparizos, Y. Wu, L. V. S. Lakshmanan, and H. V. Jagadish. Tree logical classes for efficient evaluation of XQuery. In SIGMOD, pages 71-82, 2004.
  30. C. Re, J. Siméon, and M. F. Fernández. A complete and efficient algebraic compiler for xquery. In ICDE, page 14, 2006.