Alternation and redundancy analysis of the intersection problem
2008, ACM Transactions on Algorithms (TALG)
https://doi.org/10.1145/1328911.1328915Abstract
The intersection of sorted arrays problem has applications in search engines such as Google. Previous work propose and compare deterministic algorithms for this problem, in an adaptive analysis based on the encoding size of a certificate of the result (cost analysis). We define the alternation analysis, based on the non-deterministic complexity of an instance. In this analysis we prove that there is a deterministic algorithm asymptotically performing as well as any randomized algorithm in the comparison model. We define the redundancy analysis, based on a measure of the internal redundancy of the instance. In this analysis we prove that any algorithm optimal in the redundancy analysis is optimal in the alternation analysis, but that there is a randomized algorithm which performs strictly better than any deterministic algorithm in the comparison model. Finally, we describe how those results can be extended beyond the comparison model.
References (23)
- Baeza-Yates, R. A. 2004. A fast set intersection algorithm for sorted sequences. In CPM, S. C. Sahinalp, S. Muthukrishnan, and U. Dogrusöz, Eds. Lecture Notes in Computer Science, vol. 3109. Springer, 400-408.
- Barbay, J. 2003. Optimality of randomized algorithms for the intersection problem. In Proceedings of the Symposium on Stochastic Algorithms, Foundations and Applications (SAGA 2003), in Lecture Notes in Computer Science, A. Albrecht, Ed. Vol. 2827 / 2003. Springer- Verlag Heidelberg, 26-38. 3-540-20103-3.
- Barbay, J. 2004. Index-trees for descendant tree queries in the comparison model. Tech. Rep. TR-2004-11, University of British Columbia. July.
- Barbay, J. and Kenyon, C. 2002. Adaptive intersection and t-threshold problems. In Proceedings of the thirteenth ACM-SIAM Symposium On Discrete Algorithms (SODA). ACM-SIAM, ACM, 390-399.
- Barbay, J. and Kenyon, C. 2003. Deterministic algorithm for the t-threshold set problem. In Lecture Notes in Computer Science, H. O. Toshihide Ibaraki, Noki Katoh, Ed. Springer- Verlag, 575-584. Proceedings of the 14th Annual International Symposium on Algorithms And Computation (ISAAC).
- Beame, P. and Fich, F. E. 2002. Optimal bounds for the predecessor problem and related problems. J. Comput. Syst. Sci. 65, 1, 38-72.
- Bender, M. A., Cole, R., and Raman, R. 2002. Exponential structures for efficient cache- oblivious algorithms. In Proceedings of the 29th International Colloquium on Automata, Languages and Programming. Springer-Verlag, 195-207.
- Bentley, J. L. and Yao, A. C.-C. 1976. An almost optimal algorithm for unbounded searching. Information processing letters 5, 3, 82-87.
- Chaudhuri, S. 1998. An overview of query optimization in relational systems. 34-43.
- Christen, C. 1978. Improving the bound on optimal merging. In Proceedings of 19th FOCS. 259-266.
- de la Vega, W. F., Frieze, A. M., and Santha, M. 1998. Average-case analysis of the merging algorithm of hwang and lin. Algorithmica 22, 4, 483-489.
- de la Vega, W. F., Kannan, S., and Santha, M. 1993. Two probabilistic results on merging. SIAM J. Comput. 22, 2, 261-271.
- Demaine, E. D., López-Ortiz, A., and Munro, J. I. 2000. Adaptive set intersections, unions, and differences. In Proceedings of the 11 th ACM-SIAM Symposium on Discrete Algorithms (SODA). 743-752.
- Demaine, E. D., López-Ortiz, A., and Munro, J. I. 2001. Experiments on adaptive set intersections for text retrieval systems. In Proceedings of the 3rd Workshop on Algorithm Engineering and Experiments, Lecture Notes in Computer Science. Washington DC, 5-6.
- Frigo, M., Leiserson, C. E., Prokop, H., and Ramachandran, S. 1999. Cache-oblivious algorithms. In Proceedings of the 40th Annual Symposium on Foundations of Computer Science. IEEE Computer Society, 285.
- Hwang, F. K. and Lin, S. 1971. Optimal merging of 2 elements with n elements. Acta Informatica, 145-158.
- Hwang, F. K. and Lin, S. 1972. A simple algorithm for merging two disjoint linearly ordered sets. SIAM Journal of Computing 1, 1, 31-39.
- Manacher, G. K. 1979. Significant improvements to the hwang-ling merging algorithm. JACM 26, 3, 434-440.
- Neumann, J. V. and Morgenstern, O. 1944. Theory of games and economic behavior. 1st ed. Princeton University Press.
- Raman, R., Raman, V., and Rao, S. S. 2002. Succinct indexable dictionaries with applications to encoding k-ary trees and multisets. In SODA '02: Proceedings of the thirteenth annual ACM- SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 233-242.
- Sion, M. 1958. On general minimax theorems. Pacic Journal of Mathematics, 171-176.
- Witten, I. H., Moffat, A., and Bell, T. C. 1994. Managing Gigabytes. VanNostrand Reinhold, 115 Fifth Avenue, New York, NY 10003.
- Yao, A. C. 1977. Probabilistic computations: Toward a unified measure of complexity. In Proc. 18th IEEE Symposium on Foundations of Computer Science (FOCS). 222-227.