Academia.eduAcademia.edu

Outline

Similarity-Based Queries for Time Series Data

1998

Abstract

We study a set of linear transformations on the Fourier series representation of a sequence that can be used as the basis for similarity queries on time-series data. We show that our set of transformations is rich enough to formulate operations such as moving average and time warping. We present a query processing algorithm that uses the underlying R-tree index of a multidimensional data set to answer similarity queries e ciently. Our experiments show that the performance of this algorithm is competitive to that of processing ordinary (exact match) queries using the index, and much faster than sequential scanning. We relate our transformations to the general framework for similarity queries of Jagadish et al.

References (13)

  1. Rakesh Agrawal, Christos Faloutsos, and Arun Swami. E cient similarity search in sequence databases. In Foundations Of Data Organizations and algorithms (FODO) con- ference, October 1993.
  2. ALSS95] Rakesh Agrawal, King-Ip Lin, Harpreet S. Sawhney, and Kyuseok Shim. Fast similar- ity search in the presence of noise, scaling, and translation in time-series databases. In Proceedings of the 21st VLDB Conference, pages 490{501, Zurich, Switzerland, 1995.
  3. APWZ95] R. Agrawal, G. Psaila, E. L. Wimmers, and M. Zait. Querying shapes of histories. In Proceedings of the 21st VLDB Conference, pages 502{514, Zurich, Switzerland, 1995.
  4. BKSS90] N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R* tree: an e cient and robust index method for points and rectangles. In ACM SIGMOD Conf. on the Management Of Data, pages 322{331. ACM, 1990.
  5. R. D. Edwards and J. Magee. Techni- cal analysis of stock trends. John Magee, Spring eld, Massachsetts, 1969.
  6. FJMM95] C. Faloutsos, H. V. Jagadish, A. O. Mendel- zon, and T. Milo. A signature technique for similarity-based queries. technical report 112530-951110-16TM, AT&T, Murray Hill, NJ, November 1995.
  7. C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. In Intl. Conf. on Management of Data -SIGMOD 94, pages 419{429, Minneapolis, May 1994. GK95] D. Q. Goldin and P. C. Kanellakis. On similarity queries for time-series data: con- straint speci cation and implementation. In 1st Intl. Conf. on the Principles and Prac- tice of Constraint Programming, pages 137{ 153. LNCS 976, Sept. 1995.
  8. Gut84] Antonin Guttman. R-trees: a dynamic index structure for spatial searching. In ACM SIGMOD Conf. on the Management Of Data, pages 47{57. ACM, 1984.
  9. Jag91] H. V. Jagadish. A retrieval technique for similar shapes. In ACM SIGMOD Symp. on the Management Of Data, pages 208{217, 1991.
  10. H. V. Jagadish, A. O. Mendelzon, and T. Milo. Similarity-based queries. PODS, 1995.
  11. A. V. Oppenheim and R. W. Schafer. Digi- tal Signal Processing. Prentice-Hall, Engle- wood Cli s, N.J., 1975.
  12. N. Roussopoulos, S. Kelley, and F. Vincent. Nearest neighbor queries. In Proceedings of the ACM SIGMOD Annual Conference, San Jose, CA, 1995.
  13. Rot93] William G. Roth. MIMSY: A system for an- alyzing time series data in the stock market domain. University of Wisconsin, Madison, 1993. Master Thesis. RS92] Raghu Ramakrishnan and Divesh Srivas- tava. CORAL: Control, relations and logic. In Proceedings of the Int. Conf. on VLDB, 1992. SK83] David Sanko and Joseph B. Kruskal. Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Com- parison. Addison-Wesley Publishing Com- pany, 1983.