Towards semantic search in building sensor data
2021, Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation
https://doi.org/10.1145/3486611.3486647Abstract
This paper presents a search engine system for sensor time series data and metadata in the context of building management. It takes natural language queries as input and retrieves sensor time series data, ranks them with respect to their relevance to a given query, and visualizes the results as graphs. In addition, the system allows users to interact with the search results: they can define events of interest in the visualized results and search across sensor data for time series with similar shape, i.e. the search by example scheme. We leverage both a feature based cosine similarity model and DTW to find similar time series and rank them by relevance. Our quantitative evaluations and user studies demonstrate the value of this system for managing building sensor data. • Information systems → Database query processing; • Computer systems organization → Sensor networks.
References (16)
- Mustafa Gokce Baydogan, George Runger, and Eugene Tuv. 2013. A Bag-of- Features Framework to Classify Time Series. IEEE Trans. Pattern Anal. Mach. Intell. 35, 11 (2013), 2796-2802.
- Arka A Bhattacharya, Dezhi Hong, David Culler, Jorge Ortiz, Kamin Whitehouse, and Eugene Wu. 2015. Automated metadata construction to support portable building applications. In BuildSys. 3-12.
- Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, et al. 2018. Universal sentence encoder for English. In Proceedings of the 2018 EMNLP: System Demonstrations. 169-174.
- Cynthia Dwork, Ravi Kumar, Moni Naor, and Dandapani Sivakumar. 2001. Rank aggregation methods for the web. In Proceedings of the 10th WWW. 613-622.
- Harry Hochheiser and Ben Shneiderman. 2002. A dynamic query interface for finding patterns in time series data. In CHI'02. 522-523.
- Dezhi Hong, Hongning Wang, Jorge Ortiz, and Kamin Whitehouse. 2015. The building adapter: Towards quickly applying building analytics at scale. In BuildSys. 123-132.
- Jason Koh, Dezhi Hong, Rajesh Gupta, Kamin Whitehouse, Hongning Wang, and Yuvraj Agarwal. 2018. Plaster: An integration, benchmark, and development framework for metadata normalization methods. In BuildSys. 1-10.
- Fei Li and Hosagrahar V Jagadish. 2014. NaLIR: an interactive natural language interface for querying relational databases. In Proceedings of the 2014 SIGMOD. ACM, 709-712.
- Christopher Manning, Prabhakar Raghavan, and Hinrich Schütze. 2010. Intro- duction to information retrieval. Natural Language Engineering 16, 1 (2010), 100-103.
- Yannis Manolopoulos and RJ Alcock. 1999. Time-Series Similarity Queries Em- ploying a Feature-Based Approach. In 7th Hellenic Conference on Informatics. 27-29.
- George A Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Kather- ine J Miller. 1990. Introduction to WordNet: An on-line lexical database. Interna- tional journal of lexicography 3, 4 (1990), 235-244.
- Meinard Müller. 2007. Dynamic Time Warping. In Information Retrieval for Music and Motion. 69-84.
- Neelu Nihalani, Sanjay Silakari, and Mahesh Motwani. 2011. Natural language interface for database: a brief review. International Journal of Computer Science Issues (IJCSI) 8, 2 (2011), 600.
- Yonggang Qiu and Hans-Peter Frei. 1993. Concept based query expansion. In ACM SIGIR. 160-169.
- Thanawin Rakthanmanon, Bilson Campana, Abdullah Mueen, Gustavo Batista, Brandon Westover, Qiang Zhu, Jesin Zakaria, and Eamonn Keogh. 2012. Searching and mining trillions of time series subsequences under dynamic time warping. In Proceedings of the 18th ACM SIGKDD. 262-270.
- Bin Tan and Fuchun Peng. 2008. Unsupervised query segmentation using gener- ative language models and wikipedia. In Proceedings of the 17th WWW. 347-356.