Viewable scene modeling for geospatial video search
2008, Proceeding of the 16th ACM international conference on Multimedia - MM '08
https://doi.org/10.1145/1459359.1459401Abstract
Video sensors are becoming ubiquitous and the volume of captured video material is very large. Therefore, tools for searching video databases are indispensable. Current techniques that extract features purely based on the visual signals of a video are struggling to achieve good results. By considering video related meta-information, more relevant and precisely delimited search results can be obtained. In this study we propose a novel approach for querying videos based on the notion that the geographical location of the captured scene in addition to the location of a camera can provide valuable information and may be used as a search criterion in many applications. This study provides an estimation model of the viewable area of a scene for indexing and searching and reports on a prototype implementation. Among our objectives is to stimulate a discussion of these topics in the research community as information fusion of different georeferenced data sources is becoming increasingly important. Initial results illustrate the feasibility of the proposed approach.
References (20)
- REFERENCES
- Camera Calibration Toolbox for Matlab. http://www.vision.caltech.edu/bouguetj/calib doc/.
- Flickr. http://www.flickr.com.
- Geobloggers. http://www.geobloggers.com.
- Woophy. http://www.woophy.com.
- Boris Epshtein, Eyal Ofek, Yonatan Wexler, and Pusheng Zhang. Hierarchical Photo Organization Using Geo-Relevance. In 15 th ACM Intl. Symposium on Advances in Geographic Information Systems (GIS), pages 1-7, 2007.
- Clarence H. Graham, Neil R. Bartlett, John Lott Brown, Yun Hsia, Conrad C. Mueller, and Lorrin A. Riggs. Vision and Visual Perception. John Wiley & Sons, Inc., 1965.
- Antonin Guttman. R-Trees: A Dynamic Index Structure for Spatial Searching. In SIGMOD, Proceedings of Annual Meeting, Boston, Massachusetts, pages 47-57, 1984.
- Eugene Hecht. Optics. Addison-Wesley Publishing Company, 4 th edition, August 2001.
- Tae-Hyun Hwang, Kyoung-Ho Choi, In-Hak Joo, and Jong-Hun Lee. MPEG-7 Metadata for Video-Based GIS Applications. In Geoscience and Remote Sensing Symposium, pages 3641-3643, vol.6, 2003.
- Rieko Kadobayashi and Katsumi Tanaka. 3D Viewpoint-Based Photo Search and Information Browsing. In 28 th Intl. ACM SIGIR Conference on Research and Development in Information Retrieval, pages 621-622, 2005.
- Kyong-Ho Kim, Sung-Soo Kim, Sung-Ho Lee, Jong-Hyun Park, and Jong-Hyun Lee. The Interactive Geographic Video. In Geoscience and Remote Sensing Symposium, pages 59-61, vol.1, 2003.
- Xiaotao Liu, Mark Corner, and Prashant Shenoy. SEVA: Sensor-Enhanced Video Annotation. In 13 th ACM Intl. Conference on Multimedia, pages 618-627, 2005.
- Mor Naaman, Yee Jiun Song, Andreas Paepcke, and Hector Garcia-Molina. Automatic Organization for Digital Photographs with Geographic Coordinates. In 4 th ACM/IEEE-CS Joint Conference on Digital Libraries, pages 53-62, 2004.
- A. Pigeau and M. Gelgon. Building and Tracking Hierarchical Geographical & Temporal Partitions for Image Collection Management on Mobile Devices. In 13 th ACM Intl. Conference on Multimedia, 2005.
- Kerry Rodden and Kenneth R. Wood. How do People Manage their Digital Photographs? In SIGCHI Conference on Human Factors in Computing Systems, pages 409-416, 2003.
- Rainer Simon and Peter Fröhlich. A Mobile Application Framework for the Geospatial Web. In 16 th Intl. Conference on the World Wide Web, pages 381-390, 2007.
- Yannis Theodoridis, Michael Vazirgiannis, and Timos Sellis. Spatio-Temporal Indexing for Large Multimedia Applications. In IEEE Intl. Conf. on Multimedia Systems, 1996.
- Carlo Torniai, Steve Battle, and Steve Cayzer. Sharing, Discovering and Browsing Geotagged Pictures on the Web. Springer, 2006.
- Kentaro Toyama, Ron Logan, and Asta Roseway. Geographic Location Tags on Digital Images. In 11 th ACM Intl. Conference on Multimedia, pages 156-166, 2003.