Towards retrieving relevant information graphics
2013, Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
https://doi.org/10.1145/2484028.2484164
Abstract
Information retrieval research has made significant progress in the retrieval of text documents and images. However, relatively little attention has been given to the retrieval of information graphics (non-pictorial images such as bar charts and line graphs) despite their proliferation in popular media such as newspapers and magazines. Our goal is to build a system for retrieving bar charts and line graphs that reasons about the content of the graphic itself in deciding its relevance to the user query. This paper presents the first steps toward such a system, with a focus on identifying the category of intended message of potentially relevant bar charts and line graphs. Our learned model achieves accuracy higher than 80% on a corpus of collected user queries.