Academia.eduAcademia.edu

Outline

Multimedia information seeking through search and hyperlinking

2013, Proceedings of the 3rd ACM conference on International conference on multimedia retrieval - ICMR '13

https://doi.org/10.1145/2461466.2461511

Abstract

Searching for relevant webpages and following hyperlinks to related content is a widely accepted and effective approach to information seeking on the textual web. Existing work on multimedia information retrieval has focused on search for individual relevant items or on content linking without specific attention to search results. We describe our research exploring integrated multimodal search and hyperlinking for multimedia data. Our investigation is based on the Medi-aEval 2012 Search and Hyperlinking task. This includes a known-item search task using the Blip10000 internet video collection, where automatically created hyperlinks link each relevant item to related items within the collection. The search test queries and link assessment for this task was generated using the Amazon Mechanical Turk crowdsourcing platform. Our investigation examines a range of alternative methods which seek to address the challenges of search and hyperlinking using multimodal approaches. The results of our experiments are used to propose a research agenda for developing effective techniques for search and hyperlinking of multimedia content.

References (36)

  1. REFERENCES
  2. M Bron, B Huurnink, and M de Rijke. Linking archives using document enrichment and term selection. In Proceedings of TPDL 2011, pages 2357-2360, 2011.
  3. K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman. The devil is in the details: an evaluation of recent feature encoding method. In Proceedings of BMVC 2011, 2011.
  4. R.G. Cinbis, Jakob Verbeek, and Cordelia Schmid. Unsupervised Metric Learning for Face Identification in TV Video. In Proceedings of ICCV 2011, Barcelona, Spain, 2011.
  5. M. Eskevich, G.J. F. Jones, S. Chen, R. Aly, R.J.F. Ordelman, and M. Larson. Search and Hyperlinking Task at Mediaeval 2012. In MediaEval, volume 927 of CEUR Workshop Proceedings. CEUR-WS.org, 2012.
  6. M. Eskevich, G.J.F. Jones, M. Larson, and R.J.F. Ordelman. Creating a data collection for evaluating rich speech retrieval. In Proceedings of LREC 2012, Istanbul, Turkey, 2012.
  7. M. Eskevich, G.J.F. Jones, M. Larson, C. Wartena, R. Aly, T. Verschoor, and R.J.F. Ordelman. Comparing retrieval effectiveness of alternative content segmentation methods for internet video search. In Proeedings of CBMI 2012, 2012.
  8. M. Eskevich, W. Magdy, and G.J.F. Jones. New metrics for meaningful evaluation of informally structured speech retrieval. In Proceedings of ECIR 2012, pages 170-181, 2012.
  9. J.S. Garofolo, C.G.P. Auzanne, and E.M. Voorhees. The TREC spoken document retrieval track: A success story. In Proceedings of RIAO 2000, pages 1-8, 2000.
  10. A. Girgensohn, L. Wilcox, F. Shipman, and S. Bly. Designing affordances for the navigation of detail-on-demand hypervideo. In Proceedings of AVI 2004, pages 290-297. ACM, 2004.
  11. L. Hardman. Modelling and authoring hypermedia documents. PhD thesis, Universiteit Amsterdam, 1998.
  12. D. Hiemstra. Using language models for information retrieval. PhD thesis, University of Twente, 2001.
  13. P. Hoffmann, T. Kochems, and M. Herczeg. HyLive: Hypervideo-Authoring for Live Television. In Changing Television Environments, pages 51-60. Springer, 2008.
  14. T. Kaneko, T. Takigami, and T. Akiba. STD based on hough transform and SDR using STD results: Experiments at NTCIR-9 SpokenDoc. In Proceedings of Ninth NTCIR Workshop Meeting, 2011.
  15. P. Kelm, S. Schmiedeke, and T. Sikora. Feature-based Video Key Frame Extraction for low Quality Video Sequences. In Proceedings of WIAMIS 2009.
  16. Lori Lamel and Jean-Luc Gauvain. Speech processing for audio indexing. In Advances in Natural Language Processing, volume 5221 of LNCS, pages 4-15. 2008.
  17. M. Larson, M. Eskevich, R. Ordelman, C. Kofler, S. Schmiedeke, and G. J. F. Jones. Overview of MediaEval 2011 Rich Speech Retrieval Task and Genre Tagging Task. In MediaEval 2011 Workshop, Pisa, Italy, 2011.
  18. M. Larson, C. Kofler, and A. Hanjalic. Reading between the tags to predict real-world size-class for visually depicted objects in images. In Proceedings of ACM MM, 2011.
  19. M. Larson, M. Soleymani, M. Eskevich, P. Serdyukov, R.J.F. Ordelman, and G. J. F. Jones. The community and the crowd: Multimedia benchmark dataset development. IEEE MultiMedia, 19(3):15, 2012.
  20. M.A. Larson, S. Schmiedeke, P. Kelm, A. Rae, V. Mezaris, T. Piatrik, M. Soleymani, F. Metze, and G.J.F. Jones, editors. Working Notes Proceedings of the MediaEval 2012 Workshop, volume 927 of CEUR Workshop Proceedings. CEUR-WS.org, 2012.
  21. B. Meixner, K. Matusik, C. Grill, and H. Kosch. Towards an easy to use authoring tool for interactive non-linear video. Multimedia Tools and Applications, pages 1-26, 2012.
  22. P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. DBpedia spotlight: Shedding light on the web of documents. In Proceedings of the 7th International Conference on Semantic Systems (I-Semantics), 2011.
  23. D. Milne and I.H. Witten. Learning to link with wikipedia. In Proceeding of CIKM 2008, pages 509-518. ACM, 2008.
  24. J. Morang, R.J.F. Ordelman, F.M.G. de Jong, and A.J. van Hessen. InfoLink: analysis of Dutch broadcast news and cross-media browsing. In Proceedings of ICME 2005, Los Alamitos, 2005.
  25. P. Pecina, P. Hoffmannova, G. J. F. Jones, Y. Zhang, and D. W. Oard. Overview of the CLEF 2007 cross-language speech retrieval track. In Proceedings of CLEF 2007, pages 674-686, 2007.
  26. S. Robertson, H. Zaragoza, and M. Taylor. Simple BM25 extension to multiple weighted fields. In Proceedings of ACM CIKM 2004, 2004.
  27. A. Rousseau, F. Bougares, P. Deléglise, H. Schwenk, and Y. Estèv. Lium's systems for the iwslt 2011 speech translation tasks. In Proceedings of IWSLT 2011, 2011.
  28. I. Sawhney, N. and Balcom, D. and Smith. Authoring and navigating video in space and time. MultiMedia, IEEE, 4(4):30-39, 1997.
  29. F. Shipman, A. Girgensohn, and L. Wilcox. Authoring, viewing, and generating hypervideo: An overview of Hyper-Hitchcock. ACM Trans. Multimedia Comput. Commun. Appl., (2):15:1--15:19, 2008.
  30. J. Sivic and A. Zisserman. Video google: a text retrieval approach to object matching in videos. In Proceedings of ICCV 2003, pages 1470 -1477 vol.2, 2003.
  31. A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In Proceedings of MIR 2006, Santa Barbara, California, USA, 2006.
  32. A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell., 22(12):1349-1380, 2000.
  33. M. Utiyama and H. Isahara. A statistical model for domain-independent text segmentation. In Proceedings of ACL 2001.
  34. E. Voorhees, D.K. Harman, National Institute of Standards, and Technology (US). TREC: Experiment and evaluation in information retrieval. MIT press USA, 2005.
  35. E.M. Voorhees. The TREC-8 Question Answering Track Report. In Proceedings of TREC-8, pages 77-82, 1999.
  36. R. Yan. Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval. PhD thesis, Carnegie Mellon University, 2006.