Academia.eduAcademia.edu

Outline

Interactive Movie Annotation

2003

https://doi.org/10.1109/MMUL.2003.1218254

Abstract

Effectively labelingthe visual content ofmovies is essentialfor annotation. Wepresent theinteractive andadaptive i-Notationsystem, whichdescribes actors’names, automaticallyprocessesmultimodalinformation sources,and deals withavailable sources’varying quality. Itprovides the basisfor intelligentinteraction anddemonstratessignificantimprovements inannotationefficiency.

References (18)

  1. F. Nack and A.T. Lindsay, "Everything You Wanted to Know about MPEG-7: Part 1," IEEE MultiMedia, vol. 6, no. 3, July-Sept. 1999, pp. 65-77.
  2. R. Weiss, A. Duda, and D.K. Gifford, "Composition and Search with a Video Algebra," IEEE MultiMedia, vol. 2, no. 1, Spring 1995, pp. 12-25.
  3. M. Davis, "Media Streams: An Iconic Visual Language for Video Representation," Readings in Human-Computer Interaction: Toward the Year 2000, 2nd ed., Morgan Kaufmann, 1995, pp. 854-866.
  4. R. Lienhart, "A System for Effortless Content Annotation to Unfold the Semantics in Videos," Proc. IEEE Int'l Workshop on Content-Based Access of Image and Video Databases, IEEE CS Press, 2000, pp. 45-49.
  5. S. Satoh, Y. Nakamura, and T. Kanade, "Name-It: Naming and Detecting Faces in News Videos," IEEE MultiMedia, vol. 6, no. 1, Jan.-Mar. 1999, pp. 22-35.
  6. J.S. Wachman and R.W. Picard, "Tools for Browsing a TV Situation Comedy Based on Content Specific Attributes," Multimedia Tools and Applications, vol. 13, no. 3, 2001, pp. 255-284.
  7. J. Korris and M. Macedonia, "The End of Celluloid: Digital Cinema Emerges," Computer, vol. 35, no. 4, Apr. 2002, pp. 96-98.
  8. J. Vendrig and M. Worring, "Systematic Evaluation of Logical Story Unit Segmentation," IEEE Trans.
  9. Multimedia, vol. 4, no. 4, Dec. 2002, pp. 492-499.
  10. G. Davenport, T. Aguierre Smith, and N. Pincever, "Cinematic Principles for Multimedia," IEEE Computer Graphics and Applications, vol. 11, no. 4, July 1991, pp. 67-74.
  11. P.J. Jang and A.G. Hauptmann, "Learning to Recog- nize Speech by Watching Television," IEEE Intelligent Systems, vol. 14, no. 5, Sept./Oct. 1999, pp. 51-58.
  12. M.-H. Yang, D.J. Kriegman, and N. Ahuja, "Detect- ing Faces in Images: A Survey," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 1, Jan. 2002, pp. 34-58.
  13. J. Vendrig and M. Worring, Multimodal Person Iden- tification, LNCS 2383, Springer Verlag, 2002, pp. 175-185.
  14. Y. Rui, T.S. Huang, and S. Mehrotra, "Constructing Table-of-Content for Videos," Multimedia Systems, vol. 7, no. 5, Sept. 1999, pp. 359-368.
  15. J. Vendrig, M. Worring, and A.W.M. Smeulders, "Fil- ter Image Browsing: Interactive Image Retrieval by Using Database Overviews," Multimedia Tools and Applications, vol. 15, no. 1, Sept. 2001, pp. 83-103.
  16. R.K. Srihari, "Automatic Indexing and Content- Based Retrieval of Captioned Images," Computer, vol. 28, no. 9, Sept. 1995, pp. 49-56.
  17. Jeroen Vendrig is a senior researcher at MediaMill, a Uni- versity of Amsterdam spin-off in conjunction with the Nether- lands Organization for Applied Scientific Research (TNO-TPD)
  18. that develops multimedia indexing tools. His research focuses on interactive video segmentation and visual- ization of video content and retrieval. Vendrig has an MS in business information systems and a PhD in com- puter science from the University of Amsterdam. Marcel Worring is a cofounder of MediaMill and an associate profes- sor of computer science at the University of Amsterdam. His main research interests are auto- matic structuring and indexing of multimedia content for content-based access, explo- ration, and presentation. Worring has an MS (honors) in computer science from the Free University Amsterdam and a PhD from the University of Amsterdam. Readers may contact Jeroen Vendrig at vendrig@ science.uva.nl. 37 July-September 2003