Semantic Multimedia
Abstract
Multimedia constitutes an interesting field of application for Semantic Web and Semantic Web reasoning, as the access and management of multimedia content and context depends strongly on the semantic descriptions of both. At the same time, multimedia resources constitute complex objects, the descriptions of which are involved and require the foundation on sound modeling practice in order to represent findings of low-and high level multimedia analysis and to make them accessible via Semantic Web querying of resources. This tutorial aims to provide a red thread through these different issues and to give an outline of where Semantic Web modeling and reasoning needs to further contribute to the area of semantic multimedia for the fruitful interaction between these two fields of computer science. 8 See also the forthcoming W3C Media Fragments Working Group:
References (101)
- Adali, S., Sapino, M.L., Subrahmanian, V.S.: An algebra for creating and querying multimedia presentations. Multimedia Syst. 8(3), 212-230 (2000)
- Ahern, S., Naaman, M., Nair, R., Yang, J.H.-I.: World explorer: visualizing aggre- gate data from unstructured text in geo-referenced collections. In: Proceedings of the 7th ACM/IEEE joint conference on Digital libraries, pp. 1-10. ACM Press, New York (2007)
- Arndt, R., Troncy, R., Staab, S., Hardman, L., Vacura, M.: COMM: Designing a Well-Founded Multimedia Ontology for the Web. In: 6th Int. Semantic Web Conference (2007)
- Blöhdorn, S., Petridis, K., Saathoff, C., Simou, N., Tzouvaras, V., Avrithis, Y., Handschuh, S., Kompatsiaris, Y., Staab, S., Strintzis, M.: Semantic Annotation of Images and Videos for Multimedia Analysis. In: 2nd European Semantic Web Conference (2005)
- Brunelli, R., Poggio, T.: Template matching: Matched spatial filters and beyond. Pattern Recognition 30(5), 751-768 (1997)
- Brusilovsky, P., Maybury, M.T.: From adaptive hypermedia to the adaptive Web. Communications of the ACM 45(5), 30-33 (2002)
- Chen, H., Shimshoni, I., Meer, P.: Model based object recognition by robust in- formation fusion. In: 17th International Conference on Pattern Recognition, Cam- brige, UK (August 2004)
- Christianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
- Cortes, C., Vapnik, V.N.: Support vector networks. Machine Learning 20, 273-297 (1995)
- Dey, A.K., Abowd, G.D.: Towards a Better Understanding of Context and Context- Awareness. Technical Report GIT-GVU-99-22, Graphics, Visualization and Usabil- ity Center and College of Computing, Georgia Institute of Technology, Atlanta, GA, USA (June 1999)
- Ding, D., Yang, J., Li, Q., Liu, W., Wang, L.: What can expressive semantics tell: Retrieval model for a flash-movie search engine. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 123-133. Springer, Heidelberg (2005)
- Eco, U.: Einfuehrung in die Semiotik. Wilhelm Fink Verlag, Munich (1985)
- Falkovych, K., Nack, F.: Context Aware Guidance for Multimedia Authoring: Har- monizing domain and discourse knowledge. Multimedia Systems Journal 11(3) (2006)
- Falkovych, K., Nack, F., van Ossenbruggen, J., Rutledge, L.: Sample: Towards a framework for system-supported multimedia authoring. In: Multimedia Modelling, p. 362. IEEE Computer Society, Los Alamitos (2004)
- Fink, J., Kobsa, A., Schreck, J.: Personalized hypermedia information through adaptive and adaptable system features: User modeling, privacy and security issues. In: Mullery, A., Besson, M., Campolargo, M., Gobbi, R., Reed, R. (eds.) Intelligence in Services and Networks: Technology for Cooperative Competition, pp. 459-467.
- Springer, Heidelberg (1997)
- Flickner, M., Sawhney, H., Niblack, W., Ashley, J., Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic, D., Steele, D., Yanker, P.: Query by image and video content: the QBIC system. In: Readings in multimedia computing and net- working, pp. 255-264. Morgan Kaufmann, San Francisco (2001)
- Gangemi, A., Borgo, S., Catenacci, C., Lehmann, J.: Task Taxonomies for Knowl- edge Content. Technical report, Metokis Deliverable 7, (2004)
- Gangemi, A., Guarino, N., Masolo, C., Oltramari, A., Schneider, L.: Sweetening ontologies with dolce. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 166-181. Springer, Heidelberg (2002)
- Garcia, R., Celma, O.: Semantic Integration and Retrieval of Multimedia Meta- data. In: 5th International Workshop on Knowledge Markup and Semantic Anno- tation (2005)
- García, R., Gil, R.: Facilitating Business Interoperability from the Semantic Web. In: Abramowicz, W. (ed.) BIS 2007. LNCS, vol. 4439, pp. 220-232. Springer, Hei- delberg (2007)
- Garcia, R., Gil, R., Delgado, J.: A Web Ontologies Framework for Digital Rights Management. Journal of Artificial Intelligence and Law 15, 137-154 (2007)
- Gemmell, J., Bell, G., Lueder, R.: Mylifebits: a personal database for everything. Commun. ACM 49(1), 88-95 (2006)
- Gemmell, J., Williams, L., Wood, K., Lueder, R., Bell, G.: Passive capture and ensuing issues for a personal lifetime store. In: Proceedings of the the 1st ACM workshop on Continuous archival and retrieval of personal experiences, pp. 48-55. ACM Press, New York (2004)
- Geurts, J., van Ossenbruggen, J., Hardman, L.: Application-specific constraints for multimedia presentation generation. In: Multimedia Modeling. IEEE, Los Alamitos (2001)
- Geurts, J., van Ossenbruggen, J., Hardman, L.: Requirements for practical multi- media annotation. In: Workshop on Multimedia and the Semantic Web (2005)
- Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice Hall, Englewood Cliffs (2001)
- Gräßl, C., Deinzer, F., Nieman, H.: Continuous parametrization of normal distri- bution for improving the discrete statistical eigenspace approach for object recog- nition. In: Krasnoproshin, V., Ablameyko, S., Soldek, J. (eds.) Pattern Recognition and Information Processing 2003, Minsk, Belarus, May 2003, pp. 73-77 (2003)
- Grzegorzek, M., Izquierdo, E.: Statistical 3d object classification and localization with context modeling. In: Domanski, M., Stasinski, R., Bartkowiak, M. (eds.) 15th European Signal Processing Conference, pp. 1585-1589. PTETiS, Poznan (2007)
- Halasz, F., Schwartz, M.: The Dexter Hypertext Reference Model. Communications of the ACM 37(2), 30-39 (1994)
- Harth, A., Umbrich, J., Hogan, A., Decker, S.: Yars2: A federated repository for searching and querying graph structured data. Technical report, Digital Enterprise Research Institute, Galway, 4 (2007)
- Hornegger, J.: Statistische Modellierung, Klassifikation und Lokalisation von Ob- jekten. Shaker Verlag, Aachen (1996)
- Hunter, J.: Adding Multimedia to the Semantic Web -Building an MPEG-7 Ontol- ogy. In: 1st International Semantic Web Working Symposium, pp. 261-281 (2001)
- Hunter, J.: Combining the CIDOC/CRM and MPEG-7 to Describe Multimedia in Museums. In: 6th Museums and the Web Conference (2002), http://www. archimuse.com/mw2002/papers/hunter/hunter.html
- Hunter, J.: Enhancing the semantic interoperability of multimedia through a core ontology. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 49-58 (2003)
- Hunter, J., Armstrong, L.: A Comparison of Schemas for Video Metadata Repre- sentation. In: 8th International World Wide Web Conference, pp. 1431-1451 (1999)
- Hunter, J., Little, S.: A Framework to Enable the Semantic Inferencing and Query- ing of Multimedia Content. International Journal of Web Engineering and Tech- nology -Special Issue on the Semantic Web 2(2/3), 264-286 (2005)
- Isaac, A., Troncy, R.: Designing and Using an Audio-Visual Description Core On- tology. In: Workshop on Core Ontologies in Ontology Engineering (2004)
- Kang, H., Shneiderman, B.: Visualization methods for personal photo collections: Browsing and searching in the photofinder. In: IEEE International Conference on Multimedia and Expo (III), pp. 1539-1542 (August 2000)
- Kerr, J., Compton, P.: Toward generic model-based object recognition by knowl- edge acquisition and machine learning. In: Proceedings of the Eighteenth Inter- national Joint Conference on Artificial Intelligence, Acapulco, Mexico, pp. 9-15 (2003)
- King, R., Popitsch, N., Westermann, U.: METIS: a flexible database foundation for unified media management. In: Proc.of the 12th annual ACM Int. Conf. on Multimedia, pp. 744-745. ACM Press, New York (2004)
- Kobsa, A., Koenemann, J., Pohl, W.: Personalized Hypermedia Presentation Tech- niques for Improving Online Customer Relationships. In: The Knowledge Engineer- ing Review, vol. 16, pp. 111-155. Cambridge University Press, Cambridge (2001)
- Kochut, K., Janik, M.: Sparqler: Extended sparql for semantic association discov- ery. In: Franconi, E., Kifer, M., May, W. (eds.) ESWC 2007. LNCS, vol. 4519, pp. 145-159. Springer, Heidelberg (2007)
- Lagoze, C., Hunter, J.: The ABC Ontology and Model (v3.0). Journal of Digital Information 2(2) (2001)
- Latecki, L.J., Lakaemper, R., Wolter, D.: Optimal partial shape similarity. Image and Vision Computing Journal 23, 227-236 (2005)
- Lee, B.N., Chen, W., Chang, E.Y.: Fotofiti: web service for photo management. In: Proceedings of the 14th annual ACM international conference on Multimedia, pp. 485-486. ACM Press, New York (2006)
- Lee, T., Sheng, L., Balkir, N.H., Al-Hamdani, A., Özsoyoglu, G., Özsoyoglu, Z.M.: Query Processing Techniques for Multimedia Presentations. Multimedia Tools Appl. 11(1), 63-99 (2000)
- Lee, T., Sheng, L., Bozkaya, T., Balkir, N.H., Özsoyoglu, Z.M., Özsoyoglu, G.: Querying Multimedia Presentations Based on Content. IEEE Trans. on Knowledge and Data Engineering 11(3), 361-385 (1999)
- Leonardis, A., Bischof, H.: Dealing with occlusions in the eigenspace approach. In: Pelillo, M., Hancock, E.R. (eds.) EMMCVPR 1997. LNCS, vol. 1223, pp. 453-458.
- Springer, Heidelberg (1997)
- Moghaddam, B., Pentland, A.: Probabilistic visual learning for object representa- tion. PAMI 19(7), 696-710 (1997)
- Mokhtarian, F., Bober, M.: Curvature Scale Space Representation: Theory, Appli- cations, and MPEG7-Standardization. Springer, Heidelberg (2003)
- MPEG-21. Part 17: Fragment Identification of MPEG Resources. Standard No. ISO/IEC 21000-17 (2006)
- MPEG-7. Multimedia Content Description Interface. Standard No. ISO/IEC 15938 (2001)
- Murase, H., Nayar, S.K.: Visual learning and recognition of 3-d objects from ap- pearance. International Journal of Computer Vision 14(1), 5-24 (1995)
- Naaman, M., Yeh, R.B., Garcia-Molina, H., Paepcke, A.: Leveraging context to resolve identity in photo albums. In: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries, pp. 178-187. ACM Press, New York (2005)
- Nack, F.: AUTEUR: The Application of Video Semantics and Theme Representa- tion for Automated Film Editing. PhD thesis, Lancaster University, UK (Septem- ber 1996)
- Nack, F., Hardman, L.: Denotative and connotative semantics in hypermedia: pro- posal for a semiotic-aware architecture. New Rev. Hypermedia Multimedia 7(1), 7-37 (2002)
- Nack, F., Lindsay, A.T.: Everything you wanted to know about MPEG-7 (Parts I & II). IEEE Multimedia 6(3-4) (1999)
- Nack, F., van Ossenbruggen, J., Hardman, L.: That Obscure Object of Desire: Multimedia Metadata on the Web (Part II). IEEE Multimedia 12(1) (2005)
- Neumann, B., Möller, R.: On Scene Interpretation with Description Logics. In: Cognitive Vision Systems, pp. 247-275. Springer, Heidelberg (2006)
- Niemann, H.: Klassifikation von Mustern. Springer, Heidelberg (1983)
- Oberle, D., Lamparter, S., Grimm, S., Vrandecic, D., Staab, S., Gangemi, A.: Towards Ontologies for Formalizing Modularization and Communication in Large Software Systems. Journal of Applied Ontology 1(2), 163-202 (2006)
- Sirma Group Corp Ontotext Lab. Bigowlim: System documentation (2006) [15-05- 2008], http://www.ontotext.com/owlim/big/BigOWLIMSysDoc.pdf
- van Ossenbruggen, J., Nack, F., Hardman, L.: That Obscure Object of Desire: Multimedia Metadata on the Web (Part I). IEEE Multimedia 11(4) (2004)
- Park, S., Lee, J., Kim, S.: Content-based image classification using a neural net- work. Pattern Recognition Letters 25(3), 287-300 (2004)
- Paulus, D., Hornegger, J.: Applied Pattern Recognition. Friedr. Vieweg & Sohn Verlagsgesellschaft GmbH, Braunschweig (2003)
- Pease, A., Niles, I., Li, J.: The Suggested Upper Merged Ontology: A Large Ontol- ogy for the Semantic Web and its Applications. In: Working Notes of the AAAI- 2002 Workshop on Ontologies and the Semantic Web (2002)
- Polleres, A., Scharffe, F., Schindlauer, R.: Sparql++ for mapping between rdf vo- cabularies. In: OTM Conferences (1), pp. 878-896 (2007)
- Polydoros, P., Tsinaraki, C., Christodoulakis, S.: GraphOnto: OWL-based ontology management and multimedia annotation in the DS-MIRF framework. Journal of Digital Information Management (JDIM) 4(4), 214-219 (2006)
- Popper, K.: Three worlds [the tanner lecture on human values: Delivered at the university of michigan], April (1978), http://www.tannerlectures.utah.edu/ lectures/documents/popper80.pdf
- Pösl, J.: Erscheinungsbasierte, statistische Objekterkennung. Shaker Verlag, Aachen (1999)
- Pratt, W.K.: Digital Image Processing. John Wiley & Sons Ltd., New York (2001)
- Reinhold, M.: Robuste, probabilistische, erscheinungsbasierte Objekterkennung. Logos Verlag, Berlin (2004)
- Ross, K., Westermann, G.U., Popitsch, N.: METIS -A Flexible Database Solution for the Management of Multimedia Assets. In: Proc. of the 10th Int. Workshop on Multimedia Information Systems, College Park, MD, USA (August 2004)
- Saathoff, C., Staab, S.: Exploiting Spatial Context in Images Using Fuzzy Con- straint Reasoning. In: 9th Int. Workshop on Image Analysis for Multimedia Inter- active Services, Klagenfurt, Austria. IEEE, Los Alamitos (2008)
- Schenk, S., Staab, S.: Networked graphs: A declarative mechanism for sparql rules, sparql views and rdf data integration on the web. In: Proceedings of the 17th International World Wide Web Conference, WWW2008, Bejing, China (2008)
- Scherp, A., Agaram, S., Jain, R.: Event-centric media management. In: Gevers, T., Jain, R.C., Santini, S. (eds.) Multimedia Content Access: Algorithms and Systems II. Proceedings of the SPIE Society of Photo-Optical Instrumentation Engineers (SPIE) Conference, vol. 6820, pp. 68200C-68200C-15 (January 2008)
- Scherp, A.: Semantics support for personalized multimedia content. In: Int. Conf. Internet and Multimedia Systems and Applications, Innsbruck, Austria, March 2008, pp. 57-65. IASTED (2008)
- Scherp, A., Boll, S., Cremer, H.: Emergent semantics in personalized multimedia content. J. of Digital Information Management 5(2) (April 2007)
- Scherp, A., Jain, R.: Towards an ecosystem for semantics. In: MS 2007: Workshop on multimedia information retrieval on The many faces of multimedia semantics, pp. 3-12. ACM Press, New York (2007)
- Schilit, B., Adams, N., Want, R.: Context-Aware Computing Applications. In: Workshop on Mobil Computing Systems and Applications, Santa Cruz, CA, USA, pp. 85-90. IEEE, Los Alamitos (1994)
- Schmidt, A., Beigl, M., Gellersen, H.-W.: There is more to context than location. Computers & Graphics 23(6), 893-901 (1999)
- Schueler, B., Sizov, S., Staab, S., Tran, D.T.: Querying for meta knowledge. In: WWW 2008: Proceeding of the 17th international conference on World Wide Web, pp. 625-634. ACM, New York (2008)
- Shneiderman, B., Kang, H.: Direct annotation: A drag-and-drop strategy for la- beling photos. In: Proceedings of the International Conference on Information Vi- sualisation, p. 88. IEEE Computer Society, Washington (2000)
- Straccia, U.: Managing Uncertainty and Vagueness in Description Logics, Logic Programs and Description Logic Programs. Springer, Heidelberg (2008)
- Sure, Y., Staab, S., Studer, R.: Methodology for development and employment of ontology based knowledge management applications. SIGMOD Rec. 31(4), 18-23 (2002)
- Tadeusiewicz, R.: Introduction to Practice of Application of Neural Networks (in Neuron Networks) StatSoft, Warsaw, Poland (1999)
- Troncy, R.: Integrating Structure and Semantics into Audio-visual Documents. In: 2nd International Semantic Web Conference, pp. 566-581 (2003)
- Troncy, R., Bailer, W., Hausenblas, M., Hofmair, P., Schlatte, R.: Enabling Multi- media Metadata Interoperability by Defining Formal Semantics of MPEG-7 Pro- files. In: 1st International Conference on Semantics And digital Media Technology, pp. 41-55 (2006)
- Troncy, R., Celma, Ó., Little, S., García, R., Tsinaraki, C.: MPEG-7 based Mul- timedia Ontologies: Interoperability Support or Interoperability Issue? In: 1st In- ternational Workshop on Multimedia Annotation and Retrieval enabled by Shared Ontologies, pp. 2-15 (2007)
- Troncy, R., Hardman, L., van Ossenbruggen, J., Hausenblas, M.: Identifying Spatial and Temporal Media Fragments on the Web. In: W3C Video on the Web Workshop (2007), http://www.w3.org/2007/08/video/positions/Troncy.pdf
- Tsinaraki, C., Christodoulakis, S.: Interoperability of XML Schema Applications with OWL Domain Knowledge and Semantic Web Tools. In: 6th International Conference on Ontologies, DataBases, and Applications of Semantics (ODBASE) (2007)
- Tsinaraki, C., Polydoros, P., Christodoulakis, S.: Interoperability support for Ontology-based Video Retrieval Applications. In: 3rd International Conference on Image and Video Retrieval (CIVR), pp. 582-591 (2004)
- Tsinaraki, C., Polydoros, P., Christodoulakis, S.: Interoperability support between MPEG-7/21 and OWL in DS-MIRF. Transactions on Knowledge and Data Engi- neering (TKDE) 19(2), 219-232 (2007) (Special Issue on the Semantic Web Era)
- Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Conference on Com- puter Vision and Pattern Recognition, Maui, USA, pp. 586-591 (June 1991) 95.
- van Ossenbruggen, J., Hardman, L., Geurts, J., Rutledge, L.: Towards a multimedia formatting vocabulary. In: World Wide Web. ACM, New York (2003)
- Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1995)
- Walter, J., Arnrich, B.: Gabor filters for object localization and robot grasp- ing. In: Proceedings of the 15th International Conference on Pattern Recognition, Barcelona, Spain, September 2000. ICSP, pp. 124-127 (2000)
- Westermann, U., Jain, R.: Toward a common event model for multimedia applica- tions. IEEE MultiMedia 14(1), 19-29 (2007)
- Yuan, C., Niemann, H.: Neural networks for the recognition and pose estimation of 3-d objects from a single 2-d perspective view. International Journal of Image and Vision Computing 19, 585-592 (2001)