Academia.eduAcademia.edu

Multimedia semantics

description40 papers
group0 followers
lightbulbAbout this topic
Multimedia semantics is the study of meaning and interpretation in multimedia content, encompassing the integration of text, audio, images, and video. It focuses on how these elements convey information, emotions, and context, and how they interact to create a cohesive understanding for users.
lightbulbAbout this topic
Multimedia semantics is the study of meaning and interpretation in multimedia content, encompassing the integration of text, audio, images, and video. It focuses on how these elements convey information, emotions, and context, and how they interact to create a cohesive understanding for users.

Key research themes

1. How can ontologies and semantic frameworks improve the representation and retrieval of multimedia content?

This theme investigates the design, construction, and application of ontologies and semantic models to capture the meanings embedded in multimedia data. Unlike traditional metadata or low-level descriptors, ontologies provide a formal, structured representation of multimedia semantics, enabling interoperability, advanced querying, and bridging the semantic gap between raw media features and human interpretation. The theme underscores why effective semantic modeling is fundamental to multimedia indexing, search, and content management in distributed and heterogeneous environments.

Key finding: This paper outlines recent efforts to bridge low-level multimedia descriptors with human-understandable semantics via ontologies. It highlights MPEG-7's limitations and advocates ontology development for interoperable... Read more
Key finding: The work proposes COMM, a core multimedia ontology founded on DOLCE, to unify manual and automatic multimedia annotations. It critiques MPEG-7's XML-based approach and demonstrates how ontology-based annotations enable... Read more
Key finding: This survey synthesizes the design decisions in Semantic Web applications for supporting search functionality over semantic data. It explicates how explicit semantic structures underlying multimedia metadata improve query... Read more
Key finding: Although focused on context, this thesis identifies key contextual dimensions that enhance semantic understanding of multimedia beyond low-level features. It complements ontology-based approaches by addressing knowledge... Read more

2. What strategies and models effectively unify multimodal information to compute semantics and sentiments from multimedia content?

This area investigates models, algorithms, and computational frameworks that integrate heterogeneous modal data (e.g., audio, visual, textual) to extract unified semantic and affective information. Given user-generated content's multimodal nature, accurately deriving meaningful semantics and sentiments requires combining features across modalities and contextual metadata. Understanding effective multimodal fusion enhances multimedia summarization, tag relevance, personalized recommendation, and affective computing.

Key finding: This chapter demonstrates leveraging multimodal information (text, audio, visual, gaze) and contextual metadata enables more accurate, comprehensive semantics and sentiment extraction from user-generated content than unimodal... Read more
Key finding: The paper introduces a scalable framework combining taxonomy of image-text relations with journalism-derived news values to interpret multimodal news content. It empirically shows that understanding cross-modal semantic... Read more
Key finding: This research applies systemic functional linguistics and visual grammar to model and analyze student comprehension of image-language relations in multimodal texts. Using empirical test data, it identifies how different types... Read more

3. How can knowledge-driven, semantic-aware methods enhance multimedia segmentation, classification, and retrieval performance?

This research theme focuses on integrating domain knowledge, semantic reasoning, and contextual information to improve multimedia analysis tasks such as segmentation and classification. Traditional low-level feature-based segmentation and classification often face errors due to ambiguous boundaries or visually similar classes. Knowledge-driven approaches leverage semantic-level criteria, spatial and contextual relationships, and attribute-based classifiers to reduce these errors, enabling more accurate semantic labeling and retrieval of multimedia content.

Key finding: The paper presents methodologies that incorporate high-level semantic knowledge and context to refine initial multimedia segmentation and classification results. It demonstrates that semantic segmentation based on similarity... Read more
Key finding: Closely related to the former paper, this chapter further articulates the interaction between multimedia processing and knowledge representation. It details using contextual information such as spatial relations and... Read more
Key finding: This work introduces the Meta-PGN classifier that extends attribute feature spaces with metadata to bridge the gap between low-level feature regularities and high-level human semantic concepts. Applied in art painting... Read more

All papers in Multimedia semantics

In recent years several Semantic Web applications have been developed that support some form of search. These applications provide different types of search functionality and make use of the explicit semantics present in the data in... more
Internet-based e-Learning has experienced a boom and bust situation in the past 10 years [32]. To bring in new forces to knowledge-oriented e-Learning, this paper addresses the semantic integration issue of multi-media resources and... more
Video annotation tools are often compared in the literature, however, most reviews mix unstructured, semi-structured, and the very few structured annotation software. This paper is a comprehensive review of video annotations tools... more
In recent years several Semantic Web applications have been developed that support some form of search. These applications provide different types of search functionality and make use of the explicit semantics present in the data in... more
In this chapter a first attempt will be made to examine how the coupling of multimedia processing and knowledge representation techniques, presented separately in previous chapters, can improve analysis. No formal reasoning techniques... more
The present chapter investigates content authentication strategies and their use in media practice. Remarkable research progress has been conducted on media veracity methods and algorithms, however, without providing that much... more
Internet-based e-Learning has experienced a boom and bust situation in the past 10 years . To bring in new forces to knowledge-oriented e-Learning, this paper addresses the semantic integration issue of multimedia resources and learning... more
Abstract. An approach for extracting higher-level visual features for art painting classification based on MPEG-7 descriptors is presented in this paper. The MPEG-7 descriptors give a good presentation of different types of visual... more
Download research papers for free!