Academia.eduAcademia.edu

Multimedia semantics

description40 papers
group0 followers
lightbulbAbout this topic
Multimedia semantics is the study of meaning and interpretation in multimedia content, encompassing the integration of text, audio, images, and video. It focuses on how these elements convey information, emotions, and context, and how they interact to create a cohesive understanding for users.
lightbulbAbout this topic
Multimedia semantics is the study of meaning and interpretation in multimedia content, encompassing the integration of text, audio, images, and video. It focuses on how these elements convey information, emotions, and context, and how they interact to create a cohesive understanding for users.

Key research themes

1. How can ontologies and semantic frameworks improve the representation and retrieval of multimedia content?

This theme investigates the design, construction, and application of ontologies and semantic models to capture the meanings embedded in multimedia data. Unlike traditional metadata or low-level descriptors, ontologies provide a formal, structured representation of multimedia semantics, enabling interoperability, advanced querying, and bridging the semantic gap between raw media features and human interpretation. The theme underscores why effective semantic modeling is fundamental to multimedia indexing, search, and content management in distributed and heterogeneous environments.

Key finding: This paper outlines recent efforts to bridge low-level multimedia descriptors with human-understandable semantics via ontologies. It highlights MPEG-7's limitations and advocates ontology development for interoperable... Read more
Key finding: The work proposes COMM, a core multimedia ontology founded on DOLCE, to unify manual and automatic multimedia annotations. It critiques MPEG-7's XML-based approach and demonstrates how ontology-based annotations enable... Read more
Key finding: This survey synthesizes the design decisions in Semantic Web applications for supporting search functionality over semantic data. It explicates how explicit semantic structures underlying multimedia metadata improve query... Read more
Key finding: Although focused on context, this thesis identifies key contextual dimensions that enhance semantic understanding of multimedia beyond low-level features. It complements ontology-based approaches by addressing knowledge... Read more

2. What strategies and models effectively unify multimodal information to compute semantics and sentiments from multimedia content?

This area investigates models, algorithms, and computational frameworks that integrate heterogeneous modal data (e.g., audio, visual, textual) to extract unified semantic and affective information. Given user-generated content's multimodal nature, accurately deriving meaningful semantics and sentiments requires combining features across modalities and contextual metadata. Understanding effective multimodal fusion enhances multimedia summarization, tag relevance, personalized recommendation, and affective computing.

Key finding: This chapter demonstrates leveraging multimodal information (text, audio, visual, gaze) and contextual metadata enables more accurate, comprehensive semantics and sentiment extraction from user-generated content than unimodal... Read more
Key finding: The paper introduces a scalable framework combining taxonomy of image-text relations with journalism-derived news values to interpret multimodal news content. It empirically shows that understanding cross-modal semantic... Read more
Key finding: This research applies systemic functional linguistics and visual grammar to model and analyze student comprehension of image-language relations in multimodal texts. Using empirical test data, it identifies how different types... Read more

3. How can knowledge-driven, semantic-aware methods enhance multimedia segmentation, classification, and retrieval performance?

This research theme focuses on integrating domain knowledge, semantic reasoning, and contextual information to improve multimedia analysis tasks such as segmentation and classification. Traditional low-level feature-based segmentation and classification often face errors due to ambiguous boundaries or visually similar classes. Knowledge-driven approaches leverage semantic-level criteria, spatial and contextual relationships, and attribute-based classifiers to reduce these errors, enabling more accurate semantic labeling and retrieval of multimedia content.

Key finding: The paper presents methodologies that incorporate high-level semantic knowledge and context to refine initial multimedia segmentation and classification results. It demonstrates that semantic segmentation based on similarity... Read more
Key finding: Closely related to the former paper, this chapter further articulates the interaction between multimedia processing and knowledge representation. It details using contextual information such as spatial relations and... Read more
Key finding: This work introduces the Meta-PGN classifier that extends attribute feature spaces with metadata to bridge the gap between low-level feature regularities and high-level human semantic concepts. Applied in art painting... Read more

All papers in Multimedia semantics

This work is subject to copyright. Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee, provided that the copies are not made or distrib-uted for profit or commercial... more
Web content nowadays can also be accessed through new generation of Internet connected TVs. However, these pro ducts failed to change users' behavior when consuming online content. Users still prefer personal computers to access Web... more
www.ijaret.com © IJARET All Rights Reserved Context-Aware Video Annotation Using Linked Data and Search Educational Video Resources for Supporting Distance Learning IPrasad V. Phalle, IISuresh K. Shirgave, IIIG. A. Patil I,IIIDept. of... more
Cyber-situational awareness is crucial to applications such as network monitoring and management, vulnerability assessment, and defense. To gain improved cyber-situational awareness, analysts can benefit from automated reasoning-based... more
Due to the volume, variety, and veracity of network data available, information fusion and reasoning techniques are needed to support network analysts' cyber-situational awareness. These techniques rely on formal knowledge representation... more
An approach for extracting higher-level visual features for art painting classification based on MPEG-7 descriptors is presented in this paper. The MPEG-7 descriptors give a good presentation of different types of visual features, but are... more
Research focuses on the perceptions of engineers towards highway projects contract types' (CTs) performance and their respective selection criteria (SC). A questionnaire survey evaluated the CTs against the selected criteria. The SPSS... more
It has been recently argued that it is rather beneficial to cultural institutions to provide their datasets as Linked Open Data, to achieve cross-referencing, interlinking, and integration with other datasets in the LOD cloud. In this... more
The evolution of the World Wide Web, increase in processing power, and more network bandwidth have contributed to the proliferation of digital multimedia data. Since multimedia data has become a critical resource in many organisations,... more
An approach for extracting higher-level visual features for art painting classification based on MPEG-7 descriptors is presented in this paper. The MPEG-7 descriptors give a good presentation of different types of visual features, but are... more
In the context of the European project “GIOCOnDa”, this paper describes the conversion process from Open Data to Linked Open Data (LOD) and its implementation in the GIOCOnDa LOD platform. The platform contains a number of conversion... more
This article addresses the interoperability between the semantic learning platforms and the educational resources banks, more precisely between the LOM and MPEG-7 standards. LOM is a set of metadata associated with e-learning content,... more
The present chapter investigates content authentication strategies and their use in media practice. Remarkable research progress has been conducted on media veracity methods and algorithms, however, without providing that much... more
A typical software engineer spends a significant amount of time and effort reading technical manuals to find answers to questions especially those related to features, versions, compatibilities and dependencies of software and hardware... more
An approach for extracting higher-level visual features for art painting classification based on MPEG-7 descriptors is presented in this paper. The MPEG-7 descriptors give a good presentation of different types of visual features, but are... more
by R. Gil
In order to make a Semantic Web dataset more usable to a wider range of users, specially Linked Data ones, Rhizomer constitutes a tool for data publishing in the web that in addition to common data browsing mechanisms based on HTML... more
Wikidata is the central data management platform of Wikipedia. By the efforts of thousands of volunteers, the project has produced a large, open knowledge base with many interesting applications. The data is highly interlinked and... more
A typical software engineer spends a significant amount of time and effort reading technical manuals to find answers to questions especially those related to features, versions, compatibilities and dependencies of software and hardware... more
In this paper, we present one approach for extending the learning set of a classification algorithm with additional metadata. It is used as a base for giving appropriate names to found regularities. The analysis of correspondence between... more
An approach for extracting higher-level visual features for art painting classification based on MPEG-7 descriptors is presented in this paper. The MPEG-7 descriptors give a good presentation of different types of visual features, but are... more
In this paper, we present the tools standardized by MPEG-7 for describing the semantics of multimedia. In particular, we focus on the Abstraction Model, entities, attributes and relations of MPEG-7 semantic descriptions. MPEG-7 tools can... more
This work is subject to copyright. Permission to make digital or hard copies of portions of this work for personal or classroom use is granted without fee, provided that the copies are not made or distrib-uted for profit or commercial... more
Scientists always look for the most accurate and relevant answers to their queries in the literature. Traditional scholarly digital libraries list documents in search results, and therefore are unable to provide precise answers to search... more
The difficulty of finding relevant information in the Web is increasing as web repositories grow in size. We propose a novel approach for navigation in the Semantic Web, which helps users find relevant information and enables them to... more
The annotation of documents and web pages with semantic metatdata is an activity that can greatly increase the accuracy of Infor-mation Retrieval and Personalization systems, but the growing amount of text data available is too large for... more
Tangible Chain of Custody (CoC) in cyber forensics (CF) is a document accompanying digital evidences. It records all information related to the evidences at each phase of the forensics investigation process in order to improve and... more
The quantity of data published on the Web according to principles of Linked Data is increasing intensely. However, this data is still largely limited to be used up by domain professionals and users who understand Linked Data technologies.... more
In this paper, we present the tools standardized by MPEG-7 for describing the semantics of multimedia. In particular, we focus on the Abstraction Model, entities, attributes and relations of MPEG-7 semantic descriptions. MPEG-7 tools can... more
In this paper a context-based algorithm to semantically annotate e-learning contents is presented. This algorithm explores the DBpedia graph and uses both syntactic and semantic analysis techniques to identify the RDF triples which... more
Multimedia does not exhibit a unique semantics but multiple semantics that are influenced by many factors. Current approaches and systems lack from considering this problem in its entirety. What is needed is a holistic approach that... more
The present chapter investigates content authentication strategies and their use in media practice. Remarkable research progress has been conducted on media veracity methods and algorithms, however, without providing that much... more
Metadata vocabularies are used in various domains of study. It provides an in-depth description of the resources. In this work, we develop Algorithm Metadata Vocabulary (AMV), a vocabulary for capturing and storing the metadata about the... more
Within clinical, biomedical, and translational science, an increasing number of projects are adopting graphs for knowledge representation. Graph-based data models elucidate the interconnectedness between core biomedical concepts, enable... more
The Internet is nowadays a fantastic source of information thanks to the quantity of the information it provides and its dynamicity. However, these features also represent challenges when we want to consider trustworthy information only.... more
In this work we present an approach to capture the total semantics in multimedia-multimodal web pages. Our research improves upon the state-ofthe-art with two key features: (1) capturing the semantics of text and imagebased media for... more
Note: The RO-Crate JSON-LD context and JSON-LD examples within this specification are distributed under CC0 1.0 Universal (CC0 1.0) Public Domain Dedication.
The Linked Data movement with the aims of publishing and interconnecting machine readable data has originated in the last decade. Although the set of (open) data sources is rapidly growing, the integration of multimedia in this Web of... more
The popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their web context and... more
Community Coordinated Multimedia (CCM) provides an extended and enhanced human experience by collaboratively consuming electronic and networked content and multimedia-intensive services. Community coordinated multimedia vision raises the... more
We describe the design and ongoing update of a knowledge graph and its assorted ontologies which describe beverages and their commercial availability as products. Previous approaches have focused on beverage types or brands with limited... more
Ontology is an important artifact of Semantic Web applications. Today, there are an enormous number of ontologies available on the Web. Even so, finding and identifying the right ontology is not easy. This is because the majority of... more
In this research, we investigate the problem of ontology construction in both automatic and semi-automatic approaches. There are two key issues for the ontology construction process: the cold start problem (i.e. starting the development... more
Ontologies are at the heart of the semantic web, i.e. making data published on the web comprehensible to intelligent added value services. Ontologies consensual design ensures its usefulness and wide acceptance by service developers. The... more
In this demo paper we present how the SIVA Suite can be used as a multimedia help system for technical applications in SMEs. After describing our use case, a mechanics scenario, we show how our software was extended to fit all... more
Mass media (e.g., TV) and social media (e.g., Facebook) have a large utilization nowadays; they are becoming an integral part of our life. This chapter describes the psychological effects of media bias and manipulation, along its impact... more
The study of audio-visual rhetorics of affect scientifically analyses the impact of auditory and visual staging patterns on the perception of media productions as well as the conveyed emotions. In the AdA-project, together with film... more
Download research papers for free!