Academia.eduAcademia.edu

Multimedia Information Indexing and Retrieval

description24 papers
group8 followers
lightbulbAbout this topic
Multimedia Information Indexing and Retrieval is the study of methods and systems for organizing, categorizing, and retrieving diverse forms of media content, such as text, images, audio, and video. It involves the development of algorithms and frameworks to enhance the efficiency and effectiveness of accessing multimedia data based on user queries and content characteristics.
lightbulbAbout this topic
Multimedia Information Indexing and Retrieval is the study of methods and systems for organizing, categorizing, and retrieving diverse forms of media content, such as text, images, audio, and video. It involves the development of algorithms and frameworks to enhance the efficiency and effectiveness of accessing multimedia data based on user queries and content characteristics.

Key research themes

1. How can integrated multimodal and fuzzy ontology-based frameworks enhance semantic multimedia indexing and retrieval?

This research theme focuses on leveraging semantic knowledge, contextual information, and fuzzy ontologies to improve the performance and semantic accuracy of multimedia indexing and retrieval systems. The aim is to bridge the semantic gap by modeling relationships among semantic concepts and utilizing reasoning engines to refine detection and annotation processes. Such approaches address the complexity of interpreting multimedia content by encompassing context and hierarchical semantic concepts.

Key finding: Proposes a fuzzy ontology framework that integrates contextual annotation and a fuzzy abduction engine to represent and infer fuzzy relationships among semantic concepts, improving semantic concept detection in video... Read more
Key finding: Describes a semantic indexing system combining low-level visual and audio features with semantic concept relationships embedded in the LSCOM Ontology, using multimodal fuzzy fusion and both deduction and abduction reasoning... Read more
Key finding: Introduces an ontology-based hierarchical image annotation framework that constructs a fuzzy ontology from the learning dataset to efficiently reduce concept detector complexity and computational cost. The reasoning engine... Read more

2. What are effective feature integration methods for improving content-based image retrieval (CBIR) in multimedia indexing?

This line of inquiry investigates the extraction, representation, and combination of low-level visual features such as color, texture, and shape to create robust descriptors for image indexing and retrieval. It includes studies on multi-feature integration strategies and new entropy-based or spatially-aware features to improve accuracy and retrieval speed in CBIR systems. Evaluations often employ benchmark datasets and analyze feature effectiveness relative to specific image categories.

Key finding: Proposes combining primitive features—color, texture, and shape—using image mining techniques to convert low-level attributes into high-level integral descriptors. The study shows that integrated features (color-texture,... Read more
Key finding: Introduces a novel entropy function as a measure of information content for color and texture features, combined with dominant color descriptors to form low-dimensional feature vectors. This approach accelerates query... Read more
Key finding: Presents the color correlogram as a new feature capturing the spatial correlation between colors in images, which robustly tolerates large appearance variations. Experimental results on a large image database show that color... Read more
Key finding: Develops a technique to evaluate and select MPEG-7 image features based on statistical characteristics of specific image datasets. User studies validate that tailoring the choice of features (e.g., Scalable Color, Dominant... Read more

3. How can multimedia retrieval systems effectively exploit both structured metadata and content-based approaches to bridge the semantic gap?

This research theme addresses the combination of text-based, semantic-based, and content-based retrieval strategies to manage multimedia content, aiming to overcome the semantic gap inherent in pure low-level feature approaches. It explores methods such as natural language query interfaces, semantic data fusion, structural XML node considerations, and text extraction from video frames, to enhance the precision and relevance of multimedia indexing and search results.

Key finding: Develops a system enabling natural language queries interpreted into formal retrieval representations for multimedia documents comprising text, tables, and images. Uses fuzzy data fusion and dedicated high-speed parallel... Read more
Key finding: Demonstrates that combining multiple low-level visual features—color histogram, texture, shape—with dynamic programming for similarity improves video retrieval accuracy. The system stores videos in an Oracle 9i database and... Read more
Key finding: Reviews techniques for extracting text from images and videos through detection, localization, enhancement, and OCR, highlighting the importance of textual information overlayed or embedded in multimedia for indexing and... Read more
Key finding: Proposes a novel similarity metric for multimedia retrieval within XML structured documents based on geometric distances between XML nodes accounting for kinship and proximity relations. Evaluation on INEX 2007 shows that... Read more

All papers in Multimedia Information Indexing and Retrieval

The image retrieval and semantic extraction play an important role in the multimedia systems such as geographic information system, hospital information system, digital library system, etc. Therefore, the research and development of... more
The image retrieval and semantic extraction play an important role in the multimedia systems such as geographic information system, hospital information system, digital library system, etc. Therefore, the research and development of... more
The image retrieval and semantic extraction play an important role in the multimedia systems such as geographic information system, hospital information system, digital library system, etc. Therefore, the research and development of... more
http://www.regim.org Description of Submitted Runs Semantic Indexing Regim_4: The indexing process is based on the visual modality analysis and relationships within LSCOM Ontology to improve the detection of large set of semantic... more
In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The contentbased retrieval process integrates the search on different multimedia... more
In this paper a technique for evaluating the effectiveness of MPEG-7 image features on specific image data sets is proposed. It is based on well defined statistical characteristics. The aim is to improve the effectiveness of the image... more
Most existing content-based video retrieval (CBVR) systems are now amenable to support automatic low-level feature extraction, but they still have limited effectiveness from a user's perspective because of the semantic gap. Automatic... more
In this paper, we describe our participation in the Image-CLEF 2015 Scalable Concept Image Annotation task. In this participation, we display our approach for an automatic image annotation by the use of an ontology-based semantic... more
In this paper a technique for evaluating the effectiveness of MPEG-7 image features on specific image data sets is proposed. It is based on well defined statistical characteristics. The aim is to improve the effectiveness of the image... more
In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The content-based retrieval process integrates the search on different multimedia... more
The ImageCLEF 2013 Scalable Concept Image Annotation Subtask was the second edition of a challenge aimed at developing more scalable image annotation systems. Unlike traditional image annotation challenges, which rely on a set of manually... more
In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The contentbased retrieval process integrates the search on different multimedia... more
The amount of audio-visual information has increased dramatically with the advent of High Speed Internet. Furthermore, technological advances in recent years in the field of information technology, have simplified the use of video data in... more
Semantic multimedia organization is an open challenge. In this chapter, we present an innovative way of automatically organizing multimedia information to facilitate content-based browsing. It is based on self-organizing maps. The... more
La visita a musei o a luoghi di interesse di città d'ar-te può essere completamente reinventata attraverso modalità di fruizione moderne e dinamiche, basa-te su tecnologie di riconoscimento e localizzazione visuale, ricerca per immagini e... more
In this paper a technique for evaluating the effectiveness of MPEG-7 image features on specific image data sets is proposed. It is based on well defined statistical characteristics. The aim is to improve the effectiveness of the image... more
In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The content-based retrieval process integrates the search on different multimedia... more
—Audio Event Detection (AED) aims to recognize sounds within audio and video recordings. AED employs machine learning algorithms commonly trained and tested on annotated datasets. However, available datasets are limited in number of... more
In this paper, we consider the automatic identification of video shots that are relevant to a given semantic concept from large video databases. We apply a method of representing semantic concepts as class models on a set of parallel... more
In this paper, we propose a semantic indexing system for reducing the semantic gap between the machine and human interpretations on a video document by generating a finer indexing quality. To do so, data fusion of analyzed interpretation... more
Multimedia indexing systems based on semantic concept detectors are incomplete in the semantic sense. We can improve the effectiveness of these systems by using knowledge-based approaches which utilize semantic knowledge. In this paper,... more
A video retrieval system user hopes to find relevant information when the pro- posed queries are ambiguous. The retrieval process based on detecting concepts remains ineffective in such a situation. Potential relationships between... more
In this paper, we describe our participation in the Image- CLEF 2015 Scalable Concept Image Annotation task. In this participa- tion, we display our approach for an automatic image annotation by the use of an ontology-based semantic... more
This paper introduces a novel approach of video annota- tion by the use of context-based assistance for the annotator. The notion of context plays, actually, a significant role in the multimedia content search and retrieval systems. In... more
Multimedia indexing systems based on semantic concept detectors are incomplete in the semantic sense. We can improve the effectiveness of these systems by using knowledge-based approaches which utilize semantic knowledge. In this paper,... more
In this paper, we propose a semantic indexing system for reducing the semantic gap between the machine and human interpretations on a video document by generating a finer indexing quality. To do so, data fusion of analyzed interpretation... more
In this paper, we describe an overview of a software platform that has been developed within REGIMVid project for TRECVID 2010 video retrieval experiments. The REGIMVID team participated in Semantic Indexing task. In TRECVID 2010, we... more
This paper proposes a learning method for Beta fuzzy systems (BFS) based on a multiagent genetic algo-rithm. This method, called Multi-Agent Genetic Algorithm for the Design of BFS has two advantages. First, thanks to genetic algorithms... more
Abstract Traditional Content Based Multimedia Retrieval (CBMR) systems measure the relevance of visual samples using a binary scale (Relevant/Non Relevant). However, a picture can be relevant to a semantic category with different degrees,... more
Abstract Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story segment, extraction of speech segments from video,... more
Abstract—In this paper, we propose to organize web images by actively creating visual clusters via crowdsourcing. We develop a two-phase framework to efficiently and effectively combine computers and a large number of human workers to... more
Download research papers for free!