Multimedia Information Indexing and Retrieval

description24 papers

group8 followers

lightbulbAbout this topic

Multimedia Information Indexing and Retrieval is the study of methods and systems for organizing, categorizing, and retrieving diverse forms of media content, such as text, images, audio, and video. It involves the development of algorithms and frameworks to enhance the efficiency and effectiveness of accessing multimedia data based on user queries and content characteristics.

lightbulbAbout this topic

Key research themes

1. How can integrated multimodal and fuzzy ontology-based frameworks enhance semantic multimedia indexing and retrieval?

This research theme focuses on leveraging semantic knowledge, contextual information, and fuzzy ontologies to improve the performance and semantic accuracy of multimedia indexing and retrieval systems. The aim is to bridge the semantic gap by modeling relationships among semantic concepts and utilizing reasoning engines to refine detection and annotation processes. Such approaches address the complexity of interpreting multimedia content by encompassing context and hierarchical semantic concepts.

A fuzzy ontology

by Adel M. Alimi

2016, Proceedings of the Eleventh International Workshop on Multimedia Data Mining - MDMKDD '11

Key finding: Proposes a fuzzy ontology framework that integrates contextual annotation and a fuzzy abduction engine to represent and infer fuzzy relationships among semantic concepts, improving semantic concept detection in video... Read more

articleView Paper downloadDownload

Regimvid at trecvid 2010: Semantic indexing

by Nizar Elleuch

2023

Key finding: Describes a semantic indexing system combining low-level visual and audio features with semantic concept relationships embedded in the LSCOM Ontology, using multimodal fuzzy fusion and both deduction and abduction reasoning... Read more

articleView Paper downloadDownload

Regimvid at ImageCLEF 2015 Scalable Concept Image Annotation Task: Ontology based Hierarchical Image Annotation

by Anis Benammar

2022

Key finding: Introduces an ontology-based hierarchical image annotation framework that constructs a fuzzy ontology from the learning dataset to efficiently reduce concept detector complexity and computational cost. The reasoning engine... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What are effective feature integration methods for improving content-based image retrieval (CBIR) in multimedia indexing?

This line of inquiry investigates the extraction, representation, and combination of low-level visual features such as color, texture, and shape to create robust descriptors for image indexing and retrieval. It includes studies on multi-feature integration strategies and new entropy-based or spatially-aware features to improve accuracy and retrieval speed in CBIR systems. Evaluations often employ benchmark datasets and analyze feature effectiveness relative to specific image categories.

Feature Integration for Image Information RetrievalUsing Image Mining Techniques

by Dr Madhubala Myneni

2023

Key finding: Proposes combining primitive features—color, texture, and shape—using image mining techniques to convert low-level attributes into high-level integral descriptors. The study shows that integrated features (color-texture,... Read more

articleView Paper downloadDownload

Content-based Image Retrieval by Information Theoretic Measure

by M. Hanmandlu

2021, Defence Science Journal

Key finding: Introduces a novel entropy function as a measure of information content for color and texture features, combined with dominant color descriptors to form low-dimensional feature vectors. This approach accelerates query... Read more

articleView Paper downloadDownload

Image indexing using color correlograms

by Ravi Kumar

2023, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Key finding: Presents the color correlogram as a new feature capturing the spatial correlation between colors in images, which robustly tolerates large appearance variations. Experimental results on a large image database show that color... Read more

articleView Paper downloadDownload

Selection of MPEG-7 image features for improving image similarity search on specific data sets

by Fabrizio Falchi

2022, 7-th IASTED International Conference on Computer Graphics and Imaging, CGIM

Key finding: Develops a technique to evaluate and select MPEG-7 image features based on statistical characteristics of specific image datasets. User studies validate that tailoring the choice of features (e.g., Scalable Color, Dominant... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can multimedia retrieval systems effectively exploit both structured metadata and content-based approaches to bridge the semantic gap?

This research theme addresses the combination of text-based, semantic-based, and content-based retrieval strategies to manage multimedia content, aiming to overcome the semantic gap inherent in pure low-level feature approaches. It explores methods such as natural language query interfaces, semantic data fusion, structural XML node considerations, and text extraction from video frames, to enhance the precision and relevance of multimedia indexing and search results.

An Integrated Approach to Semantic Evaluation and Content-Based Retrieval of Multimedia Documents

by Ingo Glöckner

2021

Key finding: Develops a system enabling natural language queries interpreted into formal retrieval representations for multimedia documents comprising text, tables, and images. Uses fuzzy data fusion and dedicated high-speed parallel... Read more

articleView Paper downloadDownload

CONTENT BASED VIDEO RETRIEVAL

by B. Patel and

2015

Key finding: Demonstrates that combining multiple low-level visual features—color histogram, texture, shape—with dynamic programming for similarity improves video retrieval accuracy. The system stores videos in an Oracle 9i database and... Read more

articleView Paper downloadDownload

Text Based Approach For Indexing And Retrieval Of Image And Video: A Review

by Avinash N Bhute

2016

Key finding: Reviews techniques for extracting text from images and videos through detection, localization, enhancement, and OCR, highlighting the importance of textual information overlayed or embedded in multimedia for indexing and... Read more

articleView Paper downloadDownload

A new metric for multimedia retrieval in structured documents

by Walid MAHDI

2023

Key finding: Proposes a novel similarity metric for multimedia retrieval within XML structured documents based on geometric distances between XML nodes accounting for kinship and proximity relations. Evaluation on INEX 2007 shows that... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Multimedia Information Indexing and Retrieval

A Self-Balanced Clustering Tree for Semantic-Based Image Retrieval

by Uyển Nhi

2024, Journal of Computer Science and Cybernetics

The image retrieval and semantic extraction play an important role in the multimedia systems such as geographic information system, hospital information system, digital library system, etc. Therefore, the research and development of... more

descriptionView Paper arrow_downwardDownload

A Self-Balanced Clustering Tree for Semantic-Based Image Retrieval

by Nguyễn Phạm Uyên Nhi

2024, Journal of Computer Science and Cybernetics

descriptionView Paper arrow_downwardDownload

A Self-Balanced Clustering Tree for Semantic-Based Image Retrieval

by 22611091 Nguyen Thi Thao Nhi

2023, Journal of Computer Science and Cybernetics

descriptionView Paper arrow_downwardDownload

Regimvid at trecvid 2010: Semantic indexing

by Nizar Elleuch

2023

http://www.regim.org Description of Submitted Runs Semantic Indexing Regim_4: The indexing process is based on the visual modality analysis and relationships within LSCOM Ontology to improve the detection of large set of semantic... more

descriptionView Paper arrow_downwardDownload

Improving image similarity search effectiveness in a multimedia content management system

by Fabrizio Falchi

2022, Proc. of workshop on multimedia information system (MIS)

In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The contentbased retrieval process integrates the search on different multimedia... more

descriptionView Paper arrow_downwardDownload

Selection of MPEG-7 image features for improving image similarity search on specific data sets

by Fabrizio Falchi

2022, 7-th IASTED International Conference on Computer Graphics and Imaging, CGIM

In this paper a technique for evaluating the effectiveness of MPEG-7 image features on specific image data sets is proposed. It is based on well defined statistical characteristics. The aim is to improve the effectiveness of the image... more

descriptionView Paper arrow_downwardDownload

Incorporating Concept Ontology for Hierarchical Video Classification, Annotation, and Visualization

by Ramesh Jain

2022, IEEE Transactions on Multimedia

Most existing content-based video retrieval (CBVR) systems are now amenable to support automatic low-level feature extraction, but they still have limited effectiveness from a user's perspective because of the semantic gap. Automatic... more

descriptionView Paper arrow_downwardDownload

Regimvid at ImageCLEF 2015 Scalable Concept Image Annotation Task: Ontology based Hierarchical Image Annotation

by Anis Benammar

2022

In this paper, we describe our participation in the Image-CLEF 2015 Scalable Concept Image Annotation task. In this participation, we display our approach for an automatic image annotation by the use of an ontology-based semantic... more

descriptionView Paper arrow_downwardDownload

Selection of mpeg-7 image features for improving image similarity search on specific data sets

by Peter L Stanchev

2022

descriptionView Paper arrow_downwardDownload

Improving image similarity search effectiveness in a multimedia content management system

by Peter L Stanchev

2022

In this paper, a technique for making more effective the similarity search process of images in a Multimedia Content Management System is proposed. The content-based retrieval process integrates the search on different multimedia... more

descriptionView Paper arrow_downwardDownload

Overview of the imageclef 2013 scalable concept image annotation subtask

by Mauricio Villegas

2021

The ImageCLEF 2013 Scalable Concept Image Annotation Subtask was the second edition of a challenge aimed at developing more scalable image annotation systems. Unlike traditional image annotation challenges, which rely on a set of manually... more

descriptionView Paper arrow_downwardDownload

Improving image similarity search effectiveness in a multimedia content management system

by F. Rabitti

2021, Proc. of workshop on multimedia information system (MIS)

descriptionView Paper arrow_downwardDownload

An interactive engine for multilingual video browsing using semantic content

by SAMI BEN MOUSSA

2021, ArXiv

The amount of audio-visual information has increased dramatically with the advent of High Speed Internet. Furthermore, technological advances in recent years in the field of information technology, have simplified the use of video data in... more

descriptionView Paper arrow_downwardDownload

Organizing multimedia information with maps

by Andreas Nürnberger

2021, Studies in Computational Intelligence

Semantic multimedia organization is an open challenge. In this chapter, we present an innovative way of automatically organizing multimedia information to facilitate content-based browsing. It is based on self-organizing maps. The... more

descriptionView Paper arrow_downwardDownload

Intelligenza Artificiale, Retrieval e Beni Culturali

by Fabrizio Falchi

2019, Ital-IA - Convegno Nazionale CINI sull'Intelligenza Artificiale

La visita a musei o a luoghi di interesse di città d'ar-te può essere completamente reinventata attraverso modalità di fruizione moderne e dinamiche, basa-te su tecnologie di riconoscimento e localizzazione visuale, ricerca per immagini e visualizzazioni in realtà aumentata. Da anni il gruppo di ricerca AI-MIR porta avanti attività di ricerca su queste temati-che ricoprendo anche ruoli di responsabilità in pro-getti nazionali ed internazionali. Questo contributo riassume alcune delle attività di ricerca svolte e del-le tecnologie utilizzate, nonché la partecipazione a progetti che hanno utilizzato tecnologie di intelli-genza artificiale per la valorizzazione e la fruizione del patrimonio culturale. 1 Introduzione Il gruppo di ricerca Artificial Intelligence for Multimedia Information Retrieval (AIMIR) studia soluzioni di intelligen-za artificiale per l'analisi, ricerca e riconoscimento visuale in database di immagini di grandi dimensioni, tramite disposi-tivi mobili, sistemi informativi e motori di ricerca multime-diali. Negli ultimi anni, ha partecipato a numerosi progetti nazionali ed internazionali in ambito Beni Culturali, svilup-pando sistemi che consentono di riconoscere automaticamen-te, a partire da un'immagine, opere d'arte quali quadri, statue , edifici, iscrizioni antiche, effettuarne ricerche visuale su larga scala e visualizzazioni in realtà aumentata. Si consi-derino, ad esempio, il sistema http://art.isti.cnr.it/ capace di riconoscere e fornire informazioni su più di 100 mila quadri, o http://www.eagle-network.eu/image-search/ capace di rico-noscere visivamente iscrizioni antiche, in un database di più di un milione di immagini, anche da dispositivi mobili. Le tecniche sviluppate tengono in considerazione sia le problematiche di accuratezza che di scalabilità, garantendo lo sviluppo di sistemi con tempi di risposta fluidi e natura-li anche in situazioni e contesti dove la quantità di elementi da riconoscere, localizzare visivamente, e rendere aumentati è enorme, come all'interno di musei, o in zone di interesse di importanti città d'arte (piazze storiche, cattedrali, etc.). 2 Attività Scientifica L'attività scientifica portata avanti dal gruppo AIMIR sfrut-ta una sinergia di tecniche di analisi delle immagini, deep learning, strutture dati ed algoritmi di ricerca per similarità scalabili. I prototipi di ricerca sviluppati sono stati applica-ti con successo nell'ambito dei beni culturali, ad esempio, per riconoscere opere d'arte o edifici storici, per accedere ad informazioni in realtà aumentata, e per generare descri-zione automatiche di materiale digitale non adeguatamente annotato. Nell'ambito del riconoscimento visuale sono stati investi-gati sia approcci basati su aggregazioni (per es. BoW, VLAD, FV) di feature locali di immagini (quali SIFT ed ORB), sia feature estratte da reti neurali convoluzionali (CNN feature), che approcci ibridi (quale la combinazione di FV con CNN feature). Gli approcci ibridi basati sulla combinazione di aggregazioni di feature locali e CNN feature, per esempio, hanno mostrato una elevata efficacia nel riconoscimento di iscrizioni antiche [Amato et al., 2016b]. Approcci basati su "hand-crafted" feature e deep learning sono stati studiati ed utilizzati anche per la classifi-cazione automatica, il retrieval di immagini, la localizza-zione visuale ed applicazioni di realtà aumentata [Amato et al., 2015; Bolettieri et al., 2015; Amato et al., 2017b; Amato et al., 2017a]. Inoltre, per poter effettuare ricer-che visuali anche in datatabase di enormi dimensioni, sono state sviluppati innovativi algoritmi di ricerca per similari-tà approssimata [Amato et al., 2014; Amato et al., 2016a; Amato et al., 2018]. 3 Progetti in Ambito Beni Culturali Negli ultimi anni, il gruppo AIMIR ha partecipato a numerosi progetti nazionali ed internazionali su tematiche relative ai beni culturali e all'analisi del contenuto delle immagini per l'estrazione automatica di informazioni che ne permettano la descrizione automatica, il riconoscimento, la classificazione, la ricerca su larga scala, ed il loro accesso in realtà aumentata. Si citano a titolo d'esempio: VISECH-Visual Engines for Cultural Heritage, progetto regionale che ha lo scopo di avanzare lo stato dell'arte nel-l'ambito dell'analisi automatica delle immagini, sviluppando tecniche di riconoscimento e localizzazione visuale per effet-tuare realtà aumentata, mediante algoritmi altamente scala

descriptionView Paper arrow_downwardDownload

Selection of MPEG-7 Image Features for Improving Image Similarity Search on Specific Data Sets

by Fabrizio Falchi

2019, Proceedings of the Seventh IASTED International Conference on Computer Graphics and Imaging (CGIM 2004)

descriptionView Paper arrow_downwardDownload

IMPROVING IMAGE SIMILARITY SEARCH EFFECTIVENESS IN A MULTIMEDIA CONTENT MANAGEMENT SYSTEM

by Fabrizio Falchi

2019, MIS 2004: proceedings of the 10th Workshop on Multimedia Information Systems

descriptionView Paper arrow_downwardDownload

An Approach for Self-Training Audio Event Detectors Using Web Data

by Ankit Shah and

2017

—Audio Event Detection (AED) aims to recognize sounds within audio and video recordings. AED employs machine learning algorithms commonly trained and tested on annotated datasets. However, available datasets are limited in number of... more

descriptionView Paper arrow_downwardDownload

Semantic Concept Detection from News Videos with Self-Organizing Maps

by Jorma Laaksonen

2016, IFIP International Federation for Information Processing

In this paper, we consider the automatic identification of video shots that are relevant to a given semantic concept from large video databases. We apply a method of representing semantic concepts as class models on a set of parallel... more

Fig. 1. A hierarchical view on video data and associated multimodal feature indices.

Fig. 2. Stages in creating a class model from the very-high-dimensional pattern space
through the high-dimensional feature space to the two-dimensional SOM grid.

Fig. 3. An example class model (concept ezplosion/fire on the Color Layout SOM).
Areas occupied by objects of the concept are shown with gray shades.

Table 1. Features used in the experiments for each concept to be detected.

Table 2. Detection results for each concept.

descriptionView Paper arrow_downwardDownload

Multimodal Fuzzy Fusion System for Semantic Video Indexing

by Adel M. Alimi

2016

In this paper, we propose a semantic indexing system for reducing the semantic gap between the machine and human interpretations on a video document by generating a finer indexing quality. To do so, data fusion of analyzed interpretation... more

descriptionView Paper arrow_downwardDownload

A fuzzy ontology

by Adel M. Alimi

2016, Proceedings of the Eleventh International Workshop on Multimedia Data Mining - MDMKDD '11

Multimedia indexing systems based on semantic concept detectors are incomplete in the semantic sense. We can improve the effectiveness of these systems by using knowledge-based approaches which utilize semantic knowledge. In this paper,... more

descriptionView Paper arrow_downwardDownload

REGIMVID @ TRECVID 2010 Presentation

by Mohamed Zarka

2015

descriptionView Paper arrow_downwardDownload

Fuzzy reasoning framework to improve semantic video interpretation

by Mohamed Zarka and

2015

A video retrieval system user hopes to find relevant information when the pro- posed queries are ambiguous. The retrieval process based on detecting concepts remains ineffective in such a situation. Potential relationships between... more

descriptionView Paper arrow_downwardDownload

Regimvid at ImageCLEF 2015 Scalable Concept Image Annotation Task: Ontology based Hierarchical Image Annotation

by Mohamed Zarka

2015

In this paper, we describe our participation in the Image- CLEF 2015 Scalable Concept Image Annotation task. In this participa- tion, we display our approach for an automatic image annotation by the use of an ontology-based semantic... more

descriptionView Paper arrow_downwardDownload

Toward an Assisted Context Based Collaborative Annotation

by Mohamed Zarka and

2015

This paper introduces a novel approach of video annota- tion by the use of context-based assistance for the annotator. The notion of context plays, actually, a significant role in the multimedia content search and retrieval systems. In... more

descriptionView Paper arrow_downwardDownload

A Fuzzy Ontology – Based Framework for Reasoning in Visual Video Content Analysis and Indexing

by Mohamed Zarka

2015

Figure 2. A multi-label context annotation. The annotation process can be treated as an image in the contextual domain Based on expert observations, the knowledge gained still remains uncertain. To overcome this, the appropriate solution is to incorporate fuzzy theory in ontology. Thus, integrating a fuzzy ontology to represent the uncertain contextual information allows the video indexing systems to handle the uncertainty of knowledge and enrich the semantic interpretations. These systems have to deal with two basic problems: how to build and represent the knowledge, and how to integrate context-concept information in video analysis to improve its effectiveness.

Figure 3. Two Fuzzy feta function to represent Of qualifiers.

Figure 4. Semantic Enhancer based on Deduction Engine. 3.2 Semantic Concept and C ontext Categorization

ea i eee A BS LR REE OE In order to discover the fuzzy rules relating to the roles “Includes”, “IsPartOf” and “IsRelatedTo”, the abduction engine is trained based on the semantic knowledge. Thus, for every output of the above roles, feature vectors are firstly generated. A feature vector is a string of numerical values whose dimension is n + m that correspond to the number of concepts and contexts. A 1 or 0.5 or 0, at i® position, indicates, respectively, whether the i" concept or context is “Relevant” (1), “Not-Relevant” (0.5) or “Not-Exist” (0) for the expected output. Then, the abduction engine is consecutively learned and provides fuzzy rules by estimating the degree of confidence o and the Beta membership function up, as shown in Table II.

Table 1. Semantic Relationships between C oncepts and C ontexts

Table 2. A Partial view of the abducted Fuzzy rules Table 3. Concept retrieval performance (Inferred A verage Precision infAP, Precision P and Recall R ) for different Concept detection methodologies applied on TREC VID 2010 data set.

descriptionView Paper arrow_downwardDownload

Multimodal Fuzzy Fusion System for Semantic Video Indexing

by Mohamed Zarka and

2015

descriptionView Paper arrow_downwardDownload

REGIMVID at TRECVID 2010: Semantic Indexing

by Mohamed Zarka and

2015

In this paper, we describe an overview of a software platform that has been developed within REGIMVid project for TRECVID 2010 video retrieval experiments. The REGIMVID team participated in Semantic Indexing task. In TRECVID 2010, we... more

descriptionView Paper arrow_downwardDownload

MAGAD-BFS: A Learning Method for Beta Fuzzy Systems Based on a Multi-Agent Genetic Algorithm

by Ilhem Kallel

2015, Soft Computing-A Fusion of Foundations, …

This paper proposes a learning method for Beta fuzzy systems (BFS) based on a multiagent genetic algo-rithm. This method, called Multi-Agent Genetic Algorithm for the Design of BFS has two advantages. First, thanks to genetic algorithms... more

descriptionView Paper arrow_downwardDownload

A Multimedia Retrieval Framework Based on Automatic Graded Relevance Judgments

by Miriam Redi

2013

Abstract Traditional Content Based Multimedia Retrieval (CBMR) systems measure the relevance of visual samples using a binary scale (Relevant/Non Relevant). However, a picture can be relevant to a semantic category with different degrees,... more

descriptionView Paper arrow_downwardDownload

Mixed type audio classification with support vector machine

by Lei CHEN

2013

Abstract Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story segment, extraction of speech segments from video,... more

descriptionView Paper arrow_downwardDownload

Web Image Organization and Object Discovery by Actively Creating Visual Clusters through Crowdsourcing

by Gang Wang

2013

Abstract—In this paper, we propose to organize web images by actively creating visual clusters via crowdsourcing. We develop a two-phase framework to efficiently and effectively combine computers and a large number of human workers to... more

descriptionView Paper arrow_downwardDownload

Multimedia Information Indexing and Retrieval

Key research themes

1. How can integrated multimodal and fuzzy ontology-based frameworks enhance semantic multimedia indexing and retrieval?

2. What are effective feature integration methods for improving content-based image retrieval (CBIR) in multimedia indexing?

3. How can multimedia retrieval systems effectively exploit both structured metadata and content-based approaches to bridge the semantic gap?

Related Topics

All papers in Multimedia Information Indexing and Retrieval