Multimedia Retrieval

591 papers

38 followers

About this topic

Multimedia Retrieval is the process of searching, accessing, and retrieving information from various types of media, including text, images, audio, and video. It involves the use of algorithms and techniques to index, query, and manage multimedia content, enabling efficient information retrieval based on user queries and preferences.

Key research themes

1. How can fusion of textual and visual information improve multimedia retrieval performance and semantic understanding?

This research theme investigates approaches that combine textual metadata, natural language queries, and visual features such as color, texture, and high-level semantic concepts to enhance multimedia retrieval accuracy and semantic understanding. It addresses the persistent semantic gap by mapping between low-level visual features and high-level textual or conceptual descriptions, enabling more effective retrieval of relevant multimedia content. This fusion leverages complementary strengths of each modality—text for semantic richness and visual features for specificity.
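As a rough illustration of the fusion idea described above, the following Python sketch combines per-document scores from a text ranker and a visual ranker by weighted-sum late fusion. It is a minimal sketch under stated assumptions, not the method of any paper listed here: the function names, the min-max normalization, and the default `text_weight` of 0.6 are all hypothetical choices made for the example.

```python
def min_max_normalize(scores):
    """Rescale a {doc_id: score} map to [0, 1]; a constant map becomes all zeros."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def late_fusion(text_scores, visual_scores, text_weight=0.6):
    """Weighted-sum late fusion of two per-modality score maps (assumed scheme).

    A document missing from one modality contributes 0 for that modality.
    Returns (doc_id, fused_score) pairs, best first.
    """
    t = min_max_normalize(text_scores)
    v = min_max_normalize(visual_scores)
    fused = {
        doc: text_weight * t.get(doc, 0.0) + (1.0 - text_weight) * v.get(doc, 0.0)
        for doc in set(t) | set(v)
    }
    # Sort by fused score descending; break ties by document id for determinism.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    text = {"a": 2.0, "b": 1.0, "c": 0.0}   # hypothetical text-match scores
    visual = {"a": 0.1, "b": 0.9}           # hypothetical concept-detector scores
    for doc, score in late_fusion(text, visual):
        print(doc, round(score, 3))
```

Normalizing each modality before combining keeps one ranker's score scale from dominating the other; the weight would normally be tuned on held-out queries.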

When textual and visual information join forces for multimedia retrieval

by Benoit Huet

2015

Key finding: This paper demonstrates that combining text-based query information with visual concept detectors via late fusion significantly improves video retrieval performance on real-world datasets. It finds that automatically mapping...

An Integrated Approach to Semantic Evaluation and Content-Based Retrieval of Multimedia Documents

by Ingo Glöckner

2021

Key finding: Presents an architecture (HPQS) integrating natural language query interpretation with semantic analysis and content-based retrieval of multimedia (images, tables, text). It exploits data fusion, caching, high-speed...

Visual Concept Features and Textual Expansion in a Multimodal System for Concept Annotation and Retrieval with Flickr Photos at ImageCLEF2012

by X. Benavent

2022

Key finding: This work extends a multimodal retrieval system by enriching textual features through external query expansion and visual features via logistic regression-based concept detectors. For retrieval, sequential use of textual...

A System Framework for Concept- and Credibility-Based Multimedia Retrieval

by Mihai Lupu

2023, Proceedings of International Conference on Multimedia Retrieval

Key finding: Introduces a multimedia retrieval framework that jointly indexes multi-modal content and incorporates a credibility model (expertise, trustworthiness, quality, reliability) to re-rank results. By integrating concept-based...

Investigating the combination of structural and textual information about multimedia retrieval

by Walid MAHDI

2023, International Journal of Advanced Computer Science and Applications

Key finding: Shows that combining textual and structural features of XML documents using geometric metrics significantly improves multimedia retrieval effectiveness compared to using either modality alone. The approach represents...

2. What advancements in feature representation and dimensionality reduction can enhance content-based multimedia retrieval efficiency and effectiveness?

This research theme focuses on novel representations of image and multimedia features, including combining local and global histograms of visual words, and dimensionality reduction techniques such as principal component analysis (PCA) and kernel PCA. Efficient feature extraction and selection improve retrieval scalability and accuracy by reducing high-dimensional data while preserving salient discriminative information. The exploration includes nonlinear dimension reduction and multilinear kernel mapping to better capture complex data structures and enhance retrieval precision.
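To make the dimensionality-reduction step concrete, here is a small pure-Python sketch of linear PCA that finds the first principal component by power iteration and projects feature vectors onto it. It is only an illustrative sketch: the function names, the fixed random seed, and the iteration count are assumptions for the example, and it covers linear PCA only. Kernel PCA would instead take the eigenvectors of a centered kernel (Gram) matrix rather than of the covariance matrix.

```python
import math
import random


def top_principal_component(rows, iterations=200, seed=0):
    """Return (mean, unit direction) of the first principal component.

    Uses power iteration on the sample covariance matrix.
    Assumes at least two rows, all of the same length.
    """
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - mean[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]
    # Random start vector with a fixed seed, so runs are repeatable.
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # zero variance: keep the current direction
        v = [x / norm for x in w]
    return mean, v


def project(row, mean, direction):
    """Project one feature vector onto the principal direction (d dims -> 1 dim)."""
    return sum((x - m) * u for x, m, u in zip(row, mean, direction))


if __name__ == "__main__":
    features = [[0.0, 0.0], [2.0, 1.0], [4.0, 2.0], [6.0, 3.0]]  # toy data
    mean, direction = top_principal_component(features)
    print([round(project(f, mean, direction), 3) for f in features])
```

The sign of the returned direction is arbitrary, as with any eigenvector, so only the relative ordering and spacing of the projected values is meaningful.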

A Novel Image Retrieval Based on a Combination of Local and Global Histograms of Visual Words

by Muhammad Rashid

2023

Key finding: Proposes representing an image by combining global histograms of visual words over the entire image with local histograms computed over salient object regions (local rectangular areas). Experiments on several benchmark...

Kernel principal component analysis for multimedia retrieval

by Global Journal of Information Technology: Emerging Technologies

2020, Global Journal of Information Technology

Key finding: This study applies kernel PCA, a nonlinear extension of PCA, to extract principal components in a high-dimensional feature space induced by Gaussian kernels for image retrieval. Experimental results indicate that kernel PCA...

Multilinear Kernel Mapping for Feature Dimension Reduction in Content Based Multimedia Retrieval System

by vinoda reddy

2022, The International journal of Multimedia & Its Applications

Key finding: Introduces a multilinear kernel modeling approach to reduce the dimensionality of feature vectors derived from multimedia content. This approach accounts for the interrelation among dataset features more effectively than...

3. How can structural metadata and query modification techniques address semantic challenges in multimedia retrieval systems?

This theme explores methods leveraging document structure (e.g., XML hierarchies) and interactive query adaptation to improve the retrieval of multimedia content. Techniques include geometric metrics exploiting XML node kinship to calculate relevance of multimedia elements in structured documents, addressing the limited descriptive content of multimedia elements themselves. Additionally, user-centric query modification methods, such as segment-based query refinement and intra-query learning, allow efficient alignment of retrieval systems with subjective human perception, reducing the semantic gap without repeated extensive database searches.
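The structural idea above, scoring a multimedia element by the text found near it in the XML tree, can be sketched as follows. This is a hypothetical illustration and not the metric proposed in the papers listed below: the `1 / (1 + distance)` discount, the whitespace tokenization, and all function names are assumptions. It uses only Python's standard `xml.etree.ElementTree` module.

```python
import xml.etree.ElementTree as ET


def build_parent_map(root):
    """Map every element to its parent (ElementTree stores no parent links)."""
    return {child: parent for parent in root.iter() for child in parent}


def path_to_root(node, parents):
    """List of elements from node up to the root, inclusive."""
    path = [node]
    while node in parents:
        node = parents[node]
        path.append(node)
    return path


def tree_distance(a, b, parents):
    """Number of edges on the path between elements a and b."""
    steps_from_a = {id(n): i for i, n in enumerate(path_to_root(a, parents))}
    for j, n in enumerate(path_to_root(b, parents)):
        if id(n) in steps_from_a:
            return steps_from_a[id(n)] + j
    raise ValueError("elements are not in the same tree")


def score_media(root, media, query_terms):
    """Sum query-term matches of text-bearing elements, discounted by distance.

    Siblings and close relatives of the media element weigh more than
    distant nodes (assumed discount: 1 / (1 + edges)).
    """
    parents = build_parent_map(root)
    terms = {t.lower() for t in query_terms}
    total = 0.0
    for node in root.iter():
        if node is media or not (node.text and node.text.strip()):
            continue
        matches = sum(1 for word in node.text.lower().split() if word in terms)
        if matches:
            total += matches / (1 + tree_distance(media, node, parents))
    return total


if __name__ == "__main__":
    doc = ET.fromstring(
        "<doc><sec><title>Eiffel tower</title><img src='a.jpg'/>"
        "<p>tower at night</p></sec>"
        "<sec><p>tower bridge</p><img src='b.jpg'/></sec></doc>"
    )
    for img in doc.findall(".//img"):
        print(img.get("src"), round(score_media(doc, img, ["eiffel", "tower"]), 3))
```

In this toy document the first image ranks higher because the matching title and paragraph are its siblings, which reflects the intuition that nearby text describes an element that has little descriptive content of its own.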

A new metric for multimedia retrieval in structured documents

by Walid MAHDI

2023

Key finding: Proposes a novel similarity metric based on geometric distances within XML document trees that leverages kinship ties (children, siblings, ancestors) to better assess multimedia element relevance without relying on physical...

Efficient query modification for image retrieval

by Sugata Ghosal

2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662)

Key finding: Introduces an intra-query learning methodology where modified versions of the user query image (generated through segment-level manipulations) are used to infer user perceptual preferences without repeated database searches....

Query By Image for Efficient Information Retrieval: A Necessity (published in International Journal of Computer Applications, IJCA, impact factor 0.853)

by Divya Chadha

2023, International Journal of Computer Applications

Key finding: Discusses the necessity of image-based querying in retrieval systems, especially for unknown or unfamiliar images, highlighting shortcomings of existing text or shape-based search requiring descriptive metadata. Emphasizes a...

All papers in Multimedia Retrieval

Supporting top-k join queries in relational databases

by Walid Aref

2004, The VLDB Journal

Ranking queries, also known as top-k queries, produce results that are ordered on some computed score. Typically, these queries involve joins, where users are usually interested only in the top-k join results. Top-k queries are dominant...

Semantic concept-based query expansion and re-ranking for multimedia retrieval

by Lexing Xie

2007

We study the problem of semantic concept-based query expansion and re-ranking for multimedia retrieval. In particular, we explore the utility of a fixed lexicon of visual semantic concepts for automatic multimedia retrieval and re-ranking...

Text From Corners: A Novel Approach to Detect Text and Caption in Videos

by Thomas Huang

2000, IEEE Transactions on Image Processing

Detecting text and caption from videos is important and in great demand for video retrieval, annotation, indexing, and content analysis. In this paper, we present a corner based approach to detect text and caption from videos. This...

Concept modeling: From origins to multimedia

by Zoran Babovic and

2011, Multimedia Tools and Applications

The origins of concept modeling are in the field of artificial intelligence. This is where the initial algorithms were introduced first. With the emerging developments in the field of multimedia systems, a strong need is generated to...

Exploring Context and Content Links in Social Media: A Latent Space Method

by Thomas Huang

2000, IEEE Transactions on Pattern Analysis and Machine Intelligence

Social media networks contain both content and context-specific information. Most existing methods work with either of the two for the purpose of multimedia mining and retrieval. In reality, both content and context information are rich...

Joining Ranked Inputs in Practice

by Walid Aref

2002, VLDB '02: Proceedings of the 28th International Conference on Very Large Databases

Target testing and the PicHunter Bayesian multimedia retrieval system

by Stephen Omohundro

1996, Proceedings of the Third Forum on Research and Technology Advances in Digital Libraries

This paper addresses how the effectiveness of a content-based, multimedia information retrieval system can be measured, and how such a system should best use response feedback in performing searches. We propose a simple, quantifiable measure...

Semantic Combination of Textual and Visual Information in Multimedia Retrieval

by Julien Ah-Pine

2011

by Xing Li

2004, Proceedings of the 12th annual …

In this paper, we propose an iterative similarity propagation approach to explore the inter-relationships between Web images and their textual annotations for image retrieval. By considering Web images as one type of objects, their...

Efficient k-NN search on vertically decomposed data

by Martin Kersten

2002, Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02

Applications like multimedia retrieval require efficient support for similarity search on large data collections. Yet, nearest neighbor search is a difficult problem in high dimensional spaces, rendering efficient applications hard to...

MediaNet: A multimedia information network for knowledge representation

by John Smith

2000

In this paper, we present MediaNet, which is a knowledge representation framework that uses multimedia content for representing semantic and perceptual information. The main components of MediaNet include conceptual entities, which...

Multimedia content description in the InfoPyramid

by Rakesh Mohan

1998

There is a growing need for developing a content description language for multimedia that improves searching, indexing and managing of the multimedia content. The MPEG group recently established the MPEG-7 effort to standardize the...

Building a visual ontology for video retrieval

by Marcel Worring

2005, Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05

To ensure access to growing video collections, annotation is becoming more and more important. Using background knowledge in the form of ontologies or thesauri is a way to facilitate annotation in a broad domain. Current ontologies are...

Bringing order to your photos: event-driven classification of flickr images based on social knowledge

by Mihai Paiu

2010

With the rapidly increasing popularity of Social Media sites, a lot of user generated content has been injected in the Web, thus resulting in a large amount of both multimedia items (music - Last.fm, MySpace.com, pictures - Flickr, Picasa,...

5.3 Results: B-Cubed estimates the precision and recall associated with each document in the data set individually, and then uses the average precision P and average recall R for the data set to compute the B-Cubed score.

Figure 1: Classification results (Acc, P, R) for the experimental runs using only tags as features

Table 1: Example for clustering Flickr pictures

Table 4: Averaged classification results showing Accuracy, Precision, Recall, NMI, and B-Cubed (NMI and B-Cubed values are not available for the linear combination of the two classifiers)

Table 6: Examples of best and worst performing (by Acc) classifiers for the different experimental runs

SkipIndex: Towards a Scalable Peer-to-Peer Index Service for High Dimensional Data

by Chi Zhang

Indexing of high-dimensional data is essential for building applications such as multimedia retrieval, data mining, and spatial databases. Traditional index structures rely on centralized processing. This approach does not scale with the...

VideOlympics: Real-Time Evaluation of Multimedia Retrieval Systems

by Marcel Worring

2000, IEEE Multimedia

Interactive prototypes are often the best way to convince an audience of a new multimedia technology's possible impact. Because of its dynamic audiovisual nature, a multimedia application demonstration communicates applied science more...

Temporal Color Correlograms for Video Retrieval

by Mika Rautiainen

2002

This paper presents a novel method to retrieve segmented video shots based on their color content. The Temporal Color Correlogram captures the spatio-temporal relationship of colors in a video shot using co-occurrence statistics...

Structured document retrieval, multimedia retrieval, and entity ranking using PF/Tijah

by Henning Rode and

2008, Focused Access to XML Documents

CWI and University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multimedia retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and...

Inferring semantics from textual information in multimedia retrieval

by Jorma Laaksonen

2008, Neurocomputing

We propose a method for inferring semantic information from textual data in content-based multimedia retrieval. Training examples of images and videos belonging to a specific semantic class are associated with their low-level visual and...

Web search engine multimedia functionality

by Dian Tjondronegoro

2008, Information Processing & Management

Web search engines are beginning to offer access to multimedia searching, including audio, video and image searching. In this paper we report findings from a study examining the state of multimedia search functionality on major general...

Fig. 1. The process of extracting high-level semantic from multimedia data.

Comparison of multimedia search customization on general search engines

Comparison of multimedia search customization on specialized Web search engines (1)

Comparison of multimedia search customization on specialized Web search engines (2)

Supports for image, audio and video search by Web search engines

Supports for filtering, domain, format and size by Web search engines

Supports for semantic summary, taxonomy (features and semantic based), and semantic taxonomy (related topics) by Web search engines

Supports for temporal customization (duration) by Web search engines

Supports for visual customization (color and background) by Web search engines

Enriching user profiling with affective features for the improvement of a multimodal recommender system

by Yashar Moshfeghi

2009

Recommender systems have been systematically applied in industry and academia to help users cope with information uncertainty. However, given the multiplicity of the preferences and needs it has been shown that no approach is suitable for...

Acoi: A system for indexing multimedia objects

by Martin Kersten

1999

In this paper, we present a system that combines independent feature detector programs with multimedia database technology to provide a semantic rich index to multimedia data items on the World Wide Web.

Incremental kernel learning for active image retrieval without global dictionaries

by Frédéric Precioso

2011, Pattern Recognition

In content-based image retrieval context, a classic strategy consists in computing off-line a dictionary of visual features. This visual dictionary is then used to provide a new representation of the data which should ease any task of...

Crossing textual and visual content in different application scenarios

by Julien Ah-Pine

2009, Multimedia Tools and …

by Xiang Lian

2000, IEEE Transactions on Knowledge and Data Engineering

Similarity-based search has been a key factor for many applications such as multimedia retrieval, data mining, Web search and retrieval, and so on. There are two important issues related to the similarity search, namely, the design of a...

The role of high-level and low-level features in style-based retrieval and generation of multimedia presentations

by Eric Pauwels

2001, New Review of Hypermedia and Multimedia

In this article we argue that the automatic generation of dynamic multimedia presentation requires both low-level collections of objective measurements for media units representing prototypical style elements, and high-level conceptual...

A programmable router architecture supporting control plane extensibility

by P. Steenkiste

2000, IEEE Communications Magazine

The Internet is evolving from an infrastructure that provides basic communication services into a more sophisticated infrastructure that supports a wide range of electronic services such as virtual reality games and rich multimedia...

Lecture Video Indexing and Analysis Using Video OCR Technology

by Harald Sack

2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems

The text displayed in a lecture video is closely related to the lecture content. Therefore, it provides a valuable source for indexing and retrieving lecture video contents. Textual content can be detected, extracted and analyzed...

Content-based video retrieval and summarization using MPEG-7

by Harald Mayer

Retrieval in current multimedia databases is usually limited to browsing and searching based on low-level visual features and explicit textual descriptors. Semantic aspects of visual information are mainly described in full text...

GraphREL: A decomposition-based and selectivity-aware relational framework for processing sub-graph queries

by Sherif Sakr

2009, Database Systems for Advanced Applications

Semantic-driven multimedia retrieval with the MPEG Query Format

by Jaime Delgado

2010

Adaptive Multimedia Retrieval: User, Context, and Feedback: Third International Workshop, AMR 2005, Glasgow, UK, July 28-29, 2005, Revised Selected Papers (Lecture Notes in Computer Science)

by Andreas Nürnberger

2006

An index and retrieval framework integrating perceptive features and semantics for multimedia databases

by Qing He

2009, Multimedia Tools and Applications

Typically, in multimedia databases, there exist two kinds of clues for query: perceptive features and semantic classes. In this paper, we propose a novel framework for multimedia databases index and retrieval integrating the perceptive...

Towards to an automatic semantic annotation for multimedia learning objects

by Serge Linckels

2007, Proceedings of the international workshop on Educational multimedia and multimedia education - Emme '07

The number of digital video recordings has increased dramatically. The idea of recording lectures, speeches, and other academic events is not new. But, the accessibility and traceability of its content for further use is rather limited....

Retrieving Geo-location of Videos with a Divide & Conquer Hierarchical Multimodal Approach

by Michele Trevisiol

2013, ACM International Conference on Multimedia Retrieval (ICMR '13)

This paper presents a strategy to identify the geographic location of videos. First, it relies on a multi-modal cascade pipeline that exploits the available sources of information, namely the user’s upload history, his social network and...

A hybrid ontology and visual-based retrieval model for cultural heritage multimedia collections

by Stefanos Vrochidis

2008, International Journal of Metadata, Semantics and Ontologies

Nowadays, an increasingly growing demand for advanced multimedia search engines is arising, as huge amounts of digital visual content are becoming available. The contribution of this paper is the introduction of a hybrid multimedia...

Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval

by Csurka Gabriela

2007

We present here some transmedia similarity measures that we recently designed by adopting some “intermediate level” fusion approaches. The main idea is to use some principles coming from pseudo-relevance feedback and, more specifically,...

Cross Document Ontology based Information Extraction for Multimedia Retrieval

by Horacio Saggion

2003

This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain specific ontology, multilingual lexicons and...

Modeling and retrieval of moving objects

by John Shepherd

2001

This paper presents a symbolic formalism for modeling and retrieving video data via the moving objects contained in the video images. The model integrates the representations of individual moving objects in a scene with the time-varying...

From multimedia retrieval to knowledge management

by JM Van Thong

2000, Computer

We explore how current traditional applications in multimedia indexing can evolve into fully-fledged knowledge management systems in which multimedia content, audio, video and images, are first class citizens and contribute as much as...

Semantic MPEG Query Format Validation and Processing

by Jaime Delgado

2000, IEEE Multimedia

Question answering from lecture videos based on an automatic semantic annotation

by Serge Linckels

2008, ACM SIGCSE Bulletin

The number of digital lecture video recordings has increased dramatically. The accessibility, usability and the traceability of their content for students-use is limited. Therefore retrieval of audiovisual lecture recordings is a complex...

Managing video collections at large

by Bruno Janvier

2004, Proceedings of the 1st international workshop on Computer vision meets databases - CVDB '04

Video document retrieval is now an active part of the domain of multimedia retrieval. However, unlike for other media, the management of a collection of video documents adds the problem of efficiently handling an overwhelming volume of...

Interactive Indexing and Retrieval of Multimedia Content

by Hoàng Minh

2002

The indexing and retrieval of multimedia items is difficult due to the semantic gap between the user’s perception of the data and the descriptions we can derive automatically from the data using computer vision, speech recognition, and...

Flexible and scalable digital library search

by Martin Kersten

2001

XFIRM at INEX 2006. Ad-hoc, Relevance Feedback and MultiMedia tracks

by lobna hlaoua

2007

This paper describes experiments carried out with the XFIRM system in the INEX 2006 framework. The XFIRM system uses a relevance propagation method to answer CO and CO+S queries. Runs were submitted to the ad-hoc, relevance feedback and...

A semantic-based and adaptive architecture for automatic multimedia retrieval composition

by Isaak Kavasidis

2011, Proceedings - International Workshop on Content-Based Multimedia Indexing

In this paper we present a domain-independent multimedia retrieval (MMR) platform. Currently, the use of MMR systems for different domains poses several limitations, mainly related to the poor flexibility and adaptability to different...

RUSHES Retrieval of Multimedia Semantic Units for Enhanced Reusability

by pedro concejero and

2009, 2009 Seventh International Workshop on Content-Based Multimedia Indexing

Multimedia analysis and reuse of raw un-edited audio visual content known as rushes is gaining acceptance by a large number of research labs and companies. A set of research projects are considering multimedia indexing, annotation, search...

Multidimensional Descriptor Indexing: Exploring the BitMatrix

by Gabriel David

2006, Lecture Notes in Computer Science

Multimedia retrieval brings new challenges, mainly derived from the mismatch between the level of the user interaction (high-level concepts) and that of the automatically processed descriptors (low-level features). The effective use of the...
