Academia.eduAcademia.edu

Multimedia Retrieval

description591 papers
group38 followers
lightbulbAbout this topic
Multimedia Retrieval is the process of searching, accessing, and retrieving information from various types of media, including text, images, audio, and video. It involves the use of algorithms and techniques to index, query, and manage multimedia content, enabling efficient information retrieval based on user queries and preferences.
lightbulbAbout this topic
Multimedia Retrieval is the process of searching, accessing, and retrieving information from various types of media, including text, images, audio, and video. It involves the use of algorithms and techniques to index, query, and manage multimedia content, enabling efficient information retrieval based on user queries and preferences.

Key research themes

1. How can fusion of textual and visual information improve multimedia retrieval performance and semantic understanding?

This research theme investigates approaches that combine textual metadata, natural language queries, and visual features such as color, texture, and high-level semantic concepts to enhance multimedia retrieval accuracy and semantic understanding. It addresses the persistent semantic gap by mapping between low-level visual features and high-level textual or conceptual descriptions, enabling more effective retrieval of relevant multimedia content. This fusion leverages complementary strengths of each modality—text for semantic richness and visual features for specificity.

Key finding: This paper demonstrates that combining text-based query information with visual concept detectors via late fusion significantly improves video retrieval performance on real-world datasets. It finds that automatically mapping... Read more
Key finding: Presents an architecture (HPQS) integrating natural language query interpretation with semantic analysis and content-based retrieval of multimedia (images, tables, text). It exploits data fusion, caching, high-speed... Read more
Key finding: This work extends a multimodal retrieval system by enriching textual features through external query expansion and visual features via logistic regression-based concept detectors. For retrieval, sequential use of textual... Read more
Key finding: Introduces a multimedia retrieval framework that jointly indexes multi-modal content and incorporates a credibility model (expertise, trustworthiness, quality, reliability) to re-rank results. By integrating concept-based... Read more
Key finding: Shows that combining textual and structural features of XML documents using geometric metrics significantly improves multimedia retrieval effectiveness compared to using either modality alone. The approach represents... Read more

2. What advancements in feature representation and dimensionality reduction can enhance content-based multimedia retrieval efficiency and effectiveness?

This research theme focuses on novel representations of image and multimedia features, including combining local and global histograms of visual words, and dimensionality reduction techniques such as principal component analysis (PCA) and kernel PCA. Efficient feature extraction and selection improve retrieval scalability and accuracy by reducing high-dimensional data while preserving salient discriminative information. The exploration includes nonlinear dimension reduction and multilinear kernel mapping to better capture complex data structures and enhance retrieval precision.

Key finding: Proposes representing an image by combining global histograms of visual words over the entire image with local histograms computed over salient object regions (local rectangular areas). Experiments on several benchmark... Read more
Key finding: This study applies kernel PCA, a nonlinear extension of PCA, to extract principal components in a high-dimensional feature space induced by Gaussian kernels for image retrieval. Experimental results indicate that kernel PCA... Read more
Key finding: Introduces a multilinear kernel modeling approach to reduce the dimensionality of feature vectors derived from multimedia content. This approach accounts for the interrelation among dataset features more effectively than... Read more

3. How can structural metadata and query modification techniques address semantic challenges in multimedia retrieval systems?

This theme explores methods leveraging document structure (e.g., XML hierarchies) and interactive query adaptation to improve the retrieval of multimedia content. Techniques include geometric metrics exploiting XML node kinship to calculate relevance of multimedia elements in structured documents, addressing the limited descriptive content of multimedia elements themselves. Additionally, user-centric query modification methods, such as segment-based query refinement and intra-query learning, allow efficient alignment of retrieval systems with subjective human perception, reducing the semantic gap without repeated extensive database searches.

Key finding: Proposes a novel similarity metric based on geometric distances within XML document trees that leverages kinship ties (children, siblings, ancestors) to better assess multimedia element relevance without relying on physical... Read more
Key finding: Introduces an intra-query learning methodology where modified versions of the user query image (generated through segment-level manipulations) are used to infer user perceptual preferences without repeated database searches.... Read more
Key finding: Discusses the necessity of image-based querying in retrieval systems, especially for unknown or unfamiliar images, highlighting shortcomings of existing text or shape-based search requiring descriptive metadata. Emphasizes a... Read more

All papers in Multimedia Retrieval

Ranking queries, also known as top-k queries, produce results that are ordered on some computed score. Typically, these queries involve joins, where users are usually interested only in the top-k join results. Top-k queries are dominant... more
We study the problem of semantic concept-based query expansion and re-ranking for multimedia retrieval. In particular, we explore the utility of a fixed lexicon of visual semantic concepts for automatic multimedia retrieval and re-ranking... more
Detecting text and caption from videos is important and in great demand for video retrieval, annotation, indexing, and content analysis. In this paper, we present a corner based approach to detect text and caption from videos. This... more
The origins of concept modeling are in the field of artificial intelligence. This is where the initial algorithms were introduced first. With the emerging developments in the field of multimedia systems, a strong need is generated to... more
Social media networks contain both content and context-specific information. Most existing methods work with either of the two for the purpose of multimedia mining and retrieval. In reality, both content and context information are rich... more
This paper addresses how the e ectiveness of a contentbased, multimedia information retrieval system can be measured, and how such a system should best use response feedback in performing searches. We propose a simple, quanti able measure... more
In this paper, we propose an iterative similarity propagation approach to explore the inter-relationships between Web images and their textual annotations for image retrieval. By considering Web images as one type of objects, their... more
Applications like multimedia retrieval require efficient support for similarity search on large data collections. Yet, nearest neighbor search is a difficult problem in high dimensional spaces, rendering efficient applications hard to... more
In this paper, we present MediaNet, which is a knowledge representation framework that uses multimedia content for representing semantic and perceptual information. The main components of MediaNet include conceptual entities, which... more
There is a growing need for developing a content description language for multimedia that improves searching. indexing and managing of the multimedia content. The MPEG group recendy established the MPEG-7 effort to standardize the... more
To ensure access to growing video collections, annotation is becoming more and more important. Using background knowledge in the form of ontologies or thesauri is a way to facilitate annotation in a broad domain. Current ontologies are... more
With the rapidly increasing popularity of Social Media sites, a lot of user generated content has been injected in the Web, thus resulting in a large amount of both multimedia items (music -Last.fm, MySpace.com, pictures -Flickr , Picasa,... more
Indexing of high-dimensional data is essential for building applications such as multimedia retrieval, data mining, and spatial databases. Traditional index structures rely on centralized processing. This approach does not scale with the... more
I nteractive prototypes are often the best way to convince an audience of a new multimedia technology's possible impact. Because of its dynamic audiovisual nature, a multimedia application demonstration communicates applied science more... more
EVWUDFW 7KLV SDSHU SUHVHQWV D QRYHO PHWKRG WR UHWULHYH VHJ PHQWHG YLGHR VKRWV EDVHG RQ WKHLU FRORU FRQWHQW 7KH 7HPSRUDO &RORU &RUUHORJUDP FDSWXUHV WKH VSDWLR WHPSRUDO UHODWLRQVKLS RI FRORUV LQ D YLGHR VKRW XVLQJ FR RFFXUUHQFH VWDWLVWLFV... more
CWI and University of Twente used PF/Tijah, a flexible XML retrieval system, to evaluate structured document retrieval, multimedia retrieval, and entity ranking tasks in the context of INEX 2007. For the retrieval of textual and... more
We propose a method for inferring semantic information from textual data in content-based multimedia retrieval. Training examples of images and videos belonging to a specific semantic class are associated with their low-level visual and... more
Web search engines are beginning to offer access to multimedia searching, including audio, video and image searching. In this paper we report findings from a study examining the state of multimedia search functionality on major general... more
Recommender systems have been systematically applied in industry and academia to help users cope with information uncertainty. However, given the multiplicity of the preferences and needs it has been shown that no approach is suitable for... more
In this paper, we present a system that combines independent feature detector programs with multimedia database technology to provide a semantic rich index to multimedia data items on the World Wide Web.
In content-based image retrieval context, a classic strategy consists in computing off-line a dictionary of visual features. This visual dictionary is then used to provide a new representation of the data which should ease any task of... more
Similarity-based search has been a key factor for many applications such as multimedia retrieval, data mining, Web search and retrieval, and so on. There are two important issues related to the similarity search, namely, the design of a... more
In this article we argue that the automatic generation of dynamic multimedia presentation requires both low-level collections of objective measurements for media units representing prototypical style elements, and high-level conceptual... more
The Internet is evolving from an infrastructure that provides basic communication services into a more sophisticated infrastructure that supports a wide range of electronic services such as virtual reality games and rich multimedia... more
The text displayed in a lecture video is closely related to the lecture content. Therefore, it provides a valuable source for indexing and retrieving lecture video contents. Textual content can be detected, extracted and analyzed... more
Retrieval in current multimedia databases is usually limited to browsing and searching based on low-level visual features and explicit textual descriptors. Semantic aspects of visual information are mainly described in full text... more
Typically, in multimedia databases, there exist two kinds of clues for query: perceptive features and semantic classes. In this paper, we propose a novel framework for multimedia databases index and retrieval integrating the perceptive... more
The number of digital video recordings has increased dramatically. The idea of recording lectures, speeches, and other academic events is not new. But, the accessibility and traceability of its content for further use is rather limited.... more
This paper presents a strategy to identify the geographic location of videos. First, it relies on a multi-modal cascade pipeline that exploits the available sources of information, namely the user’s upload history, his social network and... more
Nowadays, an increasingly growing demand for advanced multimedia search engines is arising, as huge amounts of digital visual content are becoming available. The contribution of this paper is the introduction of a hybrid multimedia... more
We present here some transmedia similarity measures that we recently designed by adopting some “intermediate level” fusion approaches. The main idea is to use some principles coming from pseudo-relevance feedback and, more specifically,... more
This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain specific ontology, multilingual lexicons and... more
This paper presents a symbolic formalism for modeling and retrieving video data via the moving objects contained in the video images. The model integrates the representations of individual moving objects in a scene with the time-varying... more
We explore how current traditional applications in multimedia indexing can evolve into fully-fledged knowledge management systems in which multimedia content, audio, video and images, are first class citizens and contribute as much as... more
The number of digital lecture video recordings has increased dramatically. The accessibility, usability and the traceability of their content for students-use is limited. Therefore retrieval of audiovisual lecture recordings is a complex... more
Video document retrieval is now an active part of the domain of multimedia retrieval. However, unlike for other media, the management of a collection of video documents adds the problem of efficiently handling an overwhelming volume of... more
The indexing and retrieval of multimedia items is difficult due to the semantic gap between the user’s perception of the data and the descriptions we can derive automatically from the data using computer vision, speech recognition, and... more
This paper describes experiments carried out with the XFIRM system in the INEX 2006 framework. The XFIRM system uses a relevance propagation method to answer CO and CO+S queries. Runs were submitted to the ad-hoc, relevance feedback and... more
In this paper we present a domain-independent multimedia retrieval (MMR) platform. Currently, the use of MMR systems for different domains poses several limitations, mainly related to the poor flexibility and adaptability to different... more
Multimedia analysis and reuse of raw un-edited audio visual content known as rushes is gaining acceptance by a large number of research labs and companies. A set of research projects are considering multimedia indexing, annotation, search... more
Multimedia retrieval brings new challenges, mainly derived from the mismatch between the level of the user interactionhigh-level concepts, and that of the automatically processed descriptorslow-level features. The eective use of the... more
Download research papers for free!