Academia.eduAcademia.edu

Multimedia Indexing and Searching

description635 papers
group24,461 followers
lightbulbAbout this topic
Multimedia Indexing and Searching is the process of organizing and retrieving diverse forms of media content, such as text, images, audio, and video, using algorithms and metadata. It involves creating efficient representations of multimedia data to facilitate quick and accurate search and retrieval in large databases.
lightbulbAbout this topic
Multimedia Indexing and Searching is the process of organizing and retrieving diverse forms of media content, such as text, images, audio, and video, using algorithms and metadata. It involves creating efficient representations of multimedia data to facilitate quick and accurate search and retrieval in large databases.

Key research themes

1. How can content-based visual features be effectively used to overcome the semantic gap in multimedia indexing and retrieval?

This theme addresses the challenge of bridging the semantic gap between low-level visual features extracted from multimedia content and high-level human semantic queries. It investigates methods for combining multiple visual features to improve retrieval accuracy and relevance, emphasizing content-based video and image retrieval techniques.

Key finding: The paper demonstrates that employing multiple low-level visual features including color histograms, texture features, and shape in video indexing and retrieval improves discrimination and search tasks, contributing to... Read more
Key finding: Introduces color correlograms that encode spatial correlation of colors in images, outperforming traditional global color histograms and histogram refinement methods for image indexing and retrieval. This approach is robust... Read more
Key finding: Presents a system (HPQS) integrating natural language querying with semantic analysis and fuzzy data fusion for multimedia content-based retrieval that goes beyond low-level features by interpreting textual and image content... Read more
Key finding: Finds that combining multiple automatically-extracted visual and auditory features supports user queries in video retrieval but user adjustments of feature weights do not significantly improve performance. Keywords and... Read more
Key finding: Beyond employing multi-feature extraction, the study emphasizes including human visual perception aspects in designing video retrieval systems to better interpret semantic content, by modeling intellectual and emotional... Read more

2. What scalable algorithms and data structures enable efficient large-scale multimedia indexing and searching?

Focused on the computational challenges in indexing and retrieving multimedia data at scale, this research theme explores data structures like permutation-based indexing and algorithmic advances aimed at optimizing indexing time and retrieval speeds, including GPU and multi-core processing for large datasets.

Key finding: Proposes the Metric Permutation Table (MPT) data structure for efficient approximate nearest neighbor search in very large multimedia databases, with indexing based on permutations relative to reference objects. By... Read more
Key finding: Introduces a linear-time, scalable algorithm for attribute selection that uses fractal dimension to approximate intrinsic data dimensionality, enabling fast identification and elimination of redundant or irrelevant features... Read more
Key finding: Presents a variant of the SALSA link-analysis ranking algorithm called sNorm(p), which incorporates vector norms to better balance hub and authority scores in web page ranking, improving ranking effectiveness metrics such as... Read more

3. How can multimodal and semantic-rich user interfaces and query systems enhance multimedia search experience?

This theme examines the integration of multimodal inputs—such as natural language queries, images, and low-level visual features—to improve search accuracy and user experience in multimedia retrieval. It covers the fusion of semantic metadata, user interaction components, and adaptive query mechanisms to contextualize multimedia search.

Key finding: Demonstrates a comprehensive natural language interface enabling users to submit semantically rich queries which are transformed into formal retrieval requests. This multimodal approach combines semantic analysis of text,... Read more
Key finding: Evaluates the impact of interactive interfaces that allow users to adjust weights on multiple automatically-extracted features for video search. Findings indicate that some semantic features (e.g., Indoors/Outdoors) are more... Read more
Key finding: Proposes a novel method using music taste and audio feature extraction integrated with machine learning for user similarity detection and social network formation. The system incorporates complex audio features beyond FFT,... Read more

All papers in Multimedia Indexing and Searching

This research explores the interaction of textual and photographic information in document understanding. The problem of performing generalpurpose vision without a priori knowledge is difficult at best. The use of collateral information... more
Despite recent advances in joint processing of images, sometimes it may not be as effective as single image processing for object discovery problems. In this paper while aiming for common object detection, we attempt to address this... more
Dans le cadre de la recherche d'information dans le Web des données, nous proposons une sorte de version compacte d'une base de triplets RDF agissant comme un aperçu – i.e. une vue d'ensemble – sur cette base. L'intérêt de... more
The synergies of current trends are shaping creation, discovery, access, use and reuse of information resources. The digital landscape has exploded, providing greater access to a larger portion of the world’s population since the creation... more
In this paper, we present an ongoing work aiming to improve content based image retrieval performance with the help of logical concept analysis. Domain semantic is formalized and used instead of classical CBIR visual features. This is... more
In this paper, we present our proposed approach that deals with two important steps in the retrieval process, which are query analysis and relevance-based ranking. First, query analysis takes into account two forms of queries: textual and... more
Content-based image retrieval (CBIR) has been a challenging problem and its performance relies on the efficiency in modeling the underlying content and the similarity measure between the query and the retrieved images. Most existing... more
The environment in the east, south and northeastern parts of Bangladesh has an abundance of solar radiation along with high levels of UV radiation and humidity. In addition, there are also remote areas where the establishment of... more
Social media sharing websites sanction users to annotate images with free tags, which significantly contribute to the development of the web image retrieval. Tag-predicated image search is a consequential method to find images shared by... more
Now a days P2P networks are widely used for voice and video communications and also in many transactions like file sharing. In P2P networks DHT (Distributed Hash Table) oriented routing protocols gives an efficient way to search the... more
The Specification and Description Language (SDL) and Message SequenceCharts (MSC) are formal description techniques which gain moreand more importance in the development and specification of complexreal-time systems, especially in the... more
The availability of research on the social web is an important factor to determine its societal impact. The inability of traditional citation-based metrics to provide a complete picture of web-based scholarly content has given rise to... more
This paper covers a use study of the Online Public Access Catalogues (OPACs) at the University Libraries of West Bengal. Highlights the subject access for Bengali documents in OPACs. It finds that most of the users are postgraduate... more
In this paper we describe our experiments in the automatic and interactive search tasks of TRECVID 2008. We submitted six runs, five of them are automatic and one is interactive. The automatic runs include, a text baseline, two runs based... more
Abstract: In this paper, we propose an extended framework structure designed for MUVIS multimedia indexing and retrieval scheme in order to achieve the dynamic integration and run-time execution for the following operations within the... more
Due to widespread use of the Internet, efficient management of multimedia databases has attracted many researchers. Variety of techniques including database indexing, classification and feature extraction are developed. In this study,... more
Budapest’s SEO experts are at the forefront of a transformative shift in search engine optimization, integrating advanced mathematical frameworks and AI-driven insights to redefine how information is accessed and utilized. By drawing... more
Search engines are popularly utilized for extracting desired information from World Wide Web by users.  Efficiency of these search engines are dependent on how fast search results can be retrieved and whether these results reflects the... more
Content-based multimedia information retrieval is an interesting research area since it allows retrieval based on inherent characteristic of multimedia objects. For example retrieval based on visual characteristics such as colour, shapes... more
When humans describe images they tend to use combinations of nouns and adjectives, corresponding to objects and their associated attributes respectively. To generate such a description automatically, one needs to model objects, attributes... more
Due to the rapid growth of the number of digital media elements like image, video, audio, graphics on Internet, there is an increasing demand for effective search and retrieval techniques. Recently, many search engines have made image... more
In this paper, we introduce a novel framework for automatic Semantic Video Annotation. As this framework detects possible events occurring in video clips, it forms the annotating base of video search engine. To achieve this purpose, the... more
The rapidly increasing amount of video collections, available on the web or via broadcasting, motivated research towards building intelligent tools for searching, rating, indexing and retrieval purposes. Establishing a semantic... more
The rapidly increasing amount of video collections, especially on the web, motivated the need for intelligent automated annotation tools for searching, rating, indexing and retrieval purposes. These videos collections contain all types of... more
Users are generally interested in the edge-ranked section of returning search results, according to an analysis of click-through data from a very big search engine log. As a result, search engines must achieve great accuracy with... more
Proximity searching consists in retrieving from a database, objects that are close to a query. For this type of searching problem, the most general model is the metric space, where proximity is defined in terms of a distance function. A... more
This paper describes a serious game intended to teach singing to children. The system shows the students a virtual world they can explore, and the evolution of the world is based on their performance. Automatic content generation... more
This chapter presents a broad overview of Computational Intelligence (CI) techniques including Neural Network (NN), Particle Swarm Optimization (PSO), Evolutionary Algorithm (GA), Fuzzy Set (FS), and Rough Sets (RS). In addition, a very... more
Web content nowadays can also be accessed through new generation of Internet connected TVs. However, these pro ducts failed to change users' behavior when consuming online content. Users still prefer personal computers to access Web... more
Multimedia provides an upscale content of knowledge and a large quantity of knowledge square measure obtainable within the field of video retrieval. To achieve the goal of indexing and retrieving video shots by their content still an... more
Os documentos audiovisuais trazem para a Ciência da Informação brasileira inúmeros desafi os referentes a identifi cação de seu assunto principal, e torna mais complexo o momento de elaboração de resumo e identifi cação de palavras-chave... more
This chapter presents a broad overview of Computational Intelligence (CI) techniques including Neural Network (NN), Particle Swarm Optimization (PSO), Evolutionary Algorithm (GA), Fuzzy Set (FS), and Rough Sets (RS). In addition, a very... more
The 'Papyrus Portal' is a project that aims to provide the user with an efficient and effective search of all digitized and electronically catalogued papyrus collections in Germany, and a unified presentation of the search results... more
This work represents the introduction to the proceedings of the 1 st International Workshop on Multilayer Music Representation and Processing (MMRP19) authored by the Program Co-Chairs. The idea is to explain the rationale behind such a... more
This letter from Mumbai includes a short biography of Dr. Vijayprasad Gopichand, a note on the need for physically handicapped physicians and medical personnel to be treated with dignity, pleads for respect to be paid to nurses by... more
Content-based image retrieval systems were introduced as an alternative to avoid the need of manual tagging in traditional keyword-based image retrieval systems. However, the representation of image using visual features only involves a... more
Global security concerns have raised a proliferation of video surveillance devices. Intelligent surveillance systems seek to discover possible threats automatically and raise alerts. Being able to identify the surveyed object can help... more
www.ijaret.com © IJARET All Rights Reserved Context-Aware Video Annotation Using Linked Data and Search Educational Video Resources for Supporting Distance Learning IPrasad V. Phalle, IISuresh K. Shirgave, IIIG. A. Patil I,IIIDept. of... more
We investigate the problem of retrieving similar shapes from a large database; in particular, we focus on medical tumor shapes ("Find tumors that are similar to a given pattern."). We use a natural similarity function for shape-matching,... more
In this paper, we study the automatic construction of personalized TV News programs, where we want to build a program with predefined duration and maximum content value for a specific user. We combine video indexing techniques to parse TV... more
In this paper, we study the automatic construction of personalized TV News programs, where we want to build a program with predefined duration and maximum content value for a specific user. We combine video indexing techniques to parse TV... more
This paper describes a novel browsing paradigm, taking benefit of the various types of links (e.g. thematic, temporal, references, etc.) that can be automatically built between multimedia documents. This browsing paradigm can help... more
The performance of a content based retrieval system is limited mainly because of the unavailability of sufficient annotated examples, descriptor noise and the semantic gap that is the representation difference between the high level... more
The performance of a content based retrieval system is limited mainly because of the unavailability of sufficient annotated examples, descriptor noise and the semantic gap that is the representation difference between the high level... more
In the context of mathematical morphology, component-graphs are complex but powerful structures for multi-band image modeling, processing, and analysis. In this work, we propose a novel multi-band object detection method relying on the... more
In intelligent environments, activity detection is a necessary pre-processing step for adaptive energy management and interaction with humans. To characterize the interactions between individuals or between an individual and the... more
Aborda o tratamento da informacao musical, sendo a selecionada neste estudo os documentos musicais impressos – que sera denominado preferencialmente por partituras, no que tange a catalogacao empregada nessa tipologia especifica de... more
The digital revolution that has somehow taken place with the World-Wide Web took advantage of the availability and interoperability of tools for visualisation and manipulation of text-based data, as well as the satisfying pertinence of... more
Download research papers for free!