Key research themes
1. How can fusion of textual and visual information improve multimedia retrieval performance and semantic understanding?
This research theme investigates approaches that combine textual metadata, natural language queries, and visual features such as color, texture, and high-level semantic concepts to enhance multimedia retrieval accuracy and semantic understanding. It addresses the persistent semantic gap by mapping between low-level visual features and high-level textual or conceptual descriptions, enabling more effective retrieval of relevant multimedia content. This fusion leverages complementary strengths of each modality—text for semantic richness and visual features for specificity.
2. What advancements in feature representation and dimensionality reduction can enhance content-based multimedia retrieval efficiency and effectiveness?
This research theme focuses on novel representations of image and multimedia features, including combining local and global histograms of visual words, and dimensionality reduction techniques such as principal component analysis (PCA) and kernel PCA. Efficient feature extraction and selection improve retrieval scalability and accuracy by reducing high-dimensional data while preserving salient discriminative information. The exploration includes nonlinear dimension reduction and multilinear kernel mapping to better capture complex data structures and enhance retrieval precision.
3. How can structural metadata and query modification techniques address semantic challenges in multimedia retrieval systems?
This theme explores methods leveraging document structure (e.g., XML hierarchies) and interactive query adaptation to improve the retrieval of multimedia content. Techniques include geometric metrics exploiting XML node kinship to calculate relevance of multimedia elements in structured documents, addressing the limited descriptive content of multimedia elements themselves. Additionally, user-centric query modification methods, such as segment-based query refinement and intra-query learning, allow efficient alignment of retrieval systems with subjective human perception, reducing the semantic gap without repeated extensive database searches.