Key research themes
1. How can machine learning and NLP improve the quality and semantic consistency of textual annotations in multilingual multimedia archives?
This research area focuses on applying advanced machine learning (ML), natural language processing (NLP), and deep learning techniques to automatically enhance the quality, harmonization, and semantic coherence of textual annotations (e.g., keywords and tags) linked to multimedia content, especially in multilingual and heterogeneous digital libraries. Improving annotation quality aids effective search, navigation, and visualization of large multimedia repositories and addresses challenges such as language identification, spelling correction, semantic similarity, and term specialization.
2. What are effective approaches to multimedia annotation that enhance collaborative reasoning and decision-making in distributed virtual and educational environments?
This research theme explores designing and evaluating multimedia annotation systems that enrich collaborative decision-making and reflective learning processes in virtual environments (VEs) and educational settings. It emphasizes multimodal annotations (audio, text, sketches, video-synchronized camera movements) combined with structured argumentation trees and shared tag vocabularies to capture provenance, facilitate asynchronous discussions, and promote critical thinking among practitioners and students. These approaches support geographically distributed teams or learners engaging with complex multimedia artifacts and professional contexts.
3. How can user-centered methodologies and tools facilitate effective manual or semi-automatic multimedia annotation integrating semantic web technologies and user expertise?
Given that fully automated semantic annotation remains inadequate for complex multimedia, this theme examines user-centered frameworks, methodologies, and tools that assist annotators—including non-expert users—in manually or semi-automatically creating ontology-based, multimedia annotations. It considers approaches that lower barriers to ontology navigation and extension, synchronize structured annotations with multimedia playback, and enable rich interaction with multimedia fragments. These methods are designed to produce precise, interoperable annotations while bridging the semantic gap through collaborative user involvement.