Key research themes
1. How can provenance capture and dataflow management enhance transparency and runtime analysis in scientific workflows?
This research area focuses on developing systems and methods that enable efficient capture, integration, and tracking of data provenance and dataflows during scientific workflow executions. By explicitly representing data transformations and execution dependencies, these approaches aim to enhance transparency, reproducibility, and runtime monitoring capabilities, which are critical for complex, distributed, and multi-workflow scientific analyses.
2. What methodologies and languages can improve specification, reuse, and interoperability of scientific workflow designs in data-intensive applications?
This theme addresses the development of domain-specific languages (DSLs) and structured frameworks that abstract workflow design from specific platforms, improving portability, modularity, and reuse across heterogeneous execution environments. It highlights innovations in workflow modeling that separate workflow intent from execution technology, allow structured composition of control and dataflows, and support collaborative development in complex scientific domains.
3. How can scheduling strategies and resource management be optimized for scientific workflows in cloud environments considering workflow structure and priorities?
Research under this theme develops scheduling algorithms and resource utilization strategies tailored to the structural characteristics of scientific workflows to minimize execution time and cost in cloud platforms. It investigates workload partitioning based on task dependencies, priority assignment, balancing computational requirements, and leveraging virtualization. These techniques aim to optimize performance in cloud-based workflow execution while managing the tradeoff between resource expenses and throughput.