Key research themes
1. How can virtualization-aware file systems enhance the flexibility and management of virtual disk snapshots beyond conventional coarse-grained rollback?
This research theme focuses on overcoming the inherent limitations of conventional virtual disks, mainly their coarse-grained, all-or-nothing rollback and lack of internal structure that impede fine-grained sharing, searching, and secure management. Virtualization-aware file systems (VAFS) integrate versioning, mobility, and access control features of virtual disks with the fine-grained sharing and usability benefits of distributed file systems. This hybrid approach aims to enable more flexible virtual machine (VM) snapshotting, minimize storage overhead, and improve security and manageability of virtual disk snapshots in dynamic VM environments.
2. What strategies optimize storage efficiency and reliability in cloud-based virtual disk snapshots using deduplication and erasure coding?
This theme explores techniques to reduce storage costs and improve fault tolerance for virtual disk snapshots, especially within Infrastructure-as-a-Service (IaaS) clouds hosting HPC or large-scale applications. Deduplication eliminates redundant data chunks across snapshots or VMs, considerably reducing space and I/O bandwidth requirements. Erasure codes, particularly Reed-Solomon coding, provide fault tolerance with less storage overhead compared to replication but require optimization for bandwidth efficiency during data writes. Together, these approaches address challenges of large-scale snapshot storage, balancing performance, reliability, and cost.
3. How do advanced snapshot management techniques improve scalability and efficiency of virtual disk snapshots in distributed storage systems and clouds?
This research area investigates algorithms and system designs that manage large volumes of virtual disk snapshots by optimizing snapshot creation, storage, and retrieval in distributed environments. Focus areas include reducing snapshot size and fragmentation, increasing snapshot accessibility, and supporting operations like suspend-resume and migration. Techniques such as copy-on-write with snapshot disentanglement, versioning file systems optimized for multi-deployment and multi-snapshot operations, and hybrid remote replication methods aim to balance storage overhead, network bandwidth use, and I/O performance for scalable snapshot management.