Academia.eduAcademia.edu

Grid Scheduling

description839 papers
group10 followers
lightbulbAbout this topic
Grid scheduling refers to the process of allocating resources and managing tasks across a distributed computing environment, known as a grid. It involves optimizing the execution of jobs by considering factors such as resource availability, job priorities, and execution times to enhance overall system performance and efficiency.
lightbulbAbout this topic
Grid scheduling refers to the process of allocating resources and managing tasks across a distributed computing environment, known as a grid. It involves optimizing the execution of jobs by considering factors such as resource availability, job priorities, and execution times to enhance overall system performance and efficiency.

Key research themes

1. How can online and hierarchical scheduling models effectively allocate jobs across multiple machines in Grid environments to minimize makespan?

This research area focuses on the design, analysis, and implementation of online scheduling algorithms tailored for Grids composed of multiple machines (clusters) with identical processors within each machine. It investigates non-preemptive scheduling of parallel jobs submitted over time, emphasizing allocation strategies that consider the structural constraints of Grids — notably that jobs run exclusively on processors within a single machine. The goal is to minimize makespan while managing the dynamic arrival of jobs and heterogeneity across machines. Hierarchical models separate scheduling into job-to-machine allocation followed by local scheduling, addressing practical limitations such as overheads in job migration and decentralized control. These models matter because efficient scheduling directly impacts Grid resource utilization, user satisfaction, and system throughput in highly distributed computational environments.

Key finding: Introduces a model extending classic multiprocessor scheduling by structuring processors into multiple machines, where jobs must execute wholly within one machine. Demonstrates that Garey and Graham’s list scheduling performs... Read more
Key finding: Proposes a two-stage online scheduling model where jobs are first assigned to machines and then locally scheduled. Introduces an adaptive admissible allocation algorithm and provides competitive analysis showing improved load... Read more
Key finding: Applies particle swarm optimization (PSO) meta-heuristics to centralized scheduling of independent jobs on heterogeneous Grid resources, noting that PSO tends to stagnate in local minima for high-dimensional problems. The... Read more
Key finding: Develops a novel hybrid meta-heuristic combining PSO and GA for centralized scheduling of independent jobs on a static Grid model, addressing PSO’s tendency to stagnate locally in large problem instances. Experimental... Read more

2. What is the comparative performance and theoretical basis of immediate (online) versus batch scheduling heuristics in Grid resource allocation?

This theme concerns the evaluation and analysis of heuristic scheduling approaches that decide how and when tasks/jobs are assigned to computational resources in Grid environments. Immediate (online) scheduling algorithms allocate tasks as soon as they arrive, which suits dynamic, real-time scenarios but may yield suboptimal load balancing. Batch scheduling collects a set of tasks and schedules them jointly at specific intervals, usually resulting in better makespan and resource utilization but incurring scheduling delay. Understanding the applicability, strengths, weaknesses, and trade-offs among heuristics such as Minimum Execution Time (MET), Minimum Completion Time (MCT), Min-Min, Max-Min, and their variants informs the design of practical Grid schedulers that balance efficiency, responsiveness, and load balance.

Key finding: Provides a detailed review and comparative analysis of immediate (online) scheduling heuristics like MET and MCT, alongside batch mode heuristics like Min-Min and Max-Min, highlighting their algorithmic principles and... Read more
Key finding: Proposes Max-Average, an enhanced Max-Min scheduling algorithm that improves load balancing and makespan minimization by combining characteristics of Max-Min and Min-Min heuristics. The algorithm operates in two phases:... Read more
Key finding: Develops and empirically evaluates hybrid scheduling algorithms combining deadline and slack time parameters in soft real-time Grid environments. By testing against real workload traces, the study finds that integrating slack... Read more

3. How do groupings and structural characteristics of job submissions affect scheduling performance and resource utilization in Grid systems?

This research domain examines the patterns in which jobs are submitted to Grid systems—such as batch submissions, continued sequences, or bursty job arrivals—and their implications on scheduling efficiency, resource consumption, and user experience. Characterizing and understanding such grouped job behaviors are essential to optimize scheduling policies, load balancing, and performance prediction. The focus lies on revealing real workload properties and how such groupings affect takeover scheduling mechanisms, slowdown metrics, and effective CPU utilization, contributing practical insights for improving Grid workload management.

Key finding: Analyzes three types of job groupings—batch, continued, and bursty submissions—in traces from production and experimental Grids, confirming that grouped jobs account for up to 96% of total CPU consumption. The study reveals... Read more

All papers in Grid Scheduling

Grid scheduling is a vital component of a Grid infrastructure. Reliability, efficiency (in terms of time consumption), effectiveness in resource utilization, and robustness are the desired characteristics of Grid scheduling systems. Many... more
The aim of this paper is to provide a description of deep-learning-based scheduling approach for academicpurpose high-performance computing systems. Academicpurpose distributed computing systems' (DCS) share reaches 17.4% amongst TOP500... more
Several Grid projects have been established that deploy a "first generation Grid". In order to categorise existing projects in Europe, we have developed a taxonomy and applied it to 20 European Grid projects funded by the European... more
Grid computing has become one of the most important research topics that appeared in the field of computing in the last years. Simultaneously, we have noticed the growing popularity of new Web-based technologies which allow us to create... more
Grid simulation tools provide frameworks for simulating application scheduling in various Grid infrastructures. However, while experimenting with many existing tools, we have encountered two main shortcomings: (i) lack of tools for... more
Grid simulation tools provide frameworks for simulating application scheduling in various Grid infrastructures. However, while experimenting with many existing tools, we have encountered two main shortcomings: (i) there are no tools for... more
In the paper we present two different models of Grid resource management problems: (i) Grid scheduling problems with no time characteristics available, and (ii) scheduling of jobs in presence of time characteristics achieved by using some... more
The problem of scheduling independent users' jobs to resources in Grid Computing systems is of paramount importance. This problem is known to be NP-hard, and many techniques have been proposed to solve it, such as heuristics, genetic... more
Due to the complex nature of Grid systems, the design of efficient Grid schedulers is challenging since such schedulers have to be able to optimize many conflicting criteria in very short periods of time. This problem has been tackled in... more
This paper describes the program execution framework being developed by the Grid Application Development Software (GrADS) Project. The goal of this framework is to provide good resource allocation for Grid applications and to support... more
The goal of the Grid Application Development Software (GrADS) Project is to provide programming tools and an execution environment to ease program development for the Grid. This paper presents recent extensions to the GrADS software... more
In this article we address the problem of scheduling on realistic high performance computing facilities using incomplete information about tasks execution times. We introduce a variation of our previous Penalty Scheduling Policy,... more
The Computational Grid (CG) provides a wide distributed platform for high end computing intensive applications. Scheduling on Computational grid is known to be NP-Hard problem and requires an efficient solution. Recently, quantum inspired... more
In this work we present the bio-backfill scheduler, a backfill scheduler for bioinformatics workflows applications running on shared, heterogeneous clusters. Backfill techniques advance low-priority jobs in cluster queues, if doing so... more
The applicability of min cost flow and multicommodity flow mathematical programming problems to steady state, multi-source divisible load scheduling is examined. Applying the linear model concept of superposition to such steady state... more
To date closed form solutions for optimal finish time and job allocation are largely obtained only for network topologies with a single load originating (root) processor. However in large-scale data intensive problems with geographically... more
In this paper, based on a thorough analysis of different policies for DAG scheduling, an improved algorithm ICPDP (Improved Critical Path using Descendant Prediction) is introduced. The algorithm performs well with respect to the total... more
Because of its sheer size, Computational Grids (CGs) require advanced methodologies and strategies to efficiently schedule users tasks and applications to resources. Scheduling becomes even more challenging when energyefficiency,... more
In traditional distributed computing systems a few user types are found having rather "flat" profiles, mainly due to the same administrative domain the users belong to. This is quite different in Computational Grids (CGs) in which several... more
Scheduling application in grid is a complex task that often fails due to nonavailability of resources and the required execution environment in the resources. The CARE Resource Broker (CRB) proposed in this paper addresses such scheduling... more
Grids are facing the challenge of moving from batch systems to interactive computing. In the 70s, standalone computer systems have met this challenge, and this was the starting point of pervasive computing. Meeting this challenge will... more
In the highly unpredictable environment of grid computing, it often makes sense to replicate the same job on several of the available servers. We prove that never replicating is optimal for a heterogeneous multi-server system in a random... more
Scheduling problems have been thoroughly explored by the research community, but they acquire challenging characteristics in grid computing systems. In this context, it is important to have a scheduling strategy that can make efficient... more
Workflow scheduling in Grids becomes an important area as it allows users to process large scale problems in an atomic way. However, validating the performance of workflow scheduling strategies in real production environment cannot be... more
Hierarchical Grids 1 2 3 Your article is protected by copyright and all rights are held exclusively by Springer Science+Business Media B.V.. This e-offprint is for personal use only and shall not be selfarchived in electronic... more
We evaluate job scheduling algorithms that integrate both tasks of Grid scheduling: job allocation to Grid sites and local scheduling at the sites. We propose and analyze an adaptive job allocation scheme named admissible allocation. The... more
The design of scheduling algorithms for a heterogeneous computing system interconnected with an arbitrary communication network is one of the main concerns in distributed systems research, due to the heterogeneous nature of resource,... more
An automatic Vehicle-to-Grid (V2G) technology can contribute to the utility grid. V2G technology has drawn great interest in the recent years. Success of the sophisticated automatic V2G research depends on efficient scheduling of gridable... more
A Grid Resource Broker for a Grid domain, or also known as meta-scheduler, is a middleware component used for matching works to available Grid resources from one or more IT organizations. A Grid meta-scheduler usually has its own... more
This paper proposes a two-level scheduler for dynamically scheduling a continuous stream of sequential and multi-threaded batch jobs on large-scale grids, made up of interconnected clusters of heterogeneous single-processor and/or... more
Bag-of-Tasks applications (those parallel applications whose tasks are independent) are both relevant and amendable for execution on computational grids. In fact, one can argue that Bag-of-Tasks applications are the applications most... more
Trust and reputation models play an important role in enabling trusted computations over large-scale distributed Grids. Many models have been recently proposed and implemented within trust management systems. Nevertheless, the existing... more
Grid computing environment is used for large computation problem. In Today’s world various complex tasks has been done in different scientific area so there is need of a lot of computational power to solve different types of complex... more
Grid computing infrastructures embody a cost-effective computing paradigm that virtualises heterogenous system resources to meet the dynamic needs of critical business and scientific applications. These applications range from batch... more
Grid computing infrastructures are inherently dynamic and unpredictable environments shared by many users. Grid schedulers aim to make efficient use of Grid resources while providing the best possible performance to the Grid applications... more
Grid computing infrastructures embody a cost-effective computing paradigm that virtualises heterogeneous system resources to meet the dynamic needs of critical business and scientific applications. These applications range from batch... more
This work presents a novel parallel micro evolutionary algorithm for scheduling tasks in distributed heterogeneous computing and grid environments. The scheduling problem in heterogeneous environments is NP-hard, so a significant effort... more
This work presents the application of a micro evolutionary algorithm to solve a biobjective scheduling problem in heterogeneous grid computing environments. A new formulation of the bi-objective scheduling problem is introduced, by... more
Workflow scheduling in Grids becomes an important area as it allows users to process large scale problems in an atomic way. However, validating the performance of workflow scheduling strategies in real production environment cannot be... more
Although Grids have been used extensively for executing applications with compute-intensive jobs, there exist several applications with a large number of lightweight jobs. The overall processing undertaking of these applications involves... more
Scheduling problems have been thoroughly explored by the research community, but they acquire challenging characteristics in grid computing systems. In this context, it is important to have a scheduling strategy that can make efficient... more
Grids organize resource sharing, a fundamental requirement of large scientific collaborations. Seamless integration of grids into everyday use requires responsiveness, which can be provided by elastic Clouds, in the Infrastructure as a... more
In High Performance Computing centers, queuing systems are used by the users to access and manage the HPC resources through a set of interfaces. After job submission, users lose control of the job and they only have a very restricted set... more
Grid Resource Management tools evolved from manual discovery and task submission to sophisticated brokering solutions. User requirements created certain properties that resource managers have learned to support. This development is still... more
The increasing demand for resources of the high performance computing systems has led to new forms of collaboration of distributed systems such as interoperable grid systems that contain and manage their own resources. While with a single... more
In High Performance Computing centers, queuing systems are used by the users to access and manage the HPC resources through a set of interfaces. After job submission, users lose control of the job and they only have a very restricted set... more
Dealing with a large amount of data in Data Grids makes the requirement for efficient data access more critical. In this paper, we proposed a new approach to replication problem by organizing the data into several data categories that it... more
Download research papers for free!