Academia.eduAcademia.edu

Grid Scheduling

description839 papers
group10 followers
lightbulbAbout this topic
Grid scheduling refers to the process of allocating resources and managing tasks across a distributed computing environment, known as a grid. It involves optimizing the execution of jobs by considering factors such as resource availability, job priorities, and execution times to enhance overall system performance and efficiency.
lightbulbAbout this topic
Grid scheduling refers to the process of allocating resources and managing tasks across a distributed computing environment, known as a grid. It involves optimizing the execution of jobs by considering factors such as resource availability, job priorities, and execution times to enhance overall system performance and efficiency.

Key research themes

1. How can online and hierarchical scheduling models effectively allocate jobs across multiple machines in Grid environments to minimize makespan?

This research area focuses on the design, analysis, and implementation of online scheduling algorithms tailored for Grids composed of multiple machines (clusters) with identical processors within each machine. It investigates non-preemptive scheduling of parallel jobs submitted over time, emphasizing allocation strategies that consider the structural constraints of Grids — notably that jobs run exclusively on processors within a single machine. The goal is to minimize makespan while managing the dynamic arrival of jobs and heterogeneity across machines. Hierarchical models separate scheduling into job-to-machine allocation followed by local scheduling, addressing practical limitations such as overheads in job migration and decentralized control. These models matter because efficient scheduling directly impacts Grid resource utilization, user satisfaction, and system throughput in highly distributed computational environments.

Key finding: Introduces a model extending classic multiprocessor scheduling by structuring processors into multiple machines, where jobs must execute wholly within one machine. Demonstrates that Garey and Graham’s list scheduling performs... Read more
Key finding: Proposes a two-stage online scheduling model where jobs are first assigned to machines and then locally scheduled. Introduces an adaptive admissible allocation algorithm and provides competitive analysis showing improved load... Read more
Key finding: Applies particle swarm optimization (PSO) meta-heuristics to centralized scheduling of independent jobs on heterogeneous Grid resources, noting that PSO tends to stagnate in local minima for high-dimensional problems. The... Read more
Key finding: Develops a novel hybrid meta-heuristic combining PSO and GA for centralized scheduling of independent jobs on a static Grid model, addressing PSO’s tendency to stagnate locally in large problem instances. Experimental... Read more

2. What is the comparative performance and theoretical basis of immediate (online) versus batch scheduling heuristics in Grid resource allocation?

This theme concerns the evaluation and analysis of heuristic scheduling approaches that decide how and when tasks/jobs are assigned to computational resources in Grid environments. Immediate (online) scheduling algorithms allocate tasks as soon as they arrive, which suits dynamic, real-time scenarios but may yield suboptimal load balancing. Batch scheduling collects a set of tasks and schedules them jointly at specific intervals, usually resulting in better makespan and resource utilization but incurring scheduling delay. Understanding the applicability, strengths, weaknesses, and trade-offs among heuristics such as Minimum Execution Time (MET), Minimum Completion Time (MCT), Min-Min, Max-Min, and their variants informs the design of practical Grid schedulers that balance efficiency, responsiveness, and load balance.

Key finding: Provides a detailed review and comparative analysis of immediate (online) scheduling heuristics like MET and MCT, alongside batch mode heuristics like Min-Min and Max-Min, highlighting their algorithmic principles and... Read more
Key finding: Proposes Max-Average, an enhanced Max-Min scheduling algorithm that improves load balancing and makespan minimization by combining characteristics of Max-Min and Min-Min heuristics. The algorithm operates in two phases:... Read more
Key finding: Develops and empirically evaluates hybrid scheduling algorithms combining deadline and slack time parameters in soft real-time Grid environments. By testing against real workload traces, the study finds that integrating slack... Read more

3. How do groupings and structural characteristics of job submissions affect scheduling performance and resource utilization in Grid systems?

This research domain examines the patterns in which jobs are submitted to Grid systems—such as batch submissions, continued sequences, or bursty job arrivals—and their implications on scheduling efficiency, resource consumption, and user experience. Characterizing and understanding such grouped job behaviors are essential to optimize scheduling policies, load balancing, and performance prediction. The focus lies on revealing real workload properties and how such groupings affect takeover scheduling mechanisms, slowdown metrics, and effective CPU utilization, contributing practical insights for improving Grid workload management.

Key finding: Analyzes three types of job groupings—batch, continued, and bursty submissions—in traces from production and experimental Grids, confirming that grouped jobs account for up to 96% of total CPU consumption. The study reveals... Read more

All papers in Grid Scheduling

The goal of the Grid Application Development Software (GrADS) Project is to provide programming tools and an execution environment to ease program development for the Grid. This paper presents recent extensions to the GrADS software... more
Even though middleware support for grid computing has been the subject of extensive research, scheduling policies for the grid context have not been much studied. In addition to processor utilization, it is important to consider the... more
In data-intensive applications data transfer is a primary cause of job execution delay. Data access time depends on bandwidth. The major bottleneck to supporting fast data access in Grids is the high latencies of Wide Area Networks and... more
Large distributed grid systems pose new challenges in job scheduling due to complex workload characteristics and system characteristics. Due to the numerous parameters that must be considered and the complex interactions that can occur... more
Desktop grids have already been used to perform some of the largest computations in the world and have the potential to grow by several more orders of magnitude. However current approaches to utilizing desktop resources require either... more
An automatic Vehicle-to-Grid (V2G) technology can contribute to the utility grid. V2G technology has drawn great interest in the recent years. Success of the sophisticated automatic V2G research depends on efficient scheduling of gridable... more
In this paper, we propose a new algorithm for fair scheduling, and we compare it to other scheduling schemes such as the Earliest Deadline First (EDF) and the First Come First Served (FCFS) schemes. Our algorithm uses a max-min fair... more
Over the past decade, the grid has emerged as an attractive platform to tackle various large-scale problems, especially in science and engineering. One primary issue associated with the efficient and effective utilization of heterogeneous... more
Control) has been developed by the CERN LHCb physics experiment to facilitate large scale simulation and user analysis tasks spread across both grid and non-grid computing resources. It consists of a small set of distributed stateless... more
Because of its sheer size, Computational Grids (CGs) require advanced methodologies and strategies to efficiently schedule users tasks and applications to resources. Scheduling becomes even more challenging when energyefficiency,... more
This work concentrates on the design of a system intended for study of advanced scheduling techniques for planning various types of jobs in a Grid environment. The solution is able to deal with common problems of the job scheduling in... more
In the applied basic model, the Grid system consists of a large number of identical processors that are divided into several machines. Jobs are independent, they have a fixed degree of parallelism, and they are submitted over time.... more
Optical networking technologies are expected to play an important role in creating an efficient infrastructure for supporting advanced grid applications. Since both the scheduling methods in grid computing and optical networks are limited... more
Grid computing infrastructures embody a cost-effective computing paradigm that virtualises heterogeneous system resources to meet the dynamic needs of critical business and scientific applications. These applications range from batch... more
Nowadays, grid computing is increasingly showing a service-oriented tendency and as a result, providing quality of service (QoS) has raised as a relevant issue in such highly dynamic and non-dedicated systems. In this sense, the role of... more
We address non-preemptive non-clairvoyant online scheduling of parallel jobs on a Grid. We consider a Grid scheduling model with two stages. At the first stage, jobs are allocated to a suitable Grid site, while at the second stage, local... more
Grid computing has become one of the most important research topics that appeared in the field of computing in the last years. Simultaneously, we have noticed the growing popularity of new Web-based technologies which allow us to create... more
Grid schedulers require individual activity performance predictions to map workflow activities on different Grid sites. The effectiveness of the scheduling systems is hampered by inaccurate predictions due to the inability of existing... more
In this paper, we propose an efficient non-linear task workload prediction mechanism incorporated with a fair scheduling algorithm for task allocation and resource management in Grid computing. Workload prediction is accomplished in a... more
Streaming applications such as video-based surveillance, habitat monitoring, and emergency response are good candidates for executing on high-performance computing (HPC) resources, due to their high computation and communication needs.... more
Grids consist of both dedicated and non-dedicated clusters. For effective mapping of parallel applications on grid resources, a grid metascheduler has to evaluate different sets of resources in terms of predicted execution times for the... more
Grid simulation tools provide frameworks for simulating application scheduling in various Grid infrastructures. However, while experimenting with many existing tools, we have encountered two main shortcomings: (i) lack of tools for... more
This paper proposes a novel schedule-based approach for scheduling a continuous stream of batch jobs on the machines of a computational Grid. Our new solutions represented by dispatching rule Earliest Gap-Earliest Deadline First (EG-EDF)... more
When deploying Grid infrastructure, the problem of dimensioning arises: how many servers to provide, where to place them, and which network to install for interconnecting server sites and users generating Grid jobs? In contrast to... more
The user-level brokers in grids consider individual application QoS requirements and minimize their cost without considering demands from other users. This results in contention for resources and sub-optimal schedules. Meta-scheduling in... more
We propose a biologically inspired and fullydecentralized approach to the organization of computation that is based on the autonomous scheduling of strongly mobile agents on a peer-to-peer network. Our approach achieves the following... more
The Service Level Agreement (SLA) based grid superscheduling approach promotes coordinated resource sharing. Superscheduling is facilitated between administratively and topologically distributed grid sites via grid schedulers such as... more
In multicluster systems, and more generally in grids, jobs may require co-allocation, that is, the simultaneous or coordinated access of single applications to resources of possibly multiple types in multiple locations managed by... more
Scheduling is an important factor for the efficient execution of computational workflows on Grid environments. A large number of static scheduling heuristics has been presented in the literature. These algorithms allocate tasks before job... more
The Organic Grid is a biologically inspired and fully-decentralized approach to the organization of computation that is based on the autonomous scheduling of strongly mobile agents on a peer-to-peer network. Through the careful design of... more
Grids organize resource sharing, a fundamental requirement of large scientific collaborations. Seamless integration of Grids into everyday use requires responsiveness, which can be provided by elastic Clouds, in the Infrastructure as a... more
Scheduling application in grid is a complex task that often fails due to nonavailability of resources and the required execution environment in the resources. The CARE Resource Broker (CRB) proposed in this paper addresses such scheduling... more
In this paper, we address non-preemptive online scheduling of parallel jobs on a Grid. Our Grid consists of a large number of identical processors that are divided into several machines. We consider a Grid scheduling model with two... more
Grid scheduling is essential to Quality of Service provisioning as well as to efficient management of grid resources. Grid scheduling usually considers the state of the grid resources as well application demands. However, such demands are... more
Computational grids are highly complex distributed systems (involving multiple organizations with different goals and policies) which aim at providing computing services without the users need to know the location and features of the... more
In this paper we revisit the supernode-shape selection problem, that has been widely discussed in bibliography. In general, the selection of the supernode transformation greatly affects the parallel execution time of the transformed... more
Scheduling and resource allocation in large scale distributed environments, such as Computational Grids (CGs), arise new requirements and challenges not considered in traditional distributed computing environments. Among these new... more
We evaluate job scheduling algorithms that integrate both tasks of Grid scheduling: job allocation to Grid sites and local scheduling at the sites. We propose and analyze an adaptive job allocation scheme named admissible allocation. The... more
A Grid Resource Broker for a Grid domain, or also known as metascheduler, is a middleware component used for matching works to available Grid resources from one or more IT organizations. A Grid meta-scheduler usually has its own... more
Implementation of a commercial application to a Grid infrastructure introduces new challenges in managing the quality-of-service (QoS) requirements; most stem from the fact that negotiation on QoS between the user and the service provider... more
Complex applications are describing using workflows. Execution of these workflows in Grid environments require optimized assignment of tasks on available resources according with different constrains. This paper presents a decentralized... more
Grids consist of the aggregation of numerous dispersed computational, storage and network resources, able to satisfy even the most demanding computing jobs. Due to the data-intensive nature of Grid jobs, there is an increasing interest in... more
Grid computing infrastructures are inherently dynamic and unpredictable environments shared by many users. Grid schedulers aim to make efficient use of grid resources while providing the best possible performance to the grid applications... more
The workload management task of the DataGrid project was mandated to define and implement a suitable architecture for distributed scheduling and resource management in a Grid environment. The result was the design and implementation of a... more
The United States Environmental Protection Agency forecasts the 2011 national IT electric energy expendi ture will grow toward $7.4 billion [1]. In parallel to economic IT energy concerns, the general public and environmental advocacy... more
Grid resource management is not just about scheduling jobs on the fastest machines, but rather about scheduling all compute objects and all data objects on machines whose capabilities match the requirements, while preserving site... more
The computing-intensive data mining for inherently Internet-wide distributed data, referred to as Distributed Data Mining (DDM), calls for the support of a powerful Grid with an effective scheduling framework. DDM often shares the... more
The use of a dynamic reconfigurable optical network is an important requirement for the new advanced resource-intensive, highly distributed Grid applications that begin to emerge on the e-Science field. In this paper, we propose an... more
Download research papers for free!