Academia.eduAcademia.edu

Distributed Operating Systems

description27 papers
group14 followers
lightbulbAbout this topic
Distributed Operating Systems are systems that manage a collection of independent computers and make them appear to users as a single coherent system. They coordinate resources, provide communication, and ensure reliability and security across multiple nodes, enabling efficient execution of applications in a distributed computing environment.
lightbulbAbout this topic
Distributed Operating Systems are systems that manage a collection of independent computers and make them appear to users as a single coherent system. They coordinate resources, provide communication, and ensure reliability and security across multiple nodes, enabling efficient execution of applications in a distributed computing environment.

Key research themes

1. How do distributed operating systems ensure scalability and flexibility in resource management across heterogeneous wide-area networks?

This theme explores the architectural designs and middleware solutions that enable distributed operating systems to handle resource management and service integration efficiently over diverse and geographically dispersed computing resources. It focuses on mechanisms for seamless resource discovery, naming, replication, and communication that provide scalability to millions of devices, including dynamically joining and leaving nodes, while abstracting complexity from users and applications.

Key finding: The Globe system introduces a uniform distributed object-based infrastructure that integrates communication, naming, replication, migration, fault tolerance, and security into a scalable middleware framework, allowing each... Read more
Key finding: HARD proposes a hierarchical and self-organizing overlay for resource discovery in highly heterogeneous and dynamic computing environments (Jungle Computing Systems). By organizing computing resources into multi-layered... Read more
Key finding: PODOS enhances a monolithic Linux kernel with four additional distributed components—a Communication Manager, a Resource Manager, a Global Inter-Process Communication system, and a Distributed File System—focused on... Read more
Key finding: This work designs a distributed system that mimics the mainframe’s centralized resource sharing via dynamic and transparent allocation of computing power from a pooled collection of UNIX workstations. It presents an approach... Read more

2. What kernel architectures and design principles improve reliability, adaptability, and performance in distributed operating systems?

This theme investigates kernel structures, componentization, and adaptation mechanisms that underpin distributed OS functionalities, including fault tolerance, dynamic system reconfiguration, and performance optimization. It highlights approaches balancing modularity with performance, supporting rapid changes in execution environment and software evolution, and integrating distributed resource management with kernel-level mechanisms.

Key finding: 2K introduces a component-based, architecturally-aware operating system designed to adapt dynamically to frequent changes in execution environment (e.g., bandwidth, memory) and slower software/hardware evolution (e.g.,... Read more
Key finding: Amoeba emphasizes transparency and scalability by providing a single-system illusion across distributed resources with a microkernel approach that supports multithreading and lightweight communication. The experiences... Read more
Key finding: This paper classifies distributed systems based on coupling granularity and discusses reliability issues in distributed OS design. It emphasizes the implications of implicit versus explicit parallelism in system transparency... Read more

3. How can resource sharing abstractions and mutual exclusion protocols be optimized to improve concurrent process coordination in distributed operating systems?

Focusing on synchronization mechanisms and resource abstractions, this theme addresses algorithms and methodologies to minimize contention delays, improve concurrency, and accommodate the needs of distributed systems for mutual exclusion and resource access. It reviews classic and novel proposals in mutual exclusion, critical section management, and mappings of logical abstractions onto physical resources to enhance performance and scalability in distributed OS environments.

Key finding: The paper identifies 'mapping dilemmas'—implementation decisions that inherently bias system performance—as central to OS design. It posits that traditional OSs’ fixed policies inadequately serve specialized applications... Read more
Key finding: Proposes a novel mutual exclusion algorithm that removes delay time by creating multiple instances of a data structure, assigning them dynamically, and simultaneously managing creation and deletion of instances. This allows... Read more

All papers in Distributed Operating Systems

The Amoeba distributed operating system has been in development and use for over eight years now, which is a long enough period to warrant taking a look back at its successes and failures. We will first describe the current version of the... more
This paper proposes a new idea which makes the processes not to wait for a long time to enter into a critical section to access a data structure which is already assigned to the other process and that process enters into critical section.... more
In recent years, Jungle Computing has emerged as a distributed computing paradigm based on simultaneous combination of various hierarchical and distributed computing environments which are composed by large number of heterogeneous... more
This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited. Permission is... more
Nowadays, due to the large amount of information and the need of rapid processing, the need for computers with up-to-date and efficient operating systems with high processing power is felt. During the last decades, progress in the field... more
This paper constitutes a work-in-progress report on the first, mostly conceptual phase of a major international effort in building and evaluating a distributed testbed for database application systems in safety-critical real-time... more
Internet of Things applications often require reducing the communication delay and the traffic between sensors and actuators. In addition, research and development of dataflow platforms is ongoing. In these platforms, to meet the... more
by r la
This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited. Permission is... more
Volunteer cloud is a new concept of cloud computing where volunteers, i.e., user machines are used instead of or in addition to dedicated resources. Although volunteer clouds offer potential benefits in terms of energy and cost saving as... more
Ubiquitous Computing (UC) is an emerging technology of sharing the information by using large collection of computing devices. Sharing of information is in terms of availability of information at every time, every where and in invisible... more
The classic network file systems we examined, NFS, CIFS, AFS, Coda, were designed as client-server applications. Clients talk to a service that, in this case, happens to be a file service. The actual file service is offered by a single... more
In recent years, Jungle Computing has emerged as a distributed computing paradigm based on simultaneous combination of various hierarchical and distributed computing environments which are composed by large number of heterogeneous... more
The next generation of many-core enabled large-scale computing systems relies on thousands of billions of heterogeneous processing cores connected to form a single computing unit. In such large-scale computing environments, resource... more
In recent years, Jungle Computing has emerged as a distributed computing paradigm based on simultaneous combination of various hierarchical and distributed computing environments which are composed by large number of heterogeneous... more
The next generation of many-core enabled large-scale computing systems relies on thousands of billions of heterogeneous processing cores connected to form a single computing unit. In such large-scale computing environments, resource... more
One of the most important potential benefits of workstation clusters (client/server computing systems) is resource sharing. By interconnecting a number of workstations using a suitable network, a large number of hardware and software... more
One of the most important potential benefits of workstation clusters (client/server computing systems) is resource sharing. By interconnecting a number of workstations using a suitable network, a large number of hardware and software... more
This paper presents research t o wards the formal speci cation and veri cation of a secure distributed system and secure application programs that run on it. We refer to the whole system | from hardware to application programs written in... more
PODOS is a performance oriented distributed operating system being developed to harness the performance capabilities of a cluster-computing environment. In order to address the growing demand for performance, we are designing a... more
Modern, distributed computing systems need to cope continuously with changes. We identify two kinds of changes: low frequency infrastructural changes, such as software upgrade; and frequent c hanges in the execution environment, such a s... more
Distributed Computing System(DCS) architectures have taken various forms through a considerably short development stage. In a DCS the underlying hardware characteristics are to be transparent to the application level processing. This... more
Distributed systems span a wide spectrum in the design space. In this paper we will look at the various kinds and discuss some of the reliability issues involved. In the first half of the paper we will concentrate on the causes of... more
As distributed computing becomes more widespread, both in high-energy physics and in other applications, centralized operating systems will gradually give way to distributed ones. In this paper we discuss some current research on five... more
Assets Control ("OFAC") administers and enforces economic and trade sanctions based on United States ("U.S. ") foreign policy and national security goals. These sanctions are imposed upon targeted foreign countries and regimes,... more
This article considers the terrorism financing risk associated with the growth of Financial Technology (FinTech) innovations and in particular, focuses on virtual currency (VC) products and services. The ease with which cross-border... more
A perceived risk in terrorist use and adoption of virtual currencies over the past few years has driven the widespread notion that this financial innovation represents a threat to security. This dissertation addresses this issue,... more
In recent years, Jungle Computing has emerged as a distributed computing paradigm based on simultaneous combination of various hierarchical and distributed computing environments which are composed by large number of heterogeneous... more
Government officials and stakeholders in the financial industry trying to counter transnational terrorist financing face a wicked problem. This problem is ill-defined, with multiple causes and uncertainty on whether the solutions... more
In this paper, we present a review of the issues that affect the software requirements for a local area network. We introduce protocols for the local area networks and characterize their software needs. Two approaches to operating systems... more
With the recent data deluge, search applications are confronted with the complexity of data they handle in terms of volume, velocity and variety. Traditional frameworks such as Lucene [1], index text for efficient searching but do not... more
In May 2015, a new globally recognizable and reliable medium for savings and trade was launched. This medium referred to as BitGold is a consumer-focused internet platform offering free global payments combined with unprecedented access... more
Distributed computer systems have been the subject of a vast amount of research. Many prototype distributed computer systems have been built at university, industrial, commercial, and government research laboratories, and production... more
Download research papers for free!