Robert D. Blumofe

Adaptive and Reliable Parallel Computing on Networks of Workstations (2009)

Robert D. Blumofe

In this paper, we present the design of Cilk-NOW, a runtime system that adaptively and reliably executes functional Cilk programs in parallel on a network of UNIX workstations. Cilk (pronounced...

Cilk: An Efficient Multithreaded Runtime System (2008)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced “silk”) is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically and...

Thread Scheduling for Multiprogrammed Multiprocessors (2008)

Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton

We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler...

ATLAS: An Infrastructure for Global Computing (2007)

J. Eric Baldeschwieler, Robert D. Blumofe, Eric Brewer

In this paper, we present a proposed system architecture for global computing that we call Atlas, and we describe an early prototype that implements several of the mechanisms and policies that...

Executing Multithreaded Programs Efficiently (2007)

Charles E. Leiserson, R. Morgenthaler, Robert D. Blumofe

This thesis presents the theory, design, and implementation of Cilk (pronounced "silk") and Cilk-NOW. Cilk is a C-based language and portable runtime system for programming and...

Thread Scheduling for Multiprogrammed Multiprocessors (2001)

Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton

We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler...

Hoard: A Scalable Memory Allocator for Multithreaded Applications (2000)

Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe, Paul R. Wilson

Parallel, multithreaded C and C++ programs such as web servers, database managers, news servers, and scientific applications are becoming increasingly prevalent. For these applications, the memory...

Hoard: A Scalable Memory Allocator for Multithreaded Applications (2000)

Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe, Paul R. Wilson

Parallel, multithreaded C and C++ programs such as web servers, database managers, news servers, and scientific applications are becoming increasingly prevalent. For these applications, the memory...

The Data Locality of Work Stealing (2000)

Umut A. Acar, Guy E. Blelloch, Robert D. Blumofe

This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlled shared-memory machines. We present lower and upper bounds on the number of cache misses using...

Hoard: A Scalable Memory Allocator for Multithreaded Applications (2000)

Emery D. Berger, Kathryn S. McKinley, Robert D. Blumofe, Paul R. Wilson

Parallel, multithreaded programs such as web servers, database managers, news servers, and scientific applications are becoming increasingly prevalent. For these C and C++ applications, the memory...

The data locality of work stealing (2000)

Umut A. Acar, Guy E. Blelloch, Robert D. Blumofe

This paper studies the data locality of the work-stealing scheduling algorithm on hardware-controlled shared-memory machines, where movement of data to and from the cache is solely controlled by the...

Verification of a concurrent deque implementation (1999)

Robert D. Blumofe, C. Greg Plaxton, Sandip Ray

We prove the correctness of the concurrent deque component of a recent implementation of the work-stealing algorithm. Specifically, we prove that this concurrent deque implementation is...

Verification of a Concurrent Deque Implementation (1999)

Robert D. Blumofe, C. Greg Plaxton, Sandip Ray

We prove the correctness of the concurrent deque component of a recent implementation of the work-stealing algorithm. Specifically, we prove that this concurrent deque implementation is...

Verification of a concurrent deque implementation (1999)

Robert D. Blumofe, C. Greg Plaxton, Sandip Ray

We prove the correctness of the concurrent deque component of a recent implementation of the work-stealing algorithm. Specifically, we prove that this concurrent deque implementation is...

Thread scheduling for multiprogrammed multiprocessors (1998)

Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton

We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler...

Thread scheduling for multiprogrammed multiprocessors (1998)

Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton

We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler...

The performance of work stealing in multiprogrammed environments (extended abstract) (1998)

Robert D. Blumofe, Dionisios Papadopoulos

As small-scale, shared-memory multiprocessors make their way onto desktops, the high-performance parallel applications that run on these machines will have to live alongside other applications, such...

Space-Efficient Scheduling of Multithreaded Computations (1998)

Robert D. Blumofe, Charles E. Leiserson

This paper considers the problem of scheduling dynamic parallel computations to achieve linear speedup without using significantly more space per processor than that required for...

The Performance of Work Stealing in Multiprogrammed Environments (1998)

Robert D. Blumofe, Dionisios Papadopoulos

We study the performance of user-level thread schedulers in multiprogrammed environments. Our goal is a user-level thread scheduler that delivers efficient performance under multiprogramming without...

Hood: A User-Level Threads Library for Multiprogrammed Multiprocessors (1998)

Robert D. Blumofe, Dionisios Papadopoulos

The Hood user-level threads library delivers efficient performance under multiprogramming without any need for kernel-level resource management, such as coscheduling or process control. It does so by...

The Performance of Work Stealing in Multiprogrammed Environments (1998)

Robert D. Blumofe, Dionisios Papadopoulos

As small-scale, shared-memory multiprocessors make their way...

Hood: A User-Level Thread Library for Multiprogramming Multiprocessors (1998)

Robert D. Blumofe, Calvin Lin, Dionysios P. Papadopoulos

Contents: Chapter 1, Introduction; 1.1 The problem with static partitioning; 1.2 Contributions of this thesis...

Thread Scheduling for Multiprogrammed Multiprocessors (1998)

Nimar S. Arora, Robert D. Blumofe, C. Greg Plaxton

We present a user-level thread scheduler for shared-memory multiprocessors, and we analyze its performance under multiprogramming. We model multiprogramming with two scheduling levels: our scheduler...

The performance of work stealing in multiprogrammed environments (extended abstract) (1998)

Robert D. Blumofe, Dionisios Papadopoulos

We study the performance of user-level thread schedulers in multiprogrammed environments. Our goal is a user-level thread scheduler that delivers efficient performance under multiprogramming without...

Adaptive and reliable parallel computing on networks of workstations (1997)

Robert D. Blumofe, Philip A. Lisiecki

In this paper, we present the design of Cilk-NOW, a runtime system that adaptively and reliably executes functional Cilk programs in parallel on a network of UNIX workstations. Cilk (pronounced...

The Performance of Work Stealing in Multiprogrammed Environments (1997)

Robert D. Blumofe, Dionisios Papadopoulos

We study the performance of user-level thread schedulers in multiprogrammed environments. Our goal is a user-level thread scheduler that delivers efficient performance under multiprogramming without...

An analysis of dag-consistent distributed shared-memory algorithms (1996)

Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg

In this paper, we analyze the performance of parallel multi-threaded algorithms that use dag-consistent distributed shared memory. Specifically, we analyze execution time, page faults, and space...

Dag-Consistent Distributed Shared Memory (1996)

Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg, Charles E. Leiserson, Keith H. Randall

We introduce dag consistency, a relaxed consistency model for distributed shared memory which is suitable for multithreaded programming. We have implemented dag consistency in software for the Cilk...

ATLAS: An Infrastructure for Global Computing (1996)

J. Eric Baldeschwieler, Robert D. Blumofe, Eric A. Brewer

In this paper, we present a proposed system architecture for global computing that we call Atlas, and we describe an early prototype that implements several of the mechanisms and policies that...

An Analysis of Dag-Consistent Distributed Shared-Memory Algorithms (1996)

Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg, Charles E. Leiserson, Keith H. Randall

In this paper, we analyze the performance of parallel multithreaded algorithms that use dag-consistent distributed shared memory. Specifically, we analyze execution time, page faults, and space...

Cilk: An Efficient Multithreaded Runtime System (1996)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced “silk”) is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically and...

Cilk: An Efficient Multithreaded Runtime System (1996)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced “silk”) is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically and...

Dag-consistent distributed shared memory (1996)

Robert D. Blumofe, Matteo Frigo, Christopher F. Joerg, Charles E. Leiserson, Keith H. Randall

We introduce dag consistency, a relaxed consistency model for distributed shared memory which is suitable for multithreaded programming. We have implemented dag consistency in software for the Cilk...

Executing Multithreaded Programs Efficiently (1995)

Robert D. Blumofe, Charles E. Leiserson


Cilk: An efficient multithreaded runtime system (1995)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced “silk”) is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically and...

Cilk: An efficient multithreaded runtime system (1995)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced "silk") is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both...

Cilk: An Efficient Multithreaded Runtime System (1995)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced "silk") is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically...

Cilk: An Efficient Multithreaded Runtime System (1995)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced "silk") is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically...

Cilk: An efficient multithreaded runtime system (1995)

Robert D. Blumofe, Christopher F. Joerg, Bradley C. Kuszmaul, Charles E. Leiserson, Keith H. Randall, Yuli Zhou

Cilk (pronounced “silk”) is a C-based runtime system for multithreaded parallel programming. In this paper, we document the efficiency of the Cilk work-stealing scheduler, both empirically and...

Scheduling Multithreaded Computations by Work Stealing (1994)

Robert D. Blumofe, Charles E. Leiserson

This paper studies the problem of efficiently scheduling fully strict (i.e., well-structured) multithreaded computations on parallel computers. A popular and practical method of scheduling this kind...

Spectral Methods for Bisecting Graphs (1993)

Robert D. Blumofe

this paper have found widespread use in this application [24, 30]. Given the utility of graph bisection, it is disappointing (though not surprising) that this problem is difficult from a complexity...

Space-Efficient Scheduling of Multithreaded Computations (1993)

Robert D. Blumofe, Charles E. Leiserson

This paper considers the problem of scheduling dynamic parallel computations to achieve linear speedup without using significantly more space per processor than that required for a single-processor...

Space-Efficient Scheduling of Multithreaded Computations (Extended Abstract) (1993)

Robert D. Blumofe, Charles E. Leiserson

This paper considers the problem of scheduling dynamic parallel computations to achieve linear speedup without using significantly more space per processor than that required for a single-processor...

Managing Storage for Multithreaded Computations (1992)

Robert D. Blumofe

Multithreading has become a dominant paradigm in general purpose MIMD parallel computation. To execute a multithreaded computation on a parallel computer, a scheduler must order and allocate threads...