- Parallel Computing and Optimization Techniques
- Distributed and Parallel Computing Systems
- Distributed systems and fault tolerance
- Advanced Data Storage Technologies
- Interconnection Networks and Systems
- Microgrid Control and Optimization
- Real-time simulation and control systems
- Optimization and Search Problems
- Network Time Synchronization Technologies
- Experimental Learning in Engineering
- Graph Theory and Algorithms
- Smart Grid Energy Management
- Energy Harvesting in Wireless Networks
- Cloud Computing and Resource Management
Friedrich Schiller University Jena
2023
Google (United States)
2016
Stanford University
1989-2003
Ford Motor Company (United States)
1998
The overall goals and major features of the directory architecture for shared memory (Dash) are presented. The fundamental premise behind the architecture is that it is possible to build a scalable high-performance machine with a single address space and coherent caches. Dash is scalable in that it achieves linear or near-linear performance growth as the number of processors increases from a few to a few thousand. This results from distributing the memory among the processing nodes and using a network with scalable bandwidth to connect the nodes. The architecture allows shared data to be cached, significantly reducing latency...
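As a rough illustration of the directory idea described above (not Dash's actual hardware design), the sketch below tracks which caches share each memory block so that a write triggers point-to-point invalidations instead of a broadcast. All class and method names are hypothetical.

```python
# Illustrative sketch of directory-based invalidation: each memory block has a
# home-node directory entry recording which caches hold it, so a write
# invalidates only those sharers instead of broadcasting to every cache.

class DirectoryEntry:
    def __init__(self):
        self.sharers = set()   # ids of caches holding this block
        self.owner = None      # cache holding it in modified state, if any

class Directory:
    def __init__(self, num_blocks):
        self.entries = [DirectoryEntry() for _ in range(num_blocks)]

    def read(self, block, cache_id):
        e = self.entries[block]
        e.sharers.add(cache_id)              # remember the new sharer
        return f"supply block {block} to cache {cache_id}"

    def write(self, block, cache_id):
        e = self.entries[block]
        # point-to-point invalidations to current sharers only
        invalidated = [c for c in e.sharers if c != cache_id]
        e.sharers = {cache_id}
        e.owner = cache_id
        return [f"invalidate block {block} in cache {c}" for c in invalidated]

d = Directory(num_blocks=4)
d.read(0, cache_id=1)
d.read(0, cache_id=2)
print(d.write(0, cache_id=1))   # only cache 2 receives an invalidation
```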
A fundamental problem that any scalable multiprocessor must address is the ability to tolerate high-latency memory operations. This paper explores the extent to which multiple hardware contexts per processor can help mitigate the negative effects of latency. In particular, we evaluate the performance of a directory-based cache-coherent multiprocessor using reference traces obtained from three parallel applications. We explore the case where there is a small, fixed number of contexts (2-4) and the context switch overhead is low. In contrast to previously...
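The latency-hiding argument can be made concrete with a back-of-the-envelope model: with a handful of contexts, the processor switches to another ready context on each long-latency miss, so more of the miss time is covered by useful work. The sketch below is a toy utilization model under assumed run length, miss latency, and switch cost; the numbers are illustrative and not taken from the paper.

```python
# Hedged toy model of latency hiding with a few hardware contexts.
# Parameters (run length, miss latency, switch cost) are assumptions.

def utilization(contexts, run_cycles=10, miss_latency=50, switch_cost=1):
    """Fraction of time the processor does useful work when it switches
    to another ready context on every long-latency cache miss."""
    busy_per_miss = contexts * (run_cycles + switch_cost)
    if busy_per_miss >= run_cycles + miss_latency:
        # enough contexts to fully cover the miss latency
        return run_cycles / (run_cycles + switch_cost)
    return (contexts * run_cycles) / (run_cycles + miss_latency)

for n in (1, 2, 4):
    print(f"{n} context(s): utilization ~ {utilization(n):.2f}")
```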
The cache invalidation patterns of several parallel applications are analyzed. The results are based on multiprocessor simulations with 8, 16, and 32 processors. To provide deeper insight into the observed behavior, the invalidations are linked to the high-level data objects causing them in the programs. To predict what the patterns would look like beyond 32 processors, a classification scheme for the data objects found in parallel programs is proposed. The classification provides a powerful conceptual tool to reason about applications. Results indicate that it should be possible to scale...
To make shared-memory multiprocessors scalable, researchers are now exploring cache coherence protocols that do not rely on broadcast, but instead send invalidation messages to individual caches that contain stale data. The feasibility of such directory-based protocols is highly sensitive to the invalidation patterns that parallel programs exhibit. In this paper, we analyze the invalidation patterns caused by several parallel applications and investigate the effect these patterns have on a directory-based protocol. Our results are based on multiprocessor traces with 4, 8, and 16 processors. To gain insight into...
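A hedged sketch of the kind of trace-driven measurement such studies rely on: replay a memory reference trace, track the sharers of each block, and histogram how many caches each write must invalidate. The trace format and helper names below are assumptions for illustration only.

```python
# Illustrative trace replay: count, for every write, how many other caches
# hold the block and would therefore receive an invalidation message.

from collections import Counter

def invalidation_histogram(trace):
    """trace: iterable of (op, block, cache_id) with op in {'r', 'w'} (assumed format)."""
    sharers = {}                           # block -> set of caches holding it
    hist = Counter()
    for op, block, cache in trace:
        held = sharers.setdefault(block, set())
        if op == 'w':
            hist[len(held - {cache})] += 1   # caches that must be invalidated
            sharers[block] = {cache}
        else:
            held.add(cache)
    return hist

trace = [('r', 0, 1), ('r', 0, 2), ('w', 0, 3), ('r', 1, 1), ('w', 1, 1)]
print(invalidation_histogram(trace))       # Counter({2: 1, 0: 1})
```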
The authors present and analyze algorithms for managing the distributed shared memory in nonuniform-memory-access multiprocessors and related systems. The competitive properties of these algorithms guarantee that their performance is within a small constant factor of optimal, even though they make no use of any information about reference patterns. Both hardware and software implementation concerns are covered. A case study of the Mach operating system indicates that integration into existing systems does not pose major problems. On the other...
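One way to see the competitive flavor of such algorithms is the classic rent-or-buy rule: keep paying for remote accesses until their accumulated cost matches the cost of replicating the page locally, then replicate. In the standard analysis this stays within a factor of two of the offline optimum regardless of the reference pattern. The sketch below is a generic illustration of that rule, not the authors' actual algorithm; the costs and API are assumed.

```python
# Generic rent-or-buy sketch for page replication in a DSM setting (illustrative).

class PageManager:
    def __init__(self, replicate_cost=100, remote_cost=1):
        self.replicate_cost = replicate_cost
        self.remote_cost = remote_cost
        self.remote_spent = {}          # page -> cost paid so far on remote accesses
        self.local_copies = set()       # pages already replicated locally

    def access(self, page):
        if page in self.local_copies:
            return 0                                     # local hit, no cost
        spent = self.remote_spent.get(page, 0) + self.remote_cost
        self.remote_spent[page] = spent
        if spent >= self.replicate_cost:                 # break-even point reached
            self.local_copies.add(page)
            return self.remote_cost + self.replicate_cost
        return self.remote_cost

mgr = PageManager(replicate_cost=3)
print([mgr.access('p') for _ in range(5)])   # [1, 1, 4, 0, 0]
```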
The versatile hardware-in-the-loop laboratory can aid large-system development by refining the models of individual components during integration. The selection of hardware, software, and architecture is discussed, and some general notes on the facility are given.
We propose Powernet as an end-to-end open-source technology for economically efficient, scalable, and secure coordination of grid resources. It offers integrated hardware and software solutions that are judiciously divided between local embedded sensing, computing, and control, networked with cloud-based high-level real-time optimal operation of not only centralized but also millions of distributed resources of various types. Our goal is to enable penetration of 50% or higher intermittent renewables while...
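As a purely illustrative sketch of the coordination pattern described (cloud-level optimization dispatching setpoints to many distributed resources), the snippet below splits a power target across devices in proportion to their available headroom. The resource names, units, and proportional rule are assumptions, not Powernet's actual algorithms.

```python
# Toy cloud-level dispatch: allocate a power target across distributed resources
# in proportion to each one's headroom (capacity minus current output). Assumed API.

def dispatch(target_kw, resources):
    headroom = {name: cap - out for name, (cap, out) in resources.items()}
    total = sum(headroom.values()) or 1.0
    return {name: min(h, target_kw * h / total) for name, h in headroom.items()}

resources = {               # name: (capacity_kw, current_output_kw)
    "battery_a": (50.0, 10.0),
    "ev_charger": (20.0, 0.0),
    "solar_inv": (30.0, 25.0),
}
print(dispatch(target_kw=40.0, resources=resources))
```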
Peachy Parallel Assignments are model assignments for teaching parallel computing concepts. They are competitively selected for being adoptable by other instructors and "cool and inspirational" to students. Thus, they allow instructors to easily add high-quality assignments that will engage students in their classes.