Seyong Lee

ORCID: 0000-0001-8872-4932
Research Areas
  • Parallel Computing and Optimization Techniques
  • Distributed and Parallel Computing Systems
  • Advanced Data Storage Technologies
  • Cloud Computing and Resource Management
  • Embedded Systems Design Techniques
  • 3D IC and TSV technologies
  • Electronic Packaging and Soldering Technologies
  • Interconnection Networks and Systems
  • Distributed systems and fault tolerance
  • Software System Performance and Reliability
  • Mercury impact and mitigation studies
  • Auditing, Earnings Management, Governance
  • Radiation Effects in Electronics
  • Lattice Boltzmann Simulation Studies
  • Scientific Computing and Data Management
  • Financial Reporting and Valuation Research
  • Real-Time Systems Scheduling
  • Additive Manufacturing and 3D Printing Technologies
  • Real-time simulation and control systems
  • Particle Detector Development and Performance
  • Insurance and Financial Risk Management
  • Impact of AI and Big Data on Business and Society
  • Heavy metals in environment
  • Mine drainage and remediation techniques
  • Advanced Database Systems and Queries

Oak Ridge National Laboratory
2015-2024

Oak Ridge Associated Universities
2024

Korea Advanced Institute of Science and Technology
2016-2020

Korea Environment Institute
2017-2018

Daejeon Institute of Science and Technology
2018

Korea Institute of Geoscience and Mineral Resources
2018

Gwangju Institute of Science and Technology
2010-2015

Purdue University West Lafayette
2006-2010

Dong-A University
2008

Kia Motors (South Korea)
2006

GPGPUs have recently emerged as powerful vehicles for general-purpose high-performance computing. Although the new Compute Unified Device Architecture (CUDA) programming model from NVIDIA offers improved programmability for general computing, programming GPGPUs is still complex and error-prone. This paper presents a compiler framework for automatic source-to-source translation of standard OpenMP applications into CUDA-based GPGPU applications. The goal of this translation is to further improve programmability and make existing OpenMP applications amenable to execution on GPGPUs. In...

10.1145/1504176.1504194 article EN 2009-02-14

General-Purpose Graphics Processing Units (GPGPUs) are promising parallel platforms for high performance computing. The CUDA (Compute Unified Device Architecture) programming model provides improved programmability for general computing on GPGPUs. However, its unique execution model and memory model still pose significant challenges for developers of efficient GPGPU code. This paper proposes a new programming interface, called OpenMPC, which builds on OpenMP to provide an abstraction of the complex CUDA programming model and offers high-level controls...

10.1109/sc.2010.36 article EN 2010-11-01

The Cetus tool provides an infrastructure for research on multicore compiler optimizations that emphasizes automatic parallelization. The infrastructure, which targets C programs, supports source-to-source transformations, is user-oriented and easy to handle, and provides the most important parallelization passes as well as the underlying enabling techniques.

10.1109/mc.2009.385 article EN Computer 2009-12-01

GPGPUs have recently emerged as powerful vehicles for general-purpose high-performance computing. Although the new Compute Unified Device Architecture (CUDA) programming model from NVIDIA offers improved programmability for general computing, programming GPGPUs is still complex and error-prone. This paper presents a compiler framework for automatic source-to-source translation of standard OpenMP applications into CUDA-based GPGPU applications. The goal of this translation is to further improve programmability and make existing OpenMP applications amenable to execution on GPGPUs. In...

10.1145/1594835.1504194 article EN ACM SIGPLAN Notices 2009-02-14

Flexible, accurate performance predictions offer numerous benefits, such as gaining insight into and optimizing applications and architectures. However, the development and evaluation of such performance models has been a major research challenge, due to architectural complexities. To address this challenge, we have designed and implemented a prototype system, named COMPASS, for automated performance model generation and prediction. COMPASS generates a structured performance model from the target application's source code using static analysis, and then it evaluates various...

10.1145/2751205.2751220 article EN 2015-06-02

This paper presents the Open Accelerator Research Compiler (OpenARC): an open-source framework that supports the full feature set of OpenACC V1.0 and performs source-to-source transformations, targeting heterogeneous devices, such as NVIDIA GPUs. Combined with its high-level, extensible Intermediate Representation (IR) and rich semantic annotations, OpenARC serves as a powerful research vehicle for prototyping optimization, instrumentation, debugging, performance analysis, and autotuning. In fact, it is...

10.1145/2600212.2600704 article EN 2014-06-20

10.1016/j.jpdc.2012.12.012 article EN Journal of Parallel and Distributed Computing 2012-12-31

Graphics Processing Unit (GPU)-based parallel computer architectures have shown increased popularity as a building block for high performance computing, and possibly for future Exascale computing. However, their programming complexity remains a major hurdle to their widespread adoption. To provide better abstractions for programming GPU architectures, researchers and vendors have proposed several directive-based programming models. These models provide different levels of abstraction and require different levels of programming effort to port and optimize applications. Understanding these...

10.5555/2388996.2389028 article EN 2012-11-10

This paper presents a directive-based, high-level programming framework for high-performance reconfigurable computing. It takes a standard, portable OpenACC C program as input and generates a hardware configuration file for execution on FPGAs. We implemented this prototype system using our open-source OpenARC compiler; it performs source-to-source translation and optimization of the input OpenACC program into an OpenCL code, which is further compiled into an FPGA program by the backend Altera Offline Compiler. Internally, the design uses...

10.1109/ipdps.2016.28 article EN 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2016-05-01

With the rise of general-purpose computing on graphics processing units (GPGPU), influence from consumer markets can now be seen across the spectrum of computer architectures. In fact, many of the highest-ranking Top500 HPC systems include these accelerators. Traditionally, GPUs have been connected to the CPU via the PCIe bus, which has proved to be a significant bottleneck for scalable scientific applications. Now, a trend toward tighter integration between CPU and GPU has removed this bottleneck and unified the memory hierarchy for both types of cores. We examine...

10.1145/2212908.2212924 article EN 2012-05-15

Heterogeneous computing with accelerators is growing in importance in high performance computing (HPC). Recently, application datasets have expanded beyond the memory capacity of these accelerators, and often beyond that of their hosts. Meanwhile, nonvolatile memory (NVM) storage has emerged as a pervasive component of HPC systems because NVM provides massive amounts of capacity at affordable cost. Currently, for accelerator applications to use NVM, they must manually orchestrate data movement across multiple memories, and this approach only...

10.1109/sc.2018.00035 article EN 2018-11-01

Across embedded, mobile, enterprise, and high performance computing systems, computer architectures are becoming more heterogeneous and complex. This complexity is causing a crisis in programming systems and performance portability. Several programming systems are working to address these challenges, but the increasing architectural diversity is forcing software stacks and applications to be specialized for each architecture. As we show, all of these approaches critically depend on their runtime system for discovery, execution, scheduling, and data...

10.1109/hpec49654.2021.9622873 article EN 2021-09-20

Graphics Processing Unit (GPU)-based parallel computer architectures have shown increased popularity as a building block for high performance computing, and possibly for future Exascale computing. However, their programming complexity remains a major hurdle to their widespread adoption. To provide better abstractions for programming GPU architectures, researchers and vendors have proposed several directive-based programming models. These models provide different levels of abstraction and require different levels of programming effort to port and optimize applications. Understanding these...

10.1109/sc.2012.51 article EN International Conference for High Performance Computing, Networking, Storage and Analysis 2012-11-01

This paper introduces PapyrusKV, a parallel embedded key-value store (KVS) for distributed high-performance computing (HPC) architectures that offer potentially massive pools of nonvolatile memory (NVM). PapyrusKV stores keys with their values in arbitrary byte arrays across multiple NVMs in the system. It provides standard KVS operations such as put, get, and delete. More importantly, it provides advanced features for HPC such as dynamic consistency control, zero-copy workflow, and asynchronous checkpoint/restart. Beyond...

10.1145/3126908.3126943 article EN 2017-11-08

OpenACC was launched in 2010 as a portable programming model for heterogeneous accelerators. Although various implementations already exist, no extensible, open-source, production-quality compiler support is available to the community. This deficiency poses a serious risk for HPC application developers targeting GPUs and other accelerators, and it limits experimentation and progress for the specification. To address this deficiency, Clacc is a recent effort, funded by the US Exascale Computing Project, to develop production...

10.1109/llvm-hpc.2018.8639349 article EN 2018-11-01

Computer architecture experts expect that non-volatile memory (NVM) hierarchies will play a more significant role in future systems, including mobile, enterprise, and HPC architectures. With this expectation in mind, we present NVL-C: a novel programming system that facilitates the efficient and correct programming of NVM main memory systems. The NVL-C programming abstraction extends C with a small set of intuitive language features that target NVM main memory and can be combined directly with the traditional C memory model for DRAM. We have designed these new features to enable...

10.1145/2907294.2907303 article EN 2016-05-31

GPU performance of the lattice Boltzmann method (LBM) depends heavily on memory access patterns. When LBM is advanced with GPUs on complex computational domains, geometric data is typically accessed indirectly, and lattice data is accessed lexicographically in a Structure of Arrays (SoA) layout. Although there are a variety of existing access patterns beyond these typical choices, no study has yet examined the relative efficacy between them. Here, we compare a suite of access schemes via empirical testing and performance modeling. We find strong evidence that semi-direct...

10.1109/ipdps.2018.00092 article EN 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2018-05-01

Fine-grained cycle sharing (FGCS) systems aim at utilizing the large amount of computational resources available on the Internet. In FGCS, host computers allow guest jobs to utilize CPU cycles if the jobs do not significantly impact the local users of a host. A characteristic of such resources is that they are generally provided voluntarily and their availability fluctuates highly. Guest jobs may fail because of unexpected resource unavailability. To provide fault tolerance without adding significant overhead, it is necessary to predict...

10.1109/hpdc.2006.1652140 article EN 2006-07-21

Sparse matrix-vector (SpMV) multiplication is a widely used kernel in scientific applications. In these applications, the SpMV kernel is usually deeply nested within multiple loops and thus executed a large number of times. We have observed that there can be significant performance variability, due to irregular memory access patterns. Static optimizations are difficult because the patterns may be known only at runtime. In this paper, we propose adaptive runtime tuning mechanisms to improve the performance of parallel SpMV on distributed...

10.1145/1375527.1375558 article EN 2008-06-07