NFDI4DS | UHH-SEMS - Publication Details

Pei‐Hung Lin

ORCID: 0000-0003-4977-814X

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5035121838

Research Areas

Parallel Computing and Optimization Techniques
Advanced Data Storage Technologies
Distributed and Parallel Computing Systems
Software Engineering Research
Scientific Computing and Data Management
Topic Modeling
Software System Performance and Reliability
Research Data Management Practices
Stellar, planetary, and galactic studies
Software Testing and Debugging Techniques
Lattice Boltzmann Simulation Studies
Machine Learning and Data Classification
Distributed systems and fault tolerance
Cloud Computing and Resource Management
Formal Methods in Verification
Astro and Planetary Science
Advanced Neural Network Applications
Embedded Systems Design Techniques
Semantic Web and Ontologies
Algorithms and Data Compression
Logic, programming, and type systems
Solar and Space Plasma Dynamics
Natural Language Processing Techniques
Meteorological Phenomena and Simulations
Electrochemical sensors and biosensors

Lawrence Livermore National Laboratory
2015-2024

Iowa State University
2023

University of California, Merced
2023

Argonne National Laboratory
2023

National Taipei University of Technology
2018-2019

University of Minnesota
2008-2014

National Taipei University of Nursing and Health Science
2009-2010

The late-time dynamics of the single-mode Rayleigh-Taylor instability

OPENALEX - Publications

Praveen Ramaprabhu Guy Dimonte Paul R. Woodward Chris L. Fryer Gabriel Rockefeller and 3 more

We report on numerical simulations of the detailed evolution single mode Rayleigh-Taylor [Lord Rayleigh, Scientific Papers II (Cambridge University Press, Cambridge, 1900), p. 200; G. I. Taylor, “The instability liquid surfaces when accelerated in a direction perpendicular to their plane,” Proc. R. Soc. London, Ser. A 201, 192 (1950)10.1098/rspa.1950.0052; S. Chandrasekhar, Hydrodynamic and Hydromagnetic Stability (Oxford Oxford, 1961)] late times high aspect ratios. In contrast established...

10.1063/1.4733396 article EN Physics of Fluids 2012-07-01

HYDRODYNAMIC SIMULATIONS OF H ENTRAINMENT AT THE TOP OF He-SHELL FLASH CONVECTION

OPENALEX - Publications

Paul R. Woodward Falk Herwig Pei‐Hung Lin

We present the first three-dimensional, fully compressible gas-dynamics simulations in 4π geometry of He-shell flash convection with proton-rich fuel entrainment at upper boundary. This work is motivated by insufficiently understood observed consequences H-ingestion post-asymptotic giant branch (post-AGB) stars (Sakurai's object) and metal-poor AGB stars. Our investigation focused on process top boundary subsequent advection H-rich material into deeper layers, we therefore ignore burning...

10.1088/0004-637x/798/1/49 article EN The Astrophysical Journal 2014-12-19

GLOBAL NON-SPHERICAL OSCILLATIONS IN THREE-DIMENSIONAL 4π SIMULATIONS OF THE H-INGESTION FLASH

OPENALEX - Publications

Falk Herwig Paul R. Woodward Pei‐Hung Lin Mike Knox Chris L. Fryer

We performed three-dimensional simulations of proton-rich material entrainment into 12C-rich He-shell flash convection and the subsequent H-ingestion that took place in post-asymptotic giant branch star Sakurai's object. Observations transient nature anomalous abundance features are available to validate our method assumptions, with aim applying them very low-metallicity stars future. include nuclear energy feedback from H burning cover full 4π geometry shell. Runs on 7683 15363 grids agree...

10.1088/2041-8205/792/1/l3 article EN The Astrophysical Journal Letters 2014-08-11

HPC-GPT: Integrating Large Language Model for High-Performance Computing

OPENALEX - Publications

Xianzhong Ding Le Chen Murali Emani Chunhua Liao Pei‐Hung Lin and 4 more

Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, performance in high-performance computing (HPC) domain tasks has been less than optimal due to specialized expertise required interpret model responses. In response this challenge, we propose HPC-GPT, a novel LLaMA-based that supervised fine-tuning using generated QA (Question-Answer) instances for HPC domain. To evaluate its...

10.1145/3624062.3624172 preprint EN cc-by 2023-11-10

Effects of functional electrical stimulation on dysphagia caused by radiation therapy in patients with nasopharyngeal carcinoma

OPENALEX - Publications

Pei‐Hung Lin Tzu‐Yu Hsiao Yeun‐Chung Chang Lai‐Lei Ting Wen‐Shiang Chen and 2 more

10.1007/s00520-009-0792-2 article EN Supportive Care in Cancer 2009-11-28

DataRaceBench

OPENALEX - Publications

Chunhua Liao Pei‐Hung Lin Joshua Asplund Markus Schordan Ian Karlin

Data races in multi-threaded parallel applications are notoriously damaging while extremely difficult to detect. Many tools have been developed help programmers find data races. However, there is no dedicated OpenMP benchmark suite systematically evaluate race detection for their strengths and limitations.

10.1145/3126908.3126958 article EN 2017-11-08

A novel, efficient electrochemical sensor for the detection of isoniazid based on the B/N doped mesoporous carbon modified electrode

OPENALEX - Publications

Paramasivam Balasubramanian T.S.T. Balamurugan Shen‐Ming Chen Tse‐Wei Chen Pei‐Hung Lin

10.1016/j.snb.2018.12.020 article EN Sensors and Actuators B Chemical 2018-12-11

Data Race Detection Using Large Language Models

OPENALEX - Publications

Le Chen Xianzhong Ding Murali Emani Tristan Vanderbruggen Pei‐Hung Lin and 1 more

Large language models (LLMs) are demonstrating significant promise as an alternate strategy to facilitate analyses and optimizations of high-performance computing programs, circumventing the need for resource-intensive manual tool creation. In this paper, we explore a novel LLM-based data race detection approach combining prompting engineering fine-tuning techniques. We create dedicated dataset named DRB-ML, which is derived from DataRaceBench, with fine-grain labels showing presence pairs...

10.1145/3624062.3624088 article EN cc-by 2023-11-10

Revisiting loop fusion in the polyhedral framework

OPENALEX - Publications

Sanyam Mehta Pei‐Hung Lin Pen-Chung Yew

Loop fusion is an important compiler optimization for improving memory hierarchy performance through enabling data reuse. Traditional compilers have approached loop in a manner decoupled from other high-level optimizations, missing several interesting solutions. Recently, the polyhedral framework with its ability to compose complex transformations, has proved be promising performing optimizations small programs. However, our experiments large programs using state-of-the-art frameworks reveal...

10.1145/2555243.2555250 article EN 2014-02-06

Facile, low-temperature synthesis of tungsten carbide (WC) flakes for the sensitive and selective electrocatalytic detection of dopamine in biological samples

OPENALEX - Publications

Muthaiah Annalakshmi Paramasivam Balasubramanian Shen‐Ming Chen Tse‐Wei Chen Pei‐Hung Lin

Transition metal carbides have shown potential for use in electrochemical applications due to their excellent electronic conductivity, stability and electrocatalysis.

10.1039/c9qi00447e article EN Inorganic Chemistry Frontiers 2019-01-01

Reduction in Hyoid Bone Forward Movement in Irradiated Nasopharyngeal Carcinoma Patients With Dysphagia

OPENALEX - Publications

Tyng‐Guey Wang Yeun‐Chung Chang Wen‐Shiang Chen Pei‐Hung Lin Tzu‐Yu Hsiao

10.1016/j.apmr.2010.02.011 article EN Archives of Physical Medicine and Rehabilitation 2010-06-01

HPC Ontology: Towards a Unified Ontology for Managing Training Datasets and AI Models for High-Performance Computing

OPENALEX - Publications

Chunhua Liao Pei‐Hung Lin Gaurav Verma Tristan Vanderbruggen Murali Emani and 2 more

Machine learning (ML) techniques have been widely studied to address various challenges of productively and efficiently running large-scale scientific applications on heterogeneous supercomputers. However, it is extremely difficult generate, access, maintain training datasets AI models accelerate ML-based research. The Future Research Communications e-Scholarship has proposed the FAIR data principles describing Findability, Accessibility, Interoperability, Reusability. In this paper, we...

10.1109/mlhpc54614.2021.00012 article EN 2021-11-01

Using Polyhedral Analysis to Verify OpenMP Applications are Data Race Free

OPENALEX - Publications

Fangke Ye Markus Schordan Chunhua Liao Pei‐Hung Lin Ian Karlin and 1 more

Among the most common and hardest to debug types of bugs in concurrent systems are data races. In this paper, we present an approach for verifying that OpenMP program is race free. We use polyhedral analysis verify those parts where detect parallel affine loop nests. show applicability with analysis-enabling transformations detection HPC applications. evaluate our dedicated benchmark suite DataRaceBench LLNL Proxy Application AMG2013 which consists 75,000 LOC. Our evaluation shows can...

10.1109/correctness.2018.00010 article EN 2018-11-01

Supporting multiple accelerators in high-level programming models

OPENALEX - Publications

Yonghong Yan Pei‐Hung Lin Chunhua Liao Bronis R. de Supinski Daniel J. Quinlan

Computational accelerators, such as manycore NVIDIA GPUs, Intel Xeon Phi and FPGAs, are becoming common in work-stations, servers supercomputers for scientific engineering applications. Efficiently exploiting the massive parallelism these accelerators provide requires designs implementations of productive programming models.

10.1145/2712386.2712405 article EN 2015-01-28

LM4HPC: Towards Effective Language Model Application in High-Performance Computing

OPENALEX - Publications

Le Chen Pei‐Hung Lin Tristan Vanderbruggen Chunhua Liao Murali Emani and 1 more

In recent years, language models (LMs), such as GPT-4, have been widely used in multiple domains, including natural processing, visualization, and so on. However, applying them for analyzing optimizing high-performance computing (HPC) software is still challenging due to the lack of HPC-specific support. this paper, we design LM4HPC framework facilitate research development HPC analyses optimizations using LMs. Tailored supporting datasets, AI models, pipelines, our built on top a range...

10.48550/arxiv.2306.14979 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Moving Scientific Codes to Multicore Microprocessor CPUs

OPENALEX - Publications

Paul R. Woodward Jagan Jayaraj Pei‐Hung Lin Pen-Chung Yew

The IBM Cell processor represents the first and most extreme of a new generation multicore CPUs. For scientific codes that can be formulated in terms vector computing concepts, as far we know, is rewarding. In this article, present method for implementing numerical algorithms so they run efficiently on other We our using piecewise-parabolic (PPM) gas dynamics algorithm but believe many could benefit from approach. Nevertheless, code transformations are difficult to perform manually,...

10.1109/mcse.2008.152 article EN Computing in Science & Engineering 2008-10-22

HPCFAIR: Enabling FAIR AI for HPC Applications

OPENALEX - Publications

Gaurav Verma Murali Emani Chunhua Liao Pei‐Hung Lin Tristan Vanderbruggen and 2 more

Artificial Intelligence (AI) is being adopted in different domains at an unprecedented scale. A significant interest the scientific community also involves leveraging machine learning (ML) to effectively run high performance computing applications Given multiple efforts this arena, there are often duplicated when existing rich data sets and ML models could be leveraged instead. The primary challenge a lack of ecosystem reuse reproduce datasets. In work, we propose HPCFAIR, modular,...

10.1109/mlhpc54614.2021.00011 article EN 2021-11-01

Creating a Dataset for High-Performance Computing Code Translation using LLMs: A Bridge Between OpenMP Fortran and C++

OPENALEX - Publications

Bin Lei Caiwen Ding Le Chen Pei‐Hung Lin Chunhua Liao

In this study, we present a novel dataset for training machine learning models translating between OpenMP Fortran and C++ code. To ensure reliability applicability, the is created from range of representative open-source benchmarks. It also refined using meticulous code similarity test. The effectiveness our assessed both quantitative (CodeBLEU) qualitative (human evaluation) methods. We showcase how significantly elevates translation competencies large language (LLMs). Specifically, without...

10.1109/hpec58863.2023.10363534 article EN 2023-09-25

Machine Learning Guided Optimal Use of GPU Unified Memory

OPENALEX - Publications

Hailu Xu Murali Emani Pei‐Hung Lin Liting Hu Chunhua Liao

NVIDIA's unified memory (UM) creates a pool of managed mem- ory on top physically separated CPU and GPU memories. UM automatically migrates page-level data on-demand so program- mers can quickly write CUDA codes heterogeneous machines without tedious error-prone manual management. To improve performance, NVIDIA allows advanced programmers to pass additional use hints its driver. However, it is extremely difficult for decide when how effi- ciently memory, given the complex interactions...

10.1109/mchpc49590.2019.00016 article EN 2019-11-01

First experience of compressible gas dynamics simulation on the Los Alamos roadrunner machine

OPENALEX - Publications

Paul R. Woodward Jagan Jayaraj Pei‐Hung Lin William W. Dai

Abstract We report initial experience with gas dynamics simulation on the Los Alamos Roadrunner machine. In this work, we have restricted our attention to flows in which flow Mach number is less than 2. This permits us use a simplified version of PPM algorithm that has been described detail by Woodward (2006). follow multifluid volume fraction using PPB moment‐conserving advection scheme, enforcing both pressure and temperature equilibrium between two monatomic ideal gases within each grid...

10.1002/cpe.1494 article EN Concurrency and Computation Practice and Experience 2009-10-16

Porting a 3D seismic modeling code (SW4) to CORAL machines

OPENALEX - Publications

Ramesh Pankajakshan Pei‐Hung Lin Björn Sjögreen

Seismic waves fourth order (SW4) solves the seismic wave equations on Cartesian and curvilinear grids using large compute clusters with O (100,000) cores. This article discusses porting of SW4 to run CORAL architecture RAJA performance portability abstraction layer. The performances key kernels CUDA are compared estimate penalty Code changes required for efficiency GPUs minimizing time spent in Message Passing Interface (MPI) discussed. describes a path efficiently code bases GPU-based...

10.1147/jrd.2019.2960218 article EN IBM Journal of Research and Development 2019-12-17

Coming Soon ...