- Distributed and Parallel Computing Systems
- Parallel Computing and Optimization Techniques
- Advanced Data Storage Technologies
- Cloud Computing and Resource Management
- Scientific Computing and Data Management
- Computational Drug Discovery Methods
- Distributed Systems and Fault Tolerance
- Software System Performance and Reliability
- Embedded Systems Design Techniques
- Software-Defined Networks and 5G
- Service-Oriented Architecture and Web Services
- Protein Structure and Dynamics
- Systems Engineering Methodologies and Applications
- Genetics, Bioinformatics, and Biomedical Research
- SARS-CoV-2 Detection and Testing
- Brain Tumor Detection and Classification
- Peer-to-Peer Network Technologies
- SARS-CoV-2 and COVID-19 Research
- Caching and Content Delivery
- Semantic Web and Ontologies
- Machine Learning in Materials Science
- Simulation Techniques and Applications
- Particle Detector Development and Performance
- Data Mining Algorithms and Applications
- Security and Verification in Computing
Oak Ridge National Laboratory
2012-2023
Center for Clinical Studies
2021
Summary The Exascale Computing Project (ECP) is currently the primary effort in the United States focused on developing “exascale” levels of computing capability, including hardware, software, and applications. In order to obtain a more thorough understanding of how software projects under ECP are using, and planning to use, the Message Passing Interface (MPI), and to help guide the work of our own project within ECP, we created a survey. Of the 97 projects active at the time the survey was distributed, we received 77 responses, 56 of which reported...
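As a concrete illustration of the kind of MPI usage the survey asked projects about, here is a minimal sketch using mpi4py; it is a generic example, not code from any ECP project or from the survey itself.

```python
# Minimal sketch of a common MPI pattern (generic example, not ECP code).
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()

# Each rank contributes a partial value; allreduce combines them.
local = float(rank)
total = comm.allreduce(local, op=MPI.SUM)

if rank == 0:
    print(f"{size} ranks, sum of ranks = {total}")
```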
Time-to-solution for structure-based screening of massive chemical databases for COVID-19 drug discovery has been decreased by an order of magnitude, and a virtual laboratory has been deployed at scale on up to 27,612 GPUs on the Summit supercomputer, allowing the molecular docking of an average of 19,028 compounds per second. Over one billion compounds were docked to two SARS-CoV-2 protein structures with full optimization of ligand position and 20 poses per docking, each in under 24 hours. GPU acceleration and high-throughput optimizations of the program...
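A quick back-of-envelope check (my arithmetic, not taken from the paper) shows that the reported docking rate is consistent with screening over one billion compounds in under a day:

```python
# Sanity-check the reported throughput figures (my arithmetic, not the paper's).
rate = 19_028                  # compounds docked per second (reported average)
seconds_per_day = 24 * 3600

compounds_per_day = rate * seconds_per_day
print(f"{compounds_per_day:,} compounds/day")   # ~1.64 billion

# Time to dock one billion compounds at this rate:
hours = 1_000_000_000 / rate / 3600
print(f"{hours:.1f} hours for 1e9 compounds")   # ~14.6 hours, i.e. under 24 h
```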
Abstract This dataset contains ligand conformations and docking scores for 1.4 billion molecules docked against 6 structural targets from SARS-CoV-2, representing 5 unique proteins: MPro, NSP15, PLPro, RDRP, and the Spike protein. Docking was carried out using the AutoDock-GPU platform on the Summit supercomputer and Google Cloud. The docking procedure employed the Solis-Wets search method to generate 20 independent binding poses per compound. Each compound geometry was scored using the AutoDock free-energy estimate and rescored with RFScore...
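As a sketch of how one might filter such a dataset, here is a minimal example assuming a hypothetical flat CSV layout with `compound_id`, `target`, and `dock_score` columns; the actual released schema may differ.

```python
# Hypothetical filter over a docking-score table; column names are assumptions.
import csv

def top_hits(path, target="MPro", score_cutoff=-10.0):
    """Yield compounds whose AutoDock free-energy estimate beats the cutoff
    (more negative = stronger predicted binding)."""
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            if row["target"] == target and float(row["dock_score"]) < score_cutoff:
                yield row["compound_id"], float(row["dock_score"])

# Usage (with a hypothetical scores.csv):
# for cid, score in top_hits("scores.csv"):
#     print(cid, score)
```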
We introduce UnifyFS, a user-level file system that aggregates the node-local storage tiers available on high performance computing (HPC) systems and makes them available to HPC applications under a unified namespace. UnifyFS employs transparent I/O interception, so it does not require changes to application code and is compatible with commonly used I/O libraries. The design of UnifyFS supports the predominant HPC workloads and is optimized for bulk-synchronous I/O patterns. Furthermore, UnifyFS provides customizable semantics to flexibly adapt its...
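Because the interception is transparent, an application simply performs ordinary file I/O under the unified namespace. A minimal sketch, assuming a hypothetical "/unifyfs" mount prefix and a SLURM launch (both placeholders, not fixed by UnifyFS):

```python
# Illustrative only: ordinary POSIX I/O under an assumed UnifyFS namespace.
import os

prefix = os.environ.get("UNIFYFS_PREFIX", "/unifyfs")   # assumed mount prefix
rank = int(os.environ.get("SLURM_PROCID", "0"))         # assumes a SLURM launch

# Each process writes a per-rank checkpoint; the file system aggregates the
# node-local tiers so every rank sees the same shared namespace.
with open(f"{prefix}/ckpt.{rank}", "wb") as f:
    f.write(b"\x00" * 4096)  # stand-in for real checkpoint data
```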
The urgent search for drugs to combat SARS-CoV-2 has included the use of supercomputers. General-purpose graphics processing units (GPUs), massive parallelism, and new software for high-performance computing (HPC) have allowed researchers to explore vast chemical spaces of potential drugs faster than ever before. We developed a drug discovery pipeline using the Summit supercomputer at Oak Ridge National Laboratory to help pioneer this effort, with platforms that incorporate GPU-accelerated simulation and allow virtual screening...
The SPEChpc 2021 suites are application-based benchmarks designed to measure the performance of modern HPC systems. The benchmarks support MPI, MPI+OpenMP, MPI+OpenMP target offload, and MPI+OpenACC, and are portable across all major platforms.
Reliability, availability, and serviceability (RAS) logs of high performance computing (HPC) resources, when closely investigated in their spatial and temporal dimensions, can provide invaluable information regarding system status, performance, and resource utilization. These data are often generated from multiple logging systems and sensors that cover many components of the system. The analysis of these logs for persistent insights faces two main difficulties: the sheer volume of RAS logs makes manual inspection difficult...
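A minimal sketch of the spatial/temporal bucketing such an analysis performs, assuming a hypothetical "timestamp node message" log line format (the real RAS formats vary by system):

```python
# Count events per (node, hour) to expose spatial/temporal hot spots that
# would be hard to see by manual inspection. Log format is an assumption.
from collections import Counter
from datetime import datetime

def bucket_events(lines, ):
    counts = Counter()
    for line in lines:
        ts, node, message = line.split(maxsplit=2)
        hour = datetime.fromisoformat(ts).replace(minute=0, second=0)
        counts[(node, hour)] += 1
    return counts

sample = ["2023-05-01T13:05:12 node042 ECC-correctable",
          "2023-05-01T13:41:07 node042 ECC-correctable"]
for key, n in bucket_events(sample).items():
    print(key, n)
```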
We present a supercomputer-driven pipeline for in-silico drug discovery using enhanced sampling molecular dynamics (MD) and ensemble docking. We also describe preliminary results obtained for 23 systems involving eight protein targets of the proteome of SARS-CoV-2. The MD performed is temperature replica-exchange sampling, making use of massively parallel supercomputing on the SUMMIT supercomputer at Oak Ridge National Laboratory, with which more than 1 ms of sampling can be generated per day. We have docked repurposing...
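For reference, the swap test underlying temperature replica-exchange MD is the standard Metropolis criterion, accepting an exchange between replicas i and j with probability min(1, exp[(1/kT_i - 1/kT_j)(E_i - E_j)]). A generic sketch of that test (not the pipeline's code):

```python
# Generic parallel-tempering swap test; not code from the paper's pipeline.
import math, random

KB = 0.0019872041  # Boltzmann constant, kcal/(mol*K)

def swap_accepted(E_i, T_i, E_j, T_j):
    """Metropolis test for exchanging configurations between replicas
    at temperatures T_i and T_j with potential energies E_i and E_j."""
    delta = (1.0 / (KB * T_i) - 1.0 / (KB * T_j)) * (E_i - E_j)
    return delta >= 0 or random.random() < math.exp(delta)

# Example: a proposed swap between a 300 K and a 310 K replica.
print(swap_accepted(E_i=-1052.3, T_i=300.0, E_j=-1047.9, T_j=310.0))
```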
Softwarization of networked infrastructures combined with containerization of codes promises unprecedented computing capabilities distributed across federations of systems and physical instruments. The development and testing of a software stack that implements these capabilities over an expensive production infrastructure is not cost-effective and, in early stages, may potentially cause service disruptions. To address these aspects, we develop the Virtual Federated Science Instrument Environment (VFSIE), a digital twin that emulates...
Summary We measure and analyze the performance observed when running applications and benchmarks before and after the Meltdown and Spectre fixes have been applied to the Cray supercomputers and supporting systems at the Oak Ridge Leadership Computing Facility (OLCF). Of particular interest is the effect of these fixes on applications selected from the OLCF portfolio when run at scale. This comprehensive study presents results from experiments run on Titan, Eos, Cumulus, and Percival at the OLCF. The results of this study are useful for HPC users and serve to better understand the impact that these two...
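The core before/after comparison in such a study reduces to a relative-slowdown calculation; a small sketch with made-up placeholder timings (not OLCF results):

```python
# Relative slowdown of a benchmark after a patch; timings are placeholders.
def slowdown(before_s, after_s):
    """Percent change in runtime after the fix is applied."""
    return 100.0 * (after_s - before_s) / before_s

runs = {"bench_a": (120.4, 127.1), "bench_b": (88.0, 88.9)}
for name, (pre, post) in runs.items():
    print(f"{name}: {slowdown(pre, post):+.1f}%")
```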
Recent developments in the softwarization of networked infrastructures, combined with the containerization of computing workflows, promise unprecedented compute-anywhere-and-everywhere capabilities for federations of edge and remote systems and science instruments. The development and testing of software stacks that implement these capabilities over physical production federations, however, is neither practical nor cost-effective. In response, we develop a digital twin of the infrastructure, called the Virtual Federated Science Instrument...
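A hedged sketch of the container-based emulation idea, using the Docker SDK for Python with placeholder images and site names; this illustrates the general approach, not VFSIE's actual stack:

```python
# Emulate federation nodes as containers; image and names are placeholders.
import docker

client = docker.from_env()
sites = ["edge-cam", "hpc-gateway", "instrument-ctl"]  # hypothetical sites

nodes = [
    client.containers.run(
        "alpine:3",                 # placeholder image for an emulated host
        command="sleep infinity",   # keep the emulated node alive
        name=f"twin-{site}",
        detach=True,
    )
    for site in sites
]

# The emulated hosts can now be wired into virtual networks and run the
# federation software stack without touching production infrastructure.
for node in nodes:
    print(node.name, node.status)
```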
Scientific workflows are increasingly being distributed across wide-area networks, and their code executions are expected to span geographically dispersed computing systems. MPI has been extensively used to support communications for computations, typically over compute clusters and high-performance systems within a single facility. We present a case study of the performance of basic MPI operations over long-distance connections, wherein TCP is the underlying transport. Measurements of the execution times of codes that utilize...
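A minimal sketch of the kind of ping-pong timing such a case study measures, written with mpi4py and run with two ranks (generic example, not the study's code; over a long-distance link the MPI library would ride on TCP):

```python
# Ping-pong round-trip timing between two MPI ranks (run with exactly 2 ranks).
from mpi4py import MPI
import time

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
buf = bytearray(1 << 20)  # 1 MiB payload
reps = 20

comm.Barrier()
start = time.perf_counter()
for _ in range(reps):
    if rank == 0:
        comm.Send(buf, dest=1)
        comm.Recv(buf, source=1)
    elif rank == 1:
        comm.Recv(buf, source=0)
        comm.Send(buf, dest=0)
elapsed = time.perf_counter() - start

if rank == 0:
    rtt = elapsed / reps
    print(f"avg round trip: {rtt * 1e3:.2f} ms, "
          f"~{2 * len(buf) / rtt / 1e6:.1f} MB/s effective")
```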