Luanzheng Guo

ORCID: 0000-0001-8266-0923
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Distributed systems and fault tolerance
  • Advanced Data Storage Technologies
  • Radiation Effects in Electronics
  • Parallel Computing and Optimization Techniques
  • Cloud Computing and Resource Management
  • Distributed and Parallel Computing Systems
  • Scientific Computing and Data Management
  • Security and Verification in Computing
  • Algorithms and Data Compression
  • Software System Performance and Reliability
  • Tensor decomposition and applications
  • Remote Sensing and LiDAR Applications
  • Advanced Vision and Imaging
  • 3D Surveying and Cultural Heritage
  • Robotics and Sensor-Based Localization
  • Graph Theory and Algorithms
  • Advancements in Photolithography Techniques
  • Advanced Neural Network Applications
  • Data Quality and Management
  • Electron and X-Ray Spectroscopy Techniques
  • Anomaly Detection Techniques and Applications
  • Data Visualization and Analytics
  • Advanced Graph Neural Networks
  • Machine Learning and ELM
  • Advanced Electron Microscopy Techniques and Applications

Pacific Northwest National Laboratory
2018-2025

University of California, Merced
2018-2023

Lawrence Livermore National Laboratory
2019

Nanchang Hangkong University
2013

Neural Architecture Search (NAS) is a powerful approach of automating the design efficient neural architectures. In contrast to traditional NAS methods, recently proposed one-shot methods prove be more in performing NAS. One-shot works by generating singular weight-sharing supernetwork that acts as search space (container) subnetworks. Despite its achievements, designing remains major challenge. this work we propose strategy for Vision Transformer (ViT)-based particular, convert Segment...

10.48550/arxiv.2501.08504 preprint EN arXiv (Cornell University) 2025-01-14

Sparse tensor algebra is widely used in many applications, including scientific computing, machine learning, and data analytics. The performance of sparse kernels strongly depends on the intrinsic characteristics input tensors, hence storage formats are designed for tensors to achieve optimal particular applications/architectures, which makes it challenging implement optimize every operation interest a given architecture. We propose domain-specific language (DSL) compiler framework...

10.1109/llvmhpc54804.2021.00009 article EN 2021-11-01

10.1016/j.jpdc.2021.02.015 article EN publisher-specific-oa Journal of Parallel and Distributed Computing 2021-03-07

As high-performance computing systems scale in size and computational power, the danger of silent errors, i.e., errors that can bypass hardware detection mechanisms impact application state, grows dramatically. Consequently, applications running on HPC need to exhibit resilience such errors. Previous work has found that, for certain codes, this come free, some are naturally resilient, but few studies have shown code patterns-combinations or sequences computations-that make an resilient. In...

10.1109/sc.2018.00011 preprint EN 2018-11-01

10.1016/j.optlastec.2012.11.010 article EN Optics & Laser Technology 2013-01-17

Understanding application resilience (or error tolerance) in the presence of hardware transient faults on data objects is critical to ensure computing integrity and enable efficient application-level fault tolerance mechanisms. However, we lack a method tool quantify objects. The traditional method, random injection, cannot help, because losing semantics insufficient information how where errors are tolerated. In this paper, introduce (called "MOARD") model Our based systematically...

10.1109/ipdps.2019.00096 article EN 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2019-05-01

Automating the theory-experiment cycle requires effective distributed workflows that utilize a computing continuum spanning lab instruments, edge sensors, resources at multiple facilities, data sets across information sources, and potentially cloud. Unfortunately, obvious methods for constructing platforms, orchestrating workflow tasks, curating datasets over time fail to achieve scientific requirements performance, energy, security, reliability. Furthermore, achieving best use of depends...

10.48550/arxiv.2410.16093 preprint EN arXiv (Cornell University) 2024-10-21

Tensor algebra is widely used in many applications, such as scientific computing, machine learning, and data analytics. The tensors represented real-world are usually large sparse. There tens of storage formats designed for sparse matrices and/or the performance tensor operations depends on a particular architecture selected format, which makes it challenging to implement optimize every operation interest transfer code from one another. We propose domain-specific language (DSL) compiler...

10.48550/arxiv.2102.05187 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

A critical performance challenge in distributed scientific workflows is coordinating tasks and data flows on resources. To guide these decisions, this paper introduces flow lifecycle analysis. Workflows are commonly represented using directed acyclic graphs (DAGs). Data lifecycles (DFL) enrich task DAGs with objects properties that describe how interact flow. Lifecycles enable analysis from several important perspectives: task, data, We representation, measurement, analysis, visualization,...

10.1145/3581784.3607104 article EN 2023-10-30

As high-performance computing systems scale in size and computational power, the danger of silent errors, i.e., errors that can bypass hardware detection mechanisms impact application state, grows dramatically. Consequently, applications running on HPC need to exhibit resilience such errors. Previous work has found that, for certain codes, this come free, some are naturally resilient, but few studies have shown code patterns---combinations or sequences computations---that make an resilient....

10.5555/3291656.3291667 article EN 2018-11-11

Frame structure estimation from line segments is an important yet challenging problem in understanding indoor scenes. In practice, segment extraction can be affected by occlusions, illumination variations, and weak object boundaries. To address this problem, approach for frame recovery based on refinement voting proposed. We refined the revising, connecting, adding operations. then propose iterative mechanism selecting segments, where a cross ratio constraint enforced to build crab-like...

10.1109/icassp.2013.6638003 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2013-05-01

Tensor Algebra is a powerful tool for carrying out high-order data analytics in scientific applications, such as finite element analysis, N-body simulation, and quantum chemistry. Many of these applications are critical terms correctness safety. Since often run on High Performance Computing (HPC) systems, which susceptible to soft errors caused by cosmic rays, unstable voltage, etc., we must ensure that the execution reliable resilient, outcome highly trustworthy. However, traditional fault...

10.1109/ipdpsw59300.2023.00061 article EN 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2023-05-01

MPI has been ubiquitously deployed in flagship HPC systems aiming to accelerate distributed scientific applications running on tens of hundreds processes and compute nodes. Maintaining the correctness integrity application execution is critical, especially for safety-critical applications. Therefore, a collection effective fault tolerance techniques have proposed enable efficiently resume from system failures. However, there no structured way study compare different designs, so guide...

10.1109/iiswc50251.2020.00015 article EN 2020-10-01

High-performance computing (HPC) systems play a critical role in facilitating scientific discoveries. Their scale and complexity (e.g., the number of computational units software stack) continue to grow as new are expected process increasingly more data reduce time. However, with processing elements, probability that these will experience random bit-flip error corrupts program's output also increases, which is often recognized silent corruption. Analyzing resiliency HPC applications...

10.1109/tvcg.2022.3230636 article EN IEEE Transactions on Visualization and Computer Graphics 2022-12-20

Image segmentation is a critical enabler for tasks ranging from medical diagnostics to autonomous driving. However, the correct semantics - where are boundaries located? what segments logically similar? change depending on domain, such that state-of-the-art foundation models can generate meaningless and incorrect results. Moreover, in certain domains, fine-tuning retraining techniques infeasible: obtaining labels costly time-consuming; domain images (micrographs) be exponentially diverse;...

10.48550/arxiv.2404.06638 preprint EN arXiv (Cornell University) 2024-04-09

10.1109/ipdpsw63119.2024.00090 article EN 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2024-05-27

One of the most common operations in multimodal scientific data management is searching for $k$ similar items (or, $k$-nearest neighbors, KNN) from database after being provided a new item. Although recent advances machine learning models offer \textit{semantic} index, so-called \textit{embedding vectors} mapped original data, dimension resulting embedding vectors are usually on order hundreds or thousand, which impractically high time-sensitive applications. This work proposes to reduce...

10.48550/arxiv.2408.10264 preprint EN arXiv (Cornell University) 2024-08-15

10.11578/dc.20241120.4 article OSTI OAI (U.S. Department of Energy Office of Scientific and Technical Information) 2024-11-20

Checkpoint/Restart (C/R) has been widely deployed in numerous HPC systems, Clouds, and industrial data centers, which are typically operated by system engineers. Nevertheless, there is no existing approach that helps engineers without domain expertise, scientists fault tolerance knowledge identify those critical variables accounted for correct application execution restoration a failure C/R. To address this problem, we propose an analytical model tool (AutoCheck) can automatically to...

10.48550/arxiv.2408.06082 preprint EN arXiv (Cornell University) 2024-08-12
Coming Soon ...