NFDI4DS | UHH-SEMS - Publication Details

Canqun Yang

ORCID: 0009-0008-4757-2475

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5027243864

Research Areas

Parallel Computing and Optimization Techniques
Lattice Boltzmann Simulation Studies
Advanced Data Storage Technologies
Software-Defined Networks and 5G
Advanced Numerical Methods in Computational Mathematics
Network Security and Intrusion Detection
Distributed and Parallel Computing Systems
Advanced Malware Detection Techniques
Fluid Dynamics and Turbulent Flows
Software System Performance and Reliability
Model Reduction and Neural Networks
Polymer Nanocomposites and Properties
Aerosol Filtration and Electrostatic Precipitation
Cavitation Phenomena in Pumps
Algorithms and Data Compression
Computer Graphics and Visualization Techniques
Matrix Theory and Algorithms
Fluid Dynamics and Vibration Analysis
DNA and Biological Computing
Cloud Computing and Resource Management
Computational Fluid Dynamics and Aerodynamics
Fluid Dynamics Simulations and Interactions
Recommender Systems and Techniques
Stochastic Gradient Optimization Techniques
Diamond and Carbon-based Materials Research

National University of Defense Technology
2013-2025

National Supercomputing Center of Tianjin
2024

Real-Time Anomaly Detection for Large-Scale Network Devices

OPENALEX - Publications

Tao Lei Minghua Ma Shenglin Zhang Junhua Kuang Xiaowei Guo and 2 more

10.1109/ton.2025.3529861 article EN 2025-01-01

A pseudo-time stepping and parameterized physics-informed neural network framework for Navier–Stokes equations

OPENALEX - Publications

Zhuo Zhang Xiong Xiong Sen Zhang Wei Wang Xi Yang and 2 more

Physics-informed neural networks (PINNs) have emerged as a popular approach in scientific machine learning for solving both forward and inverse problems of partial differential equations (PDEs). However, complex physical systems are often characterized by parameters, such viscosity Reynolds number fluid dynamics, which pose significant challenges parameterized PDE solutions. The inherent limitations PINNs include the need repeated time-consuming training under varying parameter conditions,...

10.1063/5.0259583 article EN Physics of Fluids 2025-03-01

Diagnosing Performance Issues for Large-Scale Microservice Systems With Heterogeneous Graph

OPENALEX - Publications

Tao Lei Xianglin Lu Shenglin Zhang Jiaqi Luan Yingke Li and 8 more

The availability of microservice systems is critical to business operations and corporate reputation. However, the dynamics complexity introduce significant challenges performance issue diagnosis large-scale systems. After investigating hundreds real-world cases in Tencent, we find that previous troubleshooting approaches fail accurately localize root causes because they overlook inconsistency between causality calling relationships. Therefore, propose a novel approach, MicroDig, diagnose...

10.1109/tsc.2024.3402172 article EN IEEE Transactions on Services Computing 2024-05-17

A Block Iteration with Parallelization Method for the Greedy Selection in Radial Basis Functions Based Mesh Deformation

OPENALEX - Publications

Ran Zhao Chao Li Xiaowei Guo Sijiang Fan Yi Wang and 1 more

Greedy algorithm is one of the important point selection methods in radial basis function based mesh deformation. However, large-scale mesh, conventional greedy will generate expensive time consumption and result performance penalties. To accelerate computational procedure selection, a block iteration with parallelization method proposed this paper. By method, complexities three steps are all reduced from O ( n 3 ) to 2 . In addition, two separates boundary points into sub-cores, efficiently...

10.3390/app9061141 article EN cc-by Applied Sciences 2019-03-18

SNCL: a supernode OpenCL implementation for hybrid computing arrays

OPENALEX - Publications

Tao Tang Kai Lu Peng Lin Yingbo Cui Jianbin Fang and 4 more

10.1007/s11227-023-05766-3 article EN The Journal of Supercomputing 2023-12-08

Implementing and Evaluating OpenCL on an ARMv8 Multi-Core CPU

OPENALEX - Publications

Jianbin Fang Peng Zhang Tao Tang Chun Huang Canqun Yang

The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using single unified programming interface language. But guaranteeing portability relies heavily on platform-specific implementations. In this paper, we provide an implementation ARMv8 multi-core which efficiently maps the generic platform model to architecture. With implementation, first characterize maximum achieved arithmetic throughput memory accessing bandwidth architecture, measure...

10.1109/ispa/iucc.2017.00131 article EN 2017-12-01

A Motion Trace Decomposition-based overset grid method for parallel CFD simulations with moving boundaries

OPENALEX - Publications

Ran Zhao Chao Li Xiaowei Guo Sen Zhang Xi Yang and 2 more

The overset grid method is widely employed to solve moving boundary problems in numerical simulations. However, the heavy and inevitable communication resulting from movements severely impedes improvement of parallel efficiency. This paper proposes a Motion Trace Decomposition (MTD) alleviate this issue. MTD minimizes overhead between processors by decomposing sub-grids distributing them according object motion trajectory, negating need reproduce areas when boundaries move. Various tests...

10.1145/3673038.3673102 article EN cc-by 2024-08-08

Giving Every Modality a Voice in Microservice Failure Diagnosis via Multimodal Adaptive Optimization

OPENALEX - Publications

Tao Lei Shenglin Zhang Zhao Jia Jinrui Sun Minghua Ma and 5 more

10.1145/3691620.3695489 article 2024-10-18

Fast Noisy Long Read Alignment with Multi-Level Parallelism

OPENALEX - Publications

Zeyu Xia Canqun Yang Chenchen Peng Yifei Guo Yufei Guo and 2 more

<title>Abstract</title> Background: The advent of Single Molecule Real-Time (SMRT) sequencing has overcome many limitations second-generation sequencing, such as limited read lengths, PCR amplification biases. However, longer reads increase data volume exponentially and high error rates make existing alignment tools inapplicable. Additionally, a single CPU's performance bottleneck restricts the effectiveness algorithms for SMRT sequencing. Methods: To address these challenges, we introduce...

10.21203/rs.3.rs-5360308/v1 preprint EN cc-by Research Square (Research Square) 2024-11-13

Discrete-time Physics-Informed Neural Networks for Two-Phase Flow Interface Capturing

OPENALEX - Publications

Sen Zhang Yanxu Zhong Xi Yang Wei Wang Zhuo Zhang and 2 more

10.1145/3704814.3704821 article EN Proceedings of the 4th International Conference on Computer Science and Application Engineering 2024-11-28

Hybrid MPI/OpenMP optimization in Linpack benchmark on multi-core platforms

OPENALEX - Publications

Feng Wang Canqun Yang Juncheng Bai

With the increasing of number CPU cores, thousands cores are used in current supercomputers. The MPIJOpenMP hybrid programming model is popular multicore systems. Some serial codes pure MPI programs turn to bottleneck and easy be neglected when these ported model. In Linpack benchmark, we focus on local swap algorithm present an OpenMP optimization method speedup performance using multi-thread. On a cluster system with 36 multi-core CPUs, experiment results show that this can decrease time...

10.1109/iccse.2013.6554042 article EN 2013-04-01

Accelerating Type Confusion Detection by Identifying Harmless Type Castings

OPENALEX - Publications

Xiaokang Fan Sifan Long Chun Huang Canqun Yang F. Y. Li

C++ allows reinterpretation of memory objects via type casting, which facilitates easier manipulation class fields and virtual methods inside the hierarchy. However, misinterpretation objects, is called confusion, can result in illegal access or methods. Type confusion accounts for many security vulnerabilities programs written C++. Previous detection techniques report a bug when an object parent casted to child class. downcast safe as long no are accessed. This paper presents Harmless...

10.1145/3587135.3592205 article EN 2023-05-09

An Improved Parallel Overset Grid Method for Fluid Simulation with Moving Boundary

OPENALEX - Publications

Ran Zhao Chao Li Xiaowei Guo Yi Liu Sifan Long and 3 more

The Overset Grid method is a promising computational approach for tackling the challenging moving boundary problems in Computational Fluid Dynamics (CFD) simulations. efficiency and accuracy of are critically dependent on effectiveness Assembly (OGA) process. However, OGA process plagued by unavoidable issues load imbalance communication overheads, which adversely impact parallel method, particularly when dealing with sub-grids motion. This paper proposes an improved assembly as effective...

10.1145/3605573.3605634 article EN 2023-08-07

Bilateral Sampling Randomized Singular Value Decomposition

OPENALEX - Publications

Hao Jiang Peibing Du Tao Sun Housen Li Lizhi Cheng and 1 more

Designing fast singular value decomposition (SVD) is significantly interesting in applications. The random direct SVD (RSVD) has provided a scheme to compute the well-approximate by unilateral randomized sampling. In this paper, we present an efficient algorithm bilateral sampling way. We also prove that proposed algorithms can be bounded well and have less computational complexity compared RSVD when objective matrix approximately square. Numerical experiments on graph Laplacian Hilbert...

10.1109/pdcat.2016.027 article EN 2016-12-01

The Optimization of Multi-physics Application Simulated by Lattice Boltzmann Method Based on Domestic Processors

OPENALEX - Publications

Yi Liu Peng Lin Peng Zhang Yanlong Qiu Sen Zhang and 2 more

LBM can conveniently deal with the interaction between fluid and solid. Thus, it is widely used in numerical simulation of multi-physics applications. Based on domestic processor FT-2000, Rayleigh-Bnard convection carried out. Through performance tests, found that function call overhead process occupies nearly execution time. By expanding reorganizing data structure, time for 104 timestep declines from 9236s to 2754s, has improved about .

10.1145/3605801.3605810 article EN 2023-06-16

Scalability of OpenFOAM for Viscoelastic Solver on High Performance Systems

OPENALEX - Publications

Gang Huang Canqun Yang Xiaowei Guo Chengkun Wu Xiang Zhang

In this paper, we investigate the scalability of OpenFOAM for viscoelastic solver which is implemented in our previous study on HPC platforms.Results show that scales reasonably well up to 256 cores.Further profiling shows greatly restricted by global reduction, introduced numerous scalar product operations parallel PCG algorithm OpenFOAM.

10.2991/eeeis-17.2017.14 article EN cc-by-nc 2017-01-01

Coming Soon ...