NFDI4DS | UHH-SEMS - Publication Details

Zan Wang

ORCID: 0000-0001-6173-8170

About

Contact & Profiles

A5100707447

Research Areas

Software Testing and Debugging Techniques
Adversarial Robustness in Machine Learning
Software Engineering Research
Advanced Malware Detection Techniques
Parallel Computing and Optimization Techniques
Software Reliability and Analysis Research
Machine Learning and Data Classification
Software System Performance and Reliability
Topic Modeling
Anomaly Detection Techniques and Applications
Seismology and Earthquake Studies
Reinforcement Learning in Robotics
Geophysical and Geoelectrical Methods
Natural Language Processing Techniques
Multimodal Machine Learning Applications
Human Motion and Animation
Opinion Dynamics and Social Influence
Recommender Systems and Techniques
Smart Grid Security and Resilience
Seismic Imaging and Inversion Techniques
Advanced Neural Network Applications
Web Data Mining and Analysis
Human Pose and Action Recognition
Radiation Effects in Electronics
Healthcare Systems and Reforms

Tianjin University
2016-2025

Beijing Institute of Technology
2024-2025

Henan University
2019-2024

First Hospital of Jilin University
2023-2024

Jilin University
2023-2024

Beijing University of Posts and Telecommunications
2024

National Energy Technology Laboratory
2018-2020

Xi'an Technological University
2020

Southeast University
2019

Huazhong University of Science and Technology
2019

Semi-Supervised Log-Based Anomaly Detection via Probabilistic Label Estimation

OPENALEX - Publications

Lin Yang, Junjie Chen, Zan Wang, Weijing Wang, Jiajun Jiang, and 2 more

With the growth of software systems, logs have become an important source of data to aid system maintenance. Log-based anomaly detection is one of the most important methods for this purpose; it aims to automatically detect anomalies via log analysis. However, existing log-based approaches still suffer from practical issues, due to either depending on a large amount of manually labeled training data (supervised approaches) or unsatisfactory performance without learning the knowledge of historical anomalies (unsupervised and semi-supervised...

10.1109/icse43902.2021.00130 article EN 2021-05-01

An improved collaborative movie recommendation system using computational intelligence

Zan Wang, Xue Yu, Nan Feng, Zhenhua Wang

10.1016/j.jvlc.2014.09.011 article EN Journal of Visual Languages & Computing 2014-10-14

Deep learning library testing via effective model generation

Zan Wang, Ming Yan, Junjie Chen, Shuang Liu, Dongdi Zhang

Deep learning (DL) techniques are being developed rapidly and have been widely adopted in practice. However, similar to traditional software systems, DL systems also contain bugs, which could cause serious impacts, especially in safety-critical domains. Recently, many research approaches have focused on testing DL models, while little attention has been paid to testing DL libraries, which are the basis of building DL models and directly affect the behavior of DL systems. In this work, we propose a novel approach, LEMON, to testing DL libraries. In particular, (1)...

10.1145/3368089.3409761 article EN 2020-11-08

Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis

Zan Wang, Hanmo You, Junjie Chen, Yingyi Zhang, Xuyuan Dong, and 1 more

Deep Neural Network (DNN) testing is one of the most widely used ways to guarantee the quality of DNNs. However, labeling test inputs to check the correctness of DNN predictions is very costly, which could largely affect the efficiency of DNN testing, and even the whole process of DNN development. To relieve the labeling-cost problem, we propose a novel test input prioritization approach (called PRIMA) for DNNs via intelligent mutation analysis, in order to label more bug-revealing test inputs earlier within a limited time, which helps improve the efficiency of DNN testing. PRIMA is based on the key...

10.1109/icse43902.2021.00046 article EN 2021-05-01

Practical Accuracy Estimation for Efficient Deep Neural Network Testing

Junjie Chen, Zhuo Wu, Zan Wang, Hanmo You, Lingming Zhang, and 1 more

Deep neural networks (DNNs) have become increasingly popular, and DNN testing is critical to guarantee the correctness of a DNN, i.e., the accuracy of the DNN in this work. However, DNN testing suffers from a serious efficiency problem: it is costly to label each test input to know the DNN accuracy for the testing set, since labeling each test input involves multiple persons (even with domain-specific knowledge) in a manual way and the testing set is large-scale. To relieve this problem, we propose a novel and practical approach, called PACE (which is short for Practical ACcuracy Estimation), which selects a small set of test inputs...

10.1145/3394112 article EN ACM Transactions on Software Engineering and Methodology 2020-07-07

TCM-GPT: Efficient pre-training of large language models for domain adaptation in Traditional Chinese Medicine

Guoxing Yang, Xiaohong Liu, Jian‐Yu Shi, Zan Wang, Guangyu Wang

Pre-training and fine-tuning have emerged as a promising paradigm across various natural language processing (NLP) tasks. The effectiveness of pretrained large language models (LLMs) has seen further enhancement, holding potential for applications in the field of medicine, particularly in the context of Traditional Chinese Medicine (TCM). However, the application of these general models to specific domains often yields suboptimal results, primarily due to challenges like lack of domain knowledge, unique objectives, computational...

10.1016/j.cmpbup.2024.100158 article EN cc-by-nc-nd Computer Methods and Programs in Biomedicine Update 2024-01-01

History-driven test program synthesis for JVM testing

Yingquan Zhao, Zan Wang, Junjie Chen, Mengdi Liu, Mingyuan Wu, and 2 more

Java Virtual Machine (JVM) provides the runtime environment for Java programs, which allows Java to be "write once, run anywhere". The JVM plays a decisive role in the correctness of all Java programs running on it. Therefore, ensuring the correctness and robustness of JVM implementations is essential for Java programs. To date, various techniques have been proposed to expose JVM bugs via generating potential bug-revealing test programs. However, the diversity and effectiveness of test programs generated by existing research are far from enough since they mainly focus on minor...

10.1145/3510003.3510059 article EN Proceedings of the 44th International Conference on Software Engineering 2022-05-21

An Adaptive Markov Strategy for Defending Smart Grid False Data Injection From Malicious Attackers

Jianye Hao, Eunsuk Kang, Jun Sun, Zan Wang, Zhaopeng Meng, and 2 more

We present a novel defending strategy, adaptive Markov strategy (AMS), to protect a smart-grid system from being attacked by unknown attackers with unpredictable and dynamic behaviors. One significant merit of deploying AMS to defend the system is that it is theoretically guaranteed to converge to a best response strategy against any stationary attacker, and to a Nash equilibrium (NE) in case of self-play (the attacker is intelligent enough to use AMS to attack). The effectiveness of AMS is evaluated by considering a class of data integrity attacks in which an attacker manages...

10.1109/tsg.2016.2610582 article EN IEEE Transactions on Smart Grid 2016-09-16

PLELog: Semi-Supervised Log-Based Anomaly Detection via Probabilistic Label Estimation

Lin Yang, Junjie Chen, Zan Wang, Weijing Wang, Jiajun Jiang, and 2 more

PLELog is a novel approach for log-based anomaly detection via probabilistic label estimation. It is designed to effectively detect anomalies in unlabeled logs while avoiding the manual labeling effort for training data generation. We use semantic information within log events as fixed-length vectors and apply HDBSCAN to automatically cluster log sequences. After that, we also propose a Probabilistic Label Estimation approach to reduce the noise introduced by labeling errors and put "labeled" instances into an attention-based GRU...

10.1109/icse-companion52605.2021.00106 article EN 2021-05-01

Exposing numerical bugs in deep learning via gradient back-propagation

Ming Yan, Junjie Chen, Xiangyu Zhang, Lin Tan, Wang Gan, and 1 more

Numerical computation is dominant in deep learning (DL) programs. Consequently, numerical bugs are one of the most prominent kinds of defects in DL programs. Numerical bugs can lead to exceptional values such as NaN (Not-a-Number) and INF (Infinite), which can be propagated and eventually cause crashes or invalid outputs. They occur when special inputs cause invalid parameter values at internal mathematical operations such as log(). In this paper, we propose the first dynamic technique, called GRIST, which automatically generates a small input that can expose numerical bugs in DL programs. GRIST...

10.1145/3468264.3468612 article EN 2021-08-18

Mitigating Regression Faults Induced by Feature Evolution in Deep Learning Systems

Hanmo You, Zan Wang, Xuyang Chen, Junjie Chen, Jun Sun, and 2 more

Deep learning (DL) systems have been widely utilized across various domains. However, the evolution of DL systems can result in regression faults. In addition to evolution through the incorporation of new data, feature evolution, such as the incorporation of new features, is also common and can introduce regression faults. In this work, we first investigate the underlying factors that are correlated with regression faults in feature evolution scenarios, i.e., redundancy and contribution shift. Based on our investigation, we propose a novel mitigation approach called FeaProtect, which aims to minimize the impact...

10.1145/3712199 article EN ACM Transactions on Software Engineering and Methodology 2025-01-15

X's Day: Personality-Driven Virtual Human Behavior Generation

Li Haoyang, Zan Wang, Wei Liang, Yizhuo Wang

Developing convincing and realistic virtual human behavior is essential for enhancing user experiences in virtual reality (VR) and augmented reality (AR) settings. This paper introduces a novel task focused on generating long-term behaviors for virtual agents, guided by specific personality traits and contextual elements within 3D environments. We present a comprehensive framework capable of autonomously producing daily activities autoregressively. By modeling the intricate connections between personality characteristics and observable...

10.1109/tvcg.2025.3549574 article EN IEEE Transactions on Visualization and Computer Graphics 2025-01-01

Evaluating Spectrum-based Fault Localization on Deep Learning Libraries

Ming Yan, Junjie Chen, Tianyu Jiang, Jiajun Jiang, Zan Wang

10.1109/tse.2025.3552622 article EN IEEE Transactions on Software Engineering 2025-01-01

Large-Scale Empirical Studies on Effort-Aware Security Vulnerability Prediction Methods

Xiang Chen, Yingquan Zhao, Zhanqi Cui, Guozhu Meng, Yang Liu, and 1 more

Security vulnerability prediction (SVP) can identify potentially vulnerable modules in advance and then help developers allocate most of the test resources to these modules. To evaluate the performance of different SVP methods, we should take security audit and code inspection into account and consider effort-aware performance measures (such as ACC and P_opt). However, to the best of our knowledge, the effectiveness of different SVP methods has not been...

10.1109/tr.2019.2924932 article EN IEEE Transactions on Reliability 2019-08-22

HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes

Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Wei Liang, and 1 more

Learning to generate diverse scene-aware and goal-oriented human motions in 3D scenes remains challenging due to the mediocre characteristics of the existing datasets on Human-Scene Interaction (HSI); they only have limited scale/quality and lack semantics. To fill the gap, we propose a large-scale and semantic-rich synthetic HSI dataset, denoted as HUMANISE, by aligning the captured human motion sequences with various 3D indoor scenes. We automatically annotate the aligned motions with language descriptions that depict the action and the unique...

10.48550/arxiv.2210.09729 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Regression Fuzzing for Deep Learning Systems

Hanmo You, Zan Wang, Junjie Chen, Shuang Liu, Shuochuan Li

Deep learning (DL) systems have been widely used in various domains. Similar to traditional software, DL system evolution may also incur regression faults. To find the regression faults between versions of a DL system, we propose a novel regression fuzzing technique called DRFuzz, which facilitates generating inputs that trigger diverse regression faults and have high fidelity. To enhance the diversity of the found regression faults, DRFuzz proposes a diversity-oriented test criterion to explore as many faulty behaviors as possible. Then, DRFuzz incorporates the GAN model to guarantee...

10.1109/icse48619.2023.00019 article EN 2023-05-01

Recurrent Deep Multiagent Q-Learning for Autonomous Brokers in Smart Grid

Yaodong Yang, Jianye Hao, Mingyang Sun, Zan Wang, Changjie Fan, and 1 more

The broker mechanism is widely applied to help interested parties derive long-term policies in order to reduce costs or gain profits in the smart grid. However, a broker is faced with a number of challenging problems, such as balancing demand and supply from customers and competing with other coexisting brokers to maximize its profit. In this paper, we develop an effective pricing strategy for brokers in the local electricity retail market based on recurrent deep multiagent reinforcement learning and sequential clustering. We use real...

10.24963/ijcai.2018/79 article EN 2018-07-01

Assessment of geophysical monitoring methods for detection of brine and CO2 leakage in drinking water aquifers

Xianjin Yang, Thomas A. Buscheck, Kayyum Mansoor, Zan Wang, Kai Gao, and 3 more

10.1016/j.ijggc.2019.102803 article EN publisher-specific-oa International journal of greenhouse gas control 2019-08-12

Stratified random sampling for neural network test input selection

Zhuo Wu, Zan Wang, Junjie Chen, Hanmo You, Ming Yan, and 1 more

10.1016/j.infsof.2023.107331 article EN Information and Software Technology 2023-09-20

TECCD: A Tree Embedding Approach for Code Clone Detection

Yifei Gao, Zan Wang, Shuang Liu, Lin Yang, Wei Sang, and 1 more

Clone detection techniques have been explored for decades. Recently, deep learning has been adopted to improve the code representation capability and to advance the state of the art in code clone detection. These approaches usually require a transformation from an AST to a binary tree to incorporate syntactical information, which introduces overheads. Moreover, these approaches conduct term-embedding, which requires large training datasets. In this paper, we introduce a tree embedding technique to conduct clone detection. Our approach first conducts tree embedding to obtain a node vector for each...

10.1109/icsme.2019.00025 article EN 2019-09-01

Anxiety and depression among patients with insomnia during the first wave and the release of the COVID-19 in Northeast China: A cross-sectional survey

Huimin Li, Yanan Zhang, Qianqian Chen, Qingqing Sun, Ying Wang, and 3 more

The global coronavirus disease 2019 (COVID-19) pandemic seriously affected people's lives. We evaluated anxiety and depression among patients with insomnia in northeast China during the first wave and the release of COVID-19, providing a basis for the clinical diagnosis and treatment of insomnia.

10.1016/j.jad.2023.12.088 article EN cc-by-nc-nd Journal of Affective Disorders 2024-01-02

Testing the Compiler for a New-Born Programming Language: An Industrial Case Study (Experience Paper)

Yingquan Zhao, Junjie Chen, Ruifeng Fu, Haojie Ye, Zan Wang

Due to the critical role of compilers, many compiler testing techniques have been proposed, the two most notable categories of which are grammar-based and metamorphic-based techniques. All of them have been extensively studied for testing mature compilers. However, it is typical to develop a new compiler for a new-born programming language in practice. In this scenario, the existing techniques are hardly applicable due to some major reasons: (1) no reference compilers to support differential testing, (2) lack of program analysis tools, (3) substantial...

10.1145/3597926.3598077 article EN 2023-07-12

APIRecX: Cross-Library API Recommendation via Pre-Trained Language Model

Yuning Kang, Zan Wang, Hongyu Zhang, Junjie Chen, Hanmo You

For programmers, learning the usage of APIs (Application Programming Interfaces) of a software library is important yet difficult. API recommendation tools can help developers use APIs by recommending which APIs to use next given the APIs that have been written. Traditionally, language models such as N-gram are applied to API recommendation. However, because software libraries keep changing and new libraries keep emerging, new APIs are common. These new APIs can be seen as OOV (out of vocabulary) words and cannot be handled well by existing API recommendation approaches due to the lack of training data. In this...

10.18653/v1/2021.emnlp-main.275 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01