NFDI4DS | UHH-SEMS - Publication Details

Shuai Wang

ORCID: 0000-0002-0866-0308

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100328264

Research Areas

Advanced Malware Detection Techniques
Software Testing and Debugging Techniques
Software Engineering Research
Security and Verification in Computing
Adversarial Robustness in Machine Learning
Topic Modeling
Natural Language Processing Techniques
Speech Recognition and Synthesis
Cloud Computing and Resource Management
Software System Performance and Reliability
Anomaly Detection Techniques and Applications
Digital and Cyber Forensics
Speech and Audio Processing
Bayesian Modeling and Causal Inference
Software Reliability and Analysis Research
Parallel Computing and Optimization Techniques
Data Quality and Management
Music and Audio Processing
Speech and dialogue systems
Advanced Decision-Making Techniques
Distributed and Parallel Computing Systems
Sentiment Analysis and Opinion Mining
Advanced Neural Network Applications
Web Application Security Vulnerabilities
Explainable Artificial Intelligence (XAI)

University of Hong Kong
2020-2025

Hong Kong University of Science and Technology
2020-2025

Guangxi Medical University
2025

Jiangsu University of Science and Technology
2024

Peking University
2019-2024

Beijing Institute of Technology
2022-2024

Shenyang Agricultural University
2024

Yanshan University
2024

Zhejiang Provincial People's Hospital
2021-2024

Southeast Asia University
2024

LibD: Scalable and Precise Third-Party Library Detection in Android Markets

OPENALEX - Publications

Menghao Li Wei Wang Pei Wang Shuai Wang Dinghao Wu and 3 more

With the thriving of mobile app markets, third-party libraries are pervasively integrated in Android applications. Third-party provide functionality such as advertisements, location services, and social networking making multi-functional development much more productive. However, spread vulnerable or harmful may also hurt entire ecosystem, leading to various security problems. The platform suffers severely from problems due way its ecosystem is constructed maintained. Therefore, library...

10.1109/icse.2017.38 article EN 2017-05-01

Continuous-variable quantum teleportation with non-Gaussian entangled states generated via multiple-photon subtraction and addition

OPENALEX - Publications

Shuai Wang Lili Hou Xianfeng Chen Xuefen Xu

We theoretically analyze the Einstein-Podolsky-Rosen (EPR) correlation, quadrature squeezing, and continuous-variable quantum teleportation when considering non-Gaussian entangled states generated by applying multiple-photon subtraction addition to a two-mode squeezed vacuum state (TMSVs). Our results indicate that in case of multiple-photon-subtracted TMSVs with symmetric operations, corresponding EPR squeezing degree, sum fidelity teleporting coherent or can be enhanced for any parameter r...

10.1103/physreva.91.063832 article EN Physical Review A 2015-06-25

CCTEST: Testing and Repairing Code Completion Systems

OPENALEX - Publications

Zongjie Li Chaozheng Wang Zhibo Liu Haoxuan Wang Dong Chen and 2 more

Code completion, a highly valuable topic in the software development domain, has been increasingly promoted for use by recent advances large language models (LLMs). To date, visible LLM-based code completion frameworks such as GitHub Copilot and GPT are trained using deep learning over vast quantities of unstructured text open source code. As paramount component cornerstone daily programming tasks, largely boosted professionals' efficiency building real-world systems. In contrast to this...

10.1109/icse48619.2023.00110 article EN 2023-05-01

Adaptive Unpacking of Android Apps

OPENALEX - Publications

Lei Xue Xiapu Luo Le Yu Shuai Wang Dinghao Wu

More and more app developers use the packing services (or packers) to prevent attackers from reverse engineering modifying executable Dex files) of their apps. At same time, malware authors also packers hide malicious component evade signature-based detection. Although there are a few recent studies on unpacking Android apps, it has been shown that evolving can easily circumvent them because they not adaptive changes packers. In this paper, we propose novel approach develop new system, named...

10.1109/icse.2017.40 article EN 2017-05-01

Recursively summarizing enables long-term dialogue memory in large language models

OPENALEX - Publications

Qingyue Wang Yanhe Fu Yanan Cao Shuai Wang Zhiliang Tian and 1 more

10.1016/j.neucom.2025.130193 article EN Neurocomputing 2025-04-01

Enhancing the Description-to-Behavior Fidelity in Android Apps with Privacy Policy

OPENALEX - Publications

Le Yu Xiapu Luo Chenxiong Qian Shuai Wang Hareton Leung

Since more than 96 percent of mobile malware targets the Android platform, various techniques based on static code analysis or dynamic behavior have been proposed to detect malicious apps. As is becoming complicated and stealthy, recent research a promising detection approach that looks for inconsistency between an app's permissions its description. In this paper, we first revisit reveal using description permission will lead many false positives because descriptions often fail declare all...

10.1109/tse.2017.2730198 article EN IEEE Transactions on Software Engineering 2017-07-21

In-memory fuzzing for binary code similarity analysis

OPENALEX - Publications

Shuai Wang Dinghao Wu

Detecting similar functions in binary executables serves as a foundation for many code analysis and reuse tasks. By far, recognizing components remains challenge. Existing research employs either static or dynamic approaches to capture program syntax semantics-level features comparison. However, there exist multiple design limitations previous work, which result relatively high cost, low accuracy scalability, thus severely impede their practical use. In this paper, we present novel method...

10.1109/ase.2017.8115645 article EN 2017-10-01

Metamorphic Testing of Deep Learning Compilers

OPENALEX - Publications

Dongwei Xiao Zhibo Liu Yuanyuan Yuan Qi Pang Shuai Wang

The prosperous trend of deploying deep neural network (DNN) models to diverse hardware platforms has boosted the development learning (DL) compilers. DL compilers take high-level DNN model specifications as input and generate optimized executables for architectures like CPUs, GPUs, various accelerators. Compiling into high-efficiency is not easy: compilation procedure often involves converting several different intermediate representations (IR), e.g., graph IR operator IR, performing...

10.1145/3508035 article EN Proceedings of the ACM on Measurement and Analysis of Computing Systems 2022-02-24

Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

OPENALEX - Publications

Siddharth Varia Shuai Wang Kishaloy Halder Robert Vacareanu Miguel Ballesteros and 5 more

Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth. Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis. 2023.

10.18653/v1/2023.wassa-1.3 article EN cc-by 2023-01-01

XInsight: eXplainable Data Analysis Through The Lens of Causality

OPENALEX - Publications

Pingchuan Ma Rui Ding Shuai Wang Shi Han Dongmei Zhang

In light of the growing popularity Exploratory Data Analysis (EDA), understanding underlying causes knowledge acquired by EDA is crucial. However, it remains under-researched. This study promotes a transparent and explicable perspective on data analysis, called eXplainable (XDA). For this reason, we present XInsight, general framework for XDA. XInsight provides analysis with qualitative quantitative explanations causal non-causal semantics. way, will significantly improve human confidence in...

10.1145/3589301 article EN Proceedings of the ACM on Management of Data 2023-06-13

Exploring Missed Optimizations in WebAssembly Optimizers

OPENALEX - Publications

Zhibo Liu Dongwei Xiao Zongjie Li Shuai Wang Wei Meng

The prosperous trend of deploying complex applications to web browsers has boosted the development WebAssembly (wasm) compilation toolchains. Software written in different high-level programming languages are compiled into wasm executables, which can be executed fast and safely a virtual machine. performance executables depends highly on compiler optimizations. Despite use recent research indicated that real-world slower than anticipated, suggesting deficiencies

10.1145/3597926.3598068 article EN 2023-07-12

On Extracting Specialized Code Abilities from Large Language Models: A Feasibility Study

OPENALEX - Publications

Zongjie Li Chaozheng Wang Pingchuan Ma Chaowei Liu Shuai Wang and 3 more

Recent advances in large language models (LLMs) significantly boost their usage software engineering. However, training a well-performing LLM demands substantial workforce for data collection and annotation. Moreover, datasets may be proprietary or partially open, the process often requires costly GPU cluster. The intellectual property value of commercial LLMs makes them attractive targets imitation attacks, but creating an model with comparable parameters still incurs high costs. This...

10.1145/3597503.3639091 article EN 2024-04-12

UROBOROS: Instrumenting Stripped Binaries with Static Reassembling

OPENALEX - Publications

Shuai Wang Pei Wang Dinghao Wu

Software instrumentation techniques are widely used in program analysis tasks such as profiling, vulnerability discovering, and security-oriented transforming. In this paper, we present an tool called UROBOROS, which supports static on stripped binaries. Due to the lack of relocation debug information, reverse engineering binaries is challenging. Compared with previous work, UROBOROS can provide complete, easy-to-use, transparent, efficient complete by statically recovering relocatable...

10.1109/saner.2016.106 article EN 2016-03-01

MDPFuzz: testing models solving Markov decision processes

OPENALEX - Publications

Qi Pang Yuanyuan Yuan Shuai Wang

The Markov decision process (MDP) provides a mathematical frame- work for modeling sequential decision-making problems, many of which are crucial to security and safety, such as autonomous driving robot control. rapid development artificial intelligence research has created efficient methods solving MDPs, deep neural networks (DNNs), reinforcement learning (RL), imitation (IL). However, these popular models MDPs neither thoroughly tested nor rigorously reliable.

10.1145/3533767.3534388 article EN 2022-07-15

Semantics-Aware Machine Learning for Function Recognition in Binary Code

OPENALEX - Publications

Shuai Wang Pei Wang Dinghao Wu

Function recognition in program binaries serves as the foundation for many binary instrumentation and analysis tasks. However, are usually stripped before distribution, function information is indeed absent most binaries. By far, identifying functions remains a challenge. Recent research work proposes to recognize code through machine learning techniques. The model, including typical entry point patterns, automatically constructed learning. we observed that previous only leverages...

10.1109/icsme.2017.59 article EN 2017-09-01

How far we have come: testing decompilation correctness of C decompilers

OPENALEX - Publications

Zhibo Liu Shuai Wang

A C decompiler converts an executable (the output from a compiler) into source code. The recovered code, once recompiled, will produce with the same functionality as original executable. With over twenty years of development, decompilers have been widely used in production to support reverse engineering applications, including legacy software migration, security retrofitting, comprehension, and act first step launching adversarial exploitations. As paramount component trust base numerous...

10.1145/3395363.3397370 article EN 2020-07-13

Protecting Intellectual Property of Large Language Model-Based Code Generation APIs via Watermarks

OPENALEX - Publications

Zongjie Li Chaozheng Wang Shuai Wang Cuiyun Gao

The rise of large language model-based code generation (LLCG) has enabled various commercial services and APIs. Training LLCG models is often expensive time-consuming, the training data are large-scale even inaccessible to public. As a result, risk intellectual property (IP) theft over (e.g., via imitation attacks) been serious concern. In this paper, we propose first watermark (WM) technique protect APIs from remote attacks. Our proposed based on replacing tokens in an output with their...

10.1145/3576915.3623120 article EN Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security 2023-11-15

RedDroid: Android Application Redundancy Customization Based on Static Analysis

OPENALEX - Publications

Yufei Jiang Qinkun Bao Shuai Wang Xiao Liu Dinghao Wu

Smartphone users are installing more and bigger apps. At the meanwhile, each app carries considerable amount of unused stuff, called software bloat, in its apk file. As a result, resources smartphone, such as hard disk network bandwidth, has become even insufficient than ever before. Therefore, it is critical to investigate existing apps on market development identify sources bloat develop techniques tools remove bloat. In this paper, we present comprehensive study Android applications,...

10.1109/issre.2018.00029 article EN 2018-10-01

CipherGuard: Compiler-aided Mitigation against Ciphertext Side-channel Attacks

OPENALEX - Publications

Ke Jiang Sen Deng Yinshuai Li Shuai Wang Tianwei Zhang and 1 more

Cryptographic implementations bolster security against timing side-channel attacks by integrating constant-time components. However, the new ciphertext side channels resulting from deterministic memory encryption in Trusted Execution Environments (TEEs), enable ciphertexts to manifest identifiable patterns when being sequentially written same address. Attackers with read access encrypted TEEs can potentially deduce plaintexts analyzing these changing patterns. In this paper, we design...

10.48550/arxiv.2502.13401 preprint EN arXiv (Cornell University) 2025-02-18

Reeq : Testing and Mitigating Ethically Inconsistent Suggestions of Large Language Models with Reflective Equilibrium

OPENALEX - Publications

Pingchuan Ma Zhao-Yu Wang Zongjie Li Zhenlan Ji Ao Sun and 2 more

LLMs increasingly serve as general-purpose AI assistants in daily life, and their subtly unethical suggestions become a serious real concern. It is demanding to test mitigate such from LLMs. Despite existing efforts detect violations of “testable” facets ethics (e.g., fairness testing), it challenging encode the full scope justice, deontology) into oracle without human annotations or intervention. In this paper, we take inspiration reflective equilibrium, modern moral reasoning method...

10.1145/3722554 article EN ACM Transactions on Software Engineering and Methodology 2025-03-11

A simple colorimetric and paper-based-smartphone sensing platform based on the enhanced peroxidase-like activity of Al doping Prussian blue for point-of-care detection of GSH

OPENALEX - Publications

Ying Yang Liuyan Wei Weiqin You Hao Huang Shuai Wang and 4 more

10.1016/j.talanta.2025.128020 article EN Talanta 2025-04-01

Coming Soon ...