NFDI4DS | UHH-SEMS - Publication Details

Song Wang

ORCID: 0000-0003-0617-2877

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100326214

Research Areas

Software Engineering Research
Software Reliability and Analysis Research
Machine Learning and Data Classification
Software Testing and Debugging Techniques
Advanced Malware Detection Techniques
Adversarial Robustness in Machine Learning
Topic Modeling
Natural Language Processing Techniques
Web Data Mining and Analysis
Software System Performance and Reliability
Quality and Safety in Healthcare
Real-time simulation and control systems
Mobile Crowdsensing and Crowdsourcing
Scientific Computing and Data Management
Safety Systems Engineering in Autonomy
Radiomics and Machine Learning in Medical Imaging
Artificial Intelligence in Law
Head and Neck Cancer Studies
Hate Speech and Cyberbullying Detection
Artificial Intelligence in Healthcare and Education
Advanced Neural Network Applications
Ferroelectric and Negative Capacitance Devices
AI and Big Data Applications
Engineering Education and Technology
Semiconductor materials and devices

York University
2021-2025

Lanzhou Jiaotong University
2024

Hunan Cancer Hospital
2023

Central South University
2023

Lyceum of the Philippines University
2023

East University Of Heilongjiang
2022

Southeast University
2022

Taiwan Semiconductor Manufacturing Company (United States)
2010

Hewlett-Packard (United States)
2009

High performance 22/20nm FinFET CMOS devices with advanced high-K/metal gate scheme

OPENALEX - Publications

Chau-Neng Wu Dongyang Lin A. Keshavarzi Cheng-Tung Huang C.T. Chan and 58 more

A high performance 22/20nm CMOS bulk FinFET achieves the best in-class N/P I<inf>on</inf> values of 1200/1100 µA/µm for I<inf>off</inf>=100nA/µm at 1V. Excellent device electrostatic control is demonstrated gate length (L<inf>gate</inf>) down to 20nm. Dual-Epitaxy and multiple stressors are essential boost performance. Dual workfunction (WF) with an advanced High-K/Metal (HK/MG) stack deployed in integration-friendly process flow. This dual-WF approach provides excellent...

10.1109/iedm.2010.5703430 article EN International Electron Devices Meeting 2010-12-01

High-quality remanufacturing of HSLA-100 steel through the underwater laser directed energy deposition in an underwater hyperbaric environment

OPENALEX - Publications

Zhandong Wang Kun Yang Mingzhi Chen Yi Lu Song Wang and 4 more

10.1016/j.surfcoat.2022.128370 article EN Surface and Coatings Technology 2022-03-23

CrashTranslator: Automatically Reproducing Mobile Application Crashes Directly from Stack Trace

OPENALEX - Publications

Yuchao Huang Junjie Wang Zhe Liu Yawen Wang Song Wang and 3 more

Crash reports are vital for software maintenance since they allow the developers to be informed of problems encountered in mobile application. Before fixing, need reproduce crash, which is an extremely time-consuming and tedious task. Existing studies conducted automatic crash reproduction with natural language described reproducing steps. Yet we find a non-neglectable portion only contain stack trace when occurs. Such stack-trace-only crashes merely reveal last GUI page occurs, lack...

10.1145/3597503.3623298 article EN cc-by 2024-02-06

Context- and Fairness-Aware In-Process Crowdworker Recommendation

OPENALEX - Publications

Junjie Wang Ye Yang Song Wang Jun Hu Qing Wang

Identifying and optimizing open participation is essential to the success of software development. Existing studies highlighted importance worker recommendation for crowdtesting tasks in order improve bug detection efficiency, i.e., detect more bugs with fewer workers. However, there are a couple limitations existing work. First, these mainly focus on one-time recommendations based expertise matching at beginning new task. Second, results suffer from severe popularity bias, highly...

10.1145/3487571 article EN ACM Transactions on Software Engineering and Methodology 2022-03-07

Assessing the Impact of GPT-4 Turbo in Generating Defeaters for Assurance Cases

OPENALEX - Publications

Kimya Khakzad Shahandashti Mithila Sivakumar Mohammad Mahdi Mohajer Alvine Boaye Belle Song Wang and 1 more

Assurance cases (ACs) are structured arguments that allow verifying the correct implementation of created systems' non-functional requirements (e.g., safety, security). This allows for preventing system failure. The latter may result in catastrophic outcomes loss lives). ACs support certification systems compliance with industrial standards, e.g., DO-178C and ISO 26262. Identifying defeaters ---arguments challenge these --- is crucial enhancing ACs' robustness confidence. To automatically...

10.1145/3650105.3652291 article EN 2024-04-14

One Sentence Can Kill the Bug: Auto-replay Mobile App Crashes from One-sentence Overviews

OPENALEX - Publications

Yuchao Huang Junjie Wang Zhe Liu Mingyang Li Song Wang and 3 more

10.1109/tse.2025.3535938 article EN IEEE Transactions on Software Engineering 2025-01-01

Bias Unveiled: Investigating Social Bias in LLM-Generated Code

OPENALEX - Publications

Lin Ling Fazle Rabbi Song Wang Jinqiu Yang

Large language models (LLMs) have significantly advanced the field of automated code generation. However, a notable research gap exists in evaluating social biases that may be present produced by LLMs. To solve this issue, we propose novel fairness framework, i.e., Solar, to assess and mitigate LLM-generated code. Specifically, Solar can automatically generate test cases for quantitatively uncovering auto-generated quantify severity generated code, develop dataset covers diverse set...

10.1609/aaai.v39i26.34961 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Evaluating API-Level Deep Learning Fuzzers: A Comprehensive Benchmarking Study

OPENALEX - Publications

Nima Shiri Harzevili Moshi Wei Mohammad Mahdi Mohajer Song Wang Hung Viet Pham

In recent years, the practice of fuzzing Deep Learning (DL) APIs has received significant attention in software engineering community. Many API-level DL fuzzers have been proposed to test individual by generating malformed input. Although these effective detecting bugs and outperforming prior work, there remains a gap bench-marking them against ground-truth, real-world libraries. Existing comparisons among primarily focus on detected but do not offer comprehensive, in-depth evaluation...

10.1145/3729533 article EN ACM Transactions on Software Engineering and Methodology 2025-04-15

Context-Aware Personalized Crowdtesting Task Recommendation

OPENALEX - Publications

Junjie Wang Ye Yang Song Wang Chunyang Chen Dandan Wang and 1 more

Crowdsourced software testing (short for crowdtesting) is a special type of crowdsourcing. It requires that crowdworkers master appropriate skill-sets and commit significant effort completing task. Abundant uncertainty may arise during crowdtesting process due to imperfect information between the task requester crowdworkers. For example, worker frequently chooses tasks in an ad hoc manner context, inappropriate selection lead worker's failing detect any bugs, unpaid wasted. Recent studies...

10.1109/tse.2021.3081171 article EN IEEE Transactions on Software Engineering 2021-05-18

Prompt Engineering or Fine Tuning: An Empirical Assessment of Large Language Models in Automated Software Engineering Tasks

OPENALEX - Publications

Jiho Shin C.J. Tang Tahmineh Mohati Maleknaz Nayebi Song Wang and 1 more

In this paper, we investigate the effectiveness of state-of-the-art LLM, i.e., GPT-4, with three different prompting engineering techniques (i.e., basic prompting, in-context learning, and task-specific prompting) against 18 fine-tuned LLMs on typical ASE tasks, code generation, summarization, translation. Our quantitative analysis these strategies suggests that prompt GPT-4 cannot necessarily significantly outperform fine-tuning smaller/older in all tasks. For comment best strategy prompt)...

10.48550/arxiv.2310.10508 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Demystifying and Detecting Misuses of Deep Learning APIs

OPENALEX - Publications

Moshi Wei Nima Shiri Harzevili Yue-Kai Huang Jinqiu Yang Junjie Wang and 1 more

Deep Learning (DL) libraries have significantly impacted various domains in computer science over the last decade. However, developers often face challenges when using DL APIs, as development paradigm of applications differs greatly from traditional software development. Existing studies on API misuse mainly focus software, leaving a gap understanding within APIs. To address this gap, we present first comprehensive study TensorFlow and PyTorch. Specifically, collected dataset 4,224 commits...

10.1145/3597503.3639177 article EN 2024-04-12

Effectiveness of ChatGPT for Static Analysis: How Far Are We?

OPENALEX - Publications

Mohammad Mahdi Mohajer Reem Aleithan Nima Shiri Harzevili Moshi Wei Alvine Boaye Belle and 2 more

10.1145/3664646.3664777 article EN 2024-07-10

ClarifyGPT: A Framework for Enhancing LLM-Based Code Generation via Requirements Clarification

OPENALEX - Publications

Fangwen Mu Lin Shi Song Wang Zhuohao Yu Binquan Zhang and 3 more

Large Language Models (LLMs), such as ChatGPT, have demonstrated impressive capabilities in automatically generating code from provided natural language requirements. However, real-world practice, it is inevitable that the requirements written by users might be ambiguous or insufficient. Current LLMs will directly generate programs according to those unclear requirements, regardless of interactive clarification, which likely deviate original user intents. To bridge gap, we introduce a novel...

10.1145/3660810 article EN Proceedings of the ACM on software engineering. 2024-07-12

Using GPT-4 Turbo to Automatically Identify Defeaters in Assurance Cases

OPENALEX - Publications

Kimya Khakzad Shahandashti Alvine Boaye Belle Mohammad Mahdi Mohajer Oluwafemi Odu Timothy C. Lethbridge and 2 more

10.1109/rew61692.2024.00011 article EN 2024-06-24

Domain Adaptation for Code Model-Based Unit Test Case Generation

OPENALEX - Publications

Jiho Shin Sepehr Hashtroudi Hadi Hemmati Song Wang

10.1145/3650212.3680354 article EN 2024-09-11

Automatic Comment Generation via Multi-Pass Deliberation

OPENALEX - Publications

Fangwen Mu Xiao Chen Lin Shi Song Wang Qing Wang

Deliberation is a common and natural behavior in human daily life. For example, when writing papers or articles, we usually first write drafts, then iteratively polish them until satisfied. In light of such cognitive process, propose DECOM, which multi-pass deliberation framework for automatic comment generation. DECOM consists multiple Models one Evaluation Model. Given code snippet, extract keywords from the retrieve similar fragment pre-defined corpus. Then, treat retrieved as initial...

10.1145/3551349.3556917 article EN 2022-10-10

API recommendation for machine learning libraries: how far are we?

OPENALEX - Publications

Moshi Wei Yuchao Huang Junjie Wang Jiho Shin Nima Shiri Harzevili and 1 more

Application Programming Interfaces (APIs) are designed to help developers build software more effectively. Recommending the right APIs for specific tasks is gaining increasing attention among researchers and developers. However, most of existing approaches mainly evaluated general programming using statically typed languages such as Java. Little known about their practical effectiveness usefulness machine learning (ML) with dynamically Python, whose paradigms fundamentally different from...

10.1145/3540250.3549124 article EN Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering 2022-11-07

The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks

OPENALEX - Publications

Jiho Shin Moshi Wei Junjie Wang Lin Shi Song Wang

Machine learning (ML) has been increasingly used in a variety of domains, while solving ML programming tasks poses unique challenges due to the fundamental difference nature and construct general tasks, especially for developers who do not have backgrounds. Automatic code generation that produces snippet from natural language description can be promising technique accelerate tasks. In recent years, although many deep learning-based neural models proposed with high accuracy, fact most them...

10.1145/3630009 article EN ACM Transactions on Software Engineering and Methodology 2023-10-23

History-Driven Fuzzing For Deep Learning Libraries

OPENALEX - Publications

Nima Shiri Harzevili Mohammad Mahdi Mohajer Moshi Wei Hung Viet Pham Song Wang

Recently, many Deep Learning (DL) fuzzers have been proposed for API-level testing of DL libraries. However, they either perform unguided input generation (e.g., not considering the relationship between API arguments when generating inputs) or only support a limited set corner-case test inputs. Furthermore, developer APIs crucial library development remain untested, as are typically well documented and lack clear usage guidelines, unlike end-user APIs. This makes them more challenging target...

10.1145/3688838 article EN ACM Transactions on Software Engineering and Methodology 2024-08-16

Deep API Sequence Generation via Golden Solution Samples and API Seeds

OPENALEX - Publications

Yue-Kai Huang Junjie Wang Song Wang Moshi Wei Lin Shi and 2 more

Automatic API recommendation can accelerate developers’ programming, and has been studied for years. There are two orthogonal lines of approaches this task, i.e., information retrieval-based (IR-based) sequence to (seq2seq) model based approaches. Although these were reported have remarkable performance, our observation finds major drawbacks, IR-based lack the consideration relations among recommended APIs, seq2seq models do not API’s semantic meaning. To alleviate above problems, we propose...

10.1145/3695995 article EN ACM Transactions on Software Engineering and Methodology 2024-09-13

Which API is Faster: Mining Fine-grained Performance Opinion from Online Discussions

OPENALEX - Publications

Yue-Kai Huang Junjie Wang Song Wang Ru-Peng Zhang Qing Wang

10.1109/qrs62785.2024.00066 article EN 2024-07-01

Individualized CTV Delineation Method for Nasopharyngeal Carcinoma Based on Safe Distance Expansion from Primary Tumor: A Preliminary Efficacy Report from a Prospective Observational Study

OPENALEX - Publications

Zhuo Wu He Qian Yanping Li Jennifer Xiao Song Wang and 3 more

10.1016/j.ijrobp.2024.07.1761 article EN International Journal of Radiation Oncology*Biology*Physics 2024-09-27

Checker Bug Detection and Repair in Deep Learning Libraries

OPENALEX - Publications

Nima Shiri Harzevili Mohammad Mahdi Mohajer Jiho Shin Moshi Wei Gias Uddin and 6 more

Checker bugs in Deep Learning (DL) libraries are critical yet not well-explored. These often concealed the input validation and error-checking code of DL can lead to silent failures, incorrect results, or unexpected program behavior applications. Despite their potential significantly impact reliability performance DL-enabled systems built with these libraries, checker have received limited attention. We present first comprehensive study two widely-used i.e., TensorFlow PyTorch. Initially, we...

10.48550/arxiv.2410.06440 preprint EN arXiv (Cornell University) 2024-10-08

Research on risk prediction of bridge support maintenance construction based on machine learning

OPENALEX - Publications

Song Wang Xiaozhong Li

The risk prediction in the process of bridge support maintenance and construction is very important to stability safety whole structure. Taking a project as case, this study analyzed sorted out factors from three aspects environment, management through expert investigation literature reading based on machine learning algorithm model, established index system, adopted grid search optimize hyperparameters model. model evaluation indexes R2 MAE were used evaluate results. results show that...

10.1117/12.3034433 article EN 2024-10-16

Coming Soon ...