NFDI4DS | UHH-SEMS - Publication Details

CoCoNuT: combining context-aware neural translation models using ensemble for program repair

OPENALEX - Publications

Thibaud Lutellier Hung Viet Pham Lawrence Pang Yitong Li Moshi Wei and 1 more

Automated generate-and-validate (GV) program repair techniques (APR) typically rely on hard-coded rules, thus only fixing bugs following specific fix patterns. These rules require a significant amount of manual effort to discover and it is hard adapt these different programming languages.

10.1145/3395363.3397369 article EN 2020-07-13

CLEAR

OPENALEX - Publications

Moshi Wei Nima Shiri Harzevili Yuchao Huang Junjie Wang Song Wang

Automatic API recommendation has been studied for years. There are two orthogonal lines of approaches this task, i.e., information-retrieval-based (IR-based) and neural-based methods. Although these were reported having remarkable performance, our observation shows that existing can fail due to the following reasons: 1) most IR-based treat task queries as bag-of-words use word embedding represent queries, which cannot capture sequential semantic information. 2) both weak at distinguishing...

10.1145/3510003.3510159 article EN Proceedings of the 44th International Conference on Software Engineering 2022-05-21

Automatic Unit Test Generation for Machine Learning Libraries: How Far Are We?

OPENALEX - Publications

Song Wang Nishtha Shrestha Abarna Kucheri Subburaman Junjie Wang Moshi Wei and 1 more

Automatic unit test generation that explores the input space and produces effective cases for given programs have been studied decades. Many tools can help generate with high structural coverage over a program examined. However, fact existing are mainly evaluated on general software calls into question about its practical effectiveness usefulness machine learning libraries, which statistically orientated fundamentally different nature construction from projects. In this paper, we set out to...

10.1109/icse43902.2021.00138 article EN 2021-05-01

Evaluating API-Level Deep Learning Fuzzers: A Comprehensive Benchmarking Study

OPENALEX - Publications

Nima Shiri Harzevili Moshi Wei Mohammad Mahdi Mohajer Song Wang Hung Viet Pham

In recent years, the practice of fuzzing Deep Learning (DL) APIs has received significant attention in software engineering community. Many API-level DL fuzzers have been proposed to test individual by generating malformed input. Although these effective detecting bugs and outperforming prior work, there remains a gap bench-marking them against ground-truth, real-world libraries. Existing comparisons among primarily focus on detected but do not offer comprehensive, in-depth evaluation...

10.1145/3729533 article EN ACM Transactions on Software Engineering and Methodology 2025-04-15

Demystifying and Detecting Misuses of Deep Learning APIs

OPENALEX - Publications

Moshi Wei Nima Shiri Harzevili Yue-Kai Huang Jinqiu Yang Junjie Wang and 1 more

Deep Learning (DL) libraries have significantly impacted various domains in computer science over the last decade. However, developers often face challenges when using DL APIs, as development paradigm of applications differs greatly from traditional software development. Existing studies on API misuse mainly focus software, leaving a gap understanding within APIs. To address this gap, we present first comprehensive study TensorFlow and PyTorch. Specifically, collected dataset 4,224 commits...

10.1145/3597503.3639177 article EN 2024-04-12

Assessing Evaluation Metrics for Neural Test Oracle Generation

OPENALEX - Publications

Jiho Shin Hadi Hemmati Moshi Wei Song Wang

Recently, deep learning models have shown promising results in test oracles generation.Static evaluation metrics from Natural Language Generation (NLG) such as BLEU, CodeBLEU, ROUGE-L, METEOR, and Accuracy, which is mainly based on textual comparisons, been widely adopted to measure the performance of Neural Oracle (NOG) models.However, these NLG-based may not reflect testing effectiveness generated oracle within a suite, often measured by dynamic (execution-based) adequacy code coverage...

10.1109/tse.2024.3433463 article EN IEEE Transactions on Software Engineering 2024-07-25

Effectiveness of ChatGPT for Static Analysis: How Far Are We?

OPENALEX - Publications

Mohammad Mahdi Mohajer Reem Aleithan Nima Shiri Harzevili Moshi Wei Alvine Boaye Belle and 2 more

10.1145/3664646.3664777 article EN 2024-07-10

Deep API Sequence Generation via Golden Solution Samples and API Seeds

OPENALEX - Publications

Yue-Kai Huang Junjie Wang Song Wang Moshi Wei Lin Shi and 2 more

Automatic API recommendation can accelerate developers’ programming, and has been studied for years. There are two orthogonal lines of approaches this task, i.e., information retrieval-based (IR-based) sequence to (seq2seq) model based approaches. Although these were reported have remarkable performance, our observation finds major drawbacks, IR-based lack the consideration relations among recommended APIs, seq2seq models do not API’s semantic meaning. To alleviate above problems, we propose...

10.1145/3695995 article EN ACM Transactions on Software Engineering and Methodology 2024-09-13

ENCORE: Ensemble Learning using Convolution Neural Machine Translation for Automatic Program Repair

OPENALEX - Publications

Thibaud Lutellier Lawrence Pang Viet Pham Moshi Wei Lin Tan

Automated generate-and-validate (G&V) program repair techniques typically rely on hard-coded rules, only fix bugs following specific patterns, and are hard to adapt different programming languages. We propose ENCORE, a new G&V technique, which uses ensemble learning convolutional neural machine translation (NMT) models automatically in multiple take advantage of the randomness hyper-parameter tuning build that combine them using learning. This NMT approach outperforms standard long...

10.48550/arxiv.1906.08691 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Yet another combination of IR- and neural-based comment generation

OPENALEX - Publications

Yuchao Huang Moshi Wei Song Wang Junjie Wang Qing Wang

10.1016/j.infsof.2022.107001 article EN Information and Software Technology 2022-07-20

API recommendation for machine learning libraries: how far are we?

OPENALEX - Publications

Moshi Wei Yuchao Huang Junjie Wang Jiho Shin Nima Shiri Harzevili and 1 more

Application Programming Interfaces (APIs) are designed to help developers build software more effectively. Recommending the right APIs for specific tasks is gaining increasing attention among researchers and developers. However, most of existing approaches mainly evaluated general programming using statically typed languages such as Java. Little known about their practical effectiveness usefulness machine learning (ML) with dynamically Python, whose paradigms fundamentally different from...

10.1145/3540250.3549124 article EN Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering 2022-11-07

CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing

OPENALEX - Publications

Moshi Wei Yuchao Huang Jinqiu Yang Junjie Wang Song Wang

Deep learning-based code processing models have shown good performance for tasks such as predicting method names, summarizing programs, and comment generation. However, despite the tremendous progress, deep learning are often prone to adversarial attacks, which can significantly threaten robustness generalizability of these by leading them misclassification with unexpected inputs. To address above issue, many testing approaches been proposed, however, mainly focus on applications in domains...

10.48550/arxiv.2106.09242 preprint EN cc-by-sa arXiv (Cornell University) 2021-01-01

The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks

OPENALEX - Publications

Jiho Shin Moshi Wei Junjie Wang Lin Shi Song Wang

Machine learning (ML) has been increasingly used in a variety of domains, while solving ML programming tasks poses unique challenges due to the fundamental difference nature and construct general tasks, especially for developers who do not have backgrounds. Automatic code generation that produces snippet from natural language description can be promising technique accelerate tasks. In recent years, although many deep learning-based neural models proposed with high accuracy, fact most them...

10.1145/3630009 article EN ACM Transactions on Software Engineering and Methodology 2023-10-23

Dynamic Encoding and Decoding of Information for Split Learning in Mobile-Edge Computing: Leveraging Information Bottleneck Theory

OPENALEX - Publications

Omar Alhussein Moshi Wei Arashmid Akhavain

Split learning is a privacy-preserving distributed paradigm in which an ML model (e.g., neural network) split into two parts (i.e., encoder and decoder). The shares so-called latent representation, rather than raw data, for training. In mobile-edge computing, network functions (such as traffic forecasting) can be trained via where resides user equipment (UE) decoder the edge network. Based on data processing inequality information bottleneck (IB) theory, we present new framework training...

10.1109/globecom54140.2023.10437933 article EN GLOBECOM 2022 - 2022 IEEE Global Communications Conference 2023-12-04

CoCoFuzzing: Testing Neural Code Models With Coverage-Guided Fuzzing

OPENALEX - Publications

Moshi Wei Yuchao Huang Jinqiu Yang Junjie Wang Song Wang

Deep learning (DL)-based code processing models have demonstrated good performance for tasks such as method name prediction, program summarization, and comment generation. However, despite the tremendous advancements, DL are frequently susceptible to adversarial attacks, which pose a significant threat robustness generalizability of these by causing them misclassify unexpected inputs. To address issue above, numerous testing approaches been proposed; however, primarily target applications in...

10.1109/tr.2022.3208239 article EN IEEE Transactions on Reliability 2022-10-11

History-Driven Fuzzing For Deep Learning Libraries

OPENALEX - Publications

Nima Shiri Harzevili Mohammad Mahdi Mohajer Moshi Wei Hung Viet Pham Song Wang

Recently, many Deep Learning (DL) fuzzers have been proposed for API-level testing of DL libraries. However, they either perform unguided input generation (e.g., not considering the relationship between API arguments when generating inputs) or only support a limited set corner-case test inputs. Furthermore, developer APIs crucial library development remain untested, as are typically well documented and lack clear usage guidelines, unlike end-user APIs. This makes them more challenging target...

10.1145/3688838 article EN ACM Transactions on Software Engineering and Methodology 2024-08-16

Checker Bug Detection and Repair in Deep Learning Libraries

OPENALEX - Publications

Nima Shiri Harzevili Mohammad Mahdi Mohajer Jiho Shin Moshi Wei Gias Uddin and 6 more

Checker bugs in Deep Learning (DL) libraries are critical yet not well-explored. These often concealed the input validation and error-checking code of DL can lead to silent failures, incorrect results, or unexpected program behavior applications. Despite their potential significantly impact reliability performance DL-enabled systems built with these libraries, checker have received limited attention. We present first comprehensive study two widely-used i.e., TensorFlow PyTorch. Initially, we...

10.48550/arxiv.2410.06440 preprint EN arXiv (Cornell University) 2024-10-08

Development of a scalable, immersive, multiplayer game for teaching engineering courses

OPENALEX - Publications

Tahzinul Islam Moshi Wei Alidad Amirfazli

A scalable and immersive game was developed to serve as a monthly concept review for theory-heavy engineering courses (such fluid dynamics or heat transfer). It designed such that in-game items content may be dynamically replaced easily with an Excel data table, without the need further programming. is expected course instructors use tool by simply updating table rapidly tailor any course. Even room/zone designs parametrized using table. Given automation level of tool, its scalability...

10.24908/pceea.2023.17136 article EN Proceedings of the Canadian Engineering Education Association (CEEA) 2024-03-04

SkipAnalyzer: A Tool for Static Code Analysis with Large Language Models

OPENALEX - Publications

Mohammad Mahdi Mohajer Reem Aleithan Nima Shiri Harzevili Moshi Wei Alvine Boaye Belle and 2 more

We introduce SkipAnalyzer, a large language model (LLM)-powered tool for static code analysis. SkipAnalyzer has three components: 1) an LLM-based bug detector that scans source and reports specific types of bugs, 2) false-positive filter can identify bugs in the results detectors (e.g., result step to improve detection accuracy, 3) patch generator generate patches detected above. As proof-of-concept, is built on ChatGPT, which exhibited outstanding performance various software engineering...

10.48550/arxiv.2310.18532 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks

OPENALEX - Publications

Jiho Shin Moshi Wei Junjie Wang Lin Shi Song Wang

Machine learning (ML) has been increasingly used in a variety of domains, while solving ML programming tasks poses unique challenges because the fundamentally different nature and construction from general tasks, especially for developers who do not have backgrounds. Automatic code generation that produces snippet natural language description can be promising technique to accelerate tasks. In recent years, although many deep learning-based neural models proposed with high accuracy, fact most...

10.48550/arxiv.2305.09082 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Automatic Unit Test Generation for Deep Learning Frameworks based on API Knowledge

OPENALEX - Publications

Arunkaleeshwaran Narayanan Nima Shiri Harzevili Junjie Wang Shi Lin Moshi Wei and 1 more

Many automatic unit test generation tools that can generate cases with high coverage over a program have been proposed. However, most of these are ineffective on deep learning (DL) frameworks due to the fact many APIs expect inputs follow specific API knowledge. To fill this gap, we propose MUTester for by leveraging constraints mined from corresponding documentation and usage patterns code fragments in Stack Overflow (SO). Particularly, first set 18 rules mining documents. We then use...

10.48550/arxiv.2307.00404 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Dynamic Encoding and Decoding of Information for Split Learning in Mobile-Edge Computing: Leveraging Information Bottleneck Theory

OPENALEX - Publications

Omar Alhussein Moshi Wei Arashmid Akhavain

Split learning is a privacy-preserving distributed paradigm in which an ML model (e.g., neural network) split into two parts (i.e., encoder and decoder). The shares so-called latent representation, rather than raw data, for training. In mobile-edge computing, network functions (such as traffic forecasting) can be trained via where resides user equipment (UE) decoder the edge network. Based on data processing inequality information bottleneck (IB) theory, we present new framework training...

10.48550/arxiv.2309.02787 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Assessing Evaluation Metrics for Neural Test Oracle Generation

OPENALEX - Publications

Jiho Shin Hadi Hemmati Moshi Wei Song Wang

In this work, we revisit existing oracle generation studies plus ChatGPT to empirically investigate the current standing of their performance in both NLG-based and test adequacy metrics. Specifically, train run four state-of-the-art models on five two metrics for our analysis. We apply different correlation analyses between these sets Surprisingly, found no significant For instance, oracles generated from project activemq-artemis had highest all among studied NOGs, however, it most number...

10.48550/arxiv.2310.07856 preprint EN other-oa arXiv (Cornell University) 2023-01-01

A Survey on Query-based API Recommendation

OPENALEX - Publications

Moshi Wei Nima Shiri Harzevili Alvine Boaye Belle Junjie Wang Lin Shi and 2 more

Application Programming Interfaces (APIs) are designed to help developers build software more effectively. Recommending the right APIs for specific tasks has gained increasing attention among researchers and in recent years. To comprehensively understand this research domain, we have surveyed analyze API recommendation studies published last 10 Our study begins with an overview of structure tools. Subsequently, systematically prior pose four key questions. For RQ1, examine volume papers...

10.48550/arxiv.2312.10623 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01