- Software Engineering Research
- Topic Modeling
- Software Testing and Debugging Techniques
- Natural Language Processing Techniques
- Cloud Computing and Resource Management
- Software System Performance and Reliability
- Software Reliability and Analysis Research
- Business Process Modeling and Analysis
- Scientific Computing and Data Management
- Web Data Mining and Analysis
- Distributed and Parallel Computing Systems
- Service-Oriented Architecture and Web Services
- IoT and Edge/Fog Computing
- Multimodal Machine Learning Applications
- Fullerene Chemistry and Applications
- Graphene Research and Applications
- Open Source Software Innovations
- Caching and Content Delivery
- Inertial Sensor and Navigation
- Boron and Carbon Nanomaterials Research
- Cloud Computing and Remote Desktop Technologies
- Artificial Intelligence in Law
- GNSS Positioning and Interference
- Smart Grid Security and Resilience
- Video Analysis and Summarization
Nanjing University
2016-2025
Institute of Software
2015-2024
Affiliated Hospital of Nantong University
2022
Nantong University
2022
Beijing University of Posts and Telecommunications
2015-2020
Nanjing University of Aeronautics and Astronautics
2018-2019
Nanjing University of Science and Technology
2015
Hubei University
2012
Hubei University of Science and Technology
2002
Peking University
1992
Code summarization aims to generate brief natural language descriptions for source code. The state-of-the-art approaches follow a transformer-based encoder-decoder architecture. As code is highly structured and follows strict grammars, its Abstract Syntax Tree (AST) is widely used for encoding structural information. However, ASTs are much longer than the corresponding code. Existing approaches ignore this size constraint and simply feed the whole linearized AST into the encoders. We argue that such a simple process makes it...
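As a rough illustration of why a linearized AST outgrows its source, here is a minimal sketch using Python's standard `ast` module; the example function and the pre-order linearization are illustrative assumptions, not taken from the paper:

```python
import ast

code = "def add(a, b):\n    return a + b\n"

def linearize(node: ast.AST) -> list[str]:
    """Pre-order traversal that emits one token per AST node."""
    tokens = [type(node).__name__]
    for child in ast.iter_child_nodes(node):
        tokens.extend(linearize(child))
    return tokens

ast_tokens = linearize(ast.parse(code))
print(len(code.split()), "code tokens vs", len(ast_tokens), "AST node tokens")
```

Even for this two-line function the node sequence is already notably longer than the token sequence, and the gap widens with deeper nesting.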
In recent years, pre-trained language models have seen significant success in natural language processing and have been increasingly applied to code-related tasks. Code intelligence tasks have shown promising performance with the support of code models. Pre-processing simplification methods have been introduced to prune tokens from the model's input while maintaining task effectiveness. These improve efficiency by reducing computational costs. Post-prediction methods provide explanations for outcomes, enhancing reliability...
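One generic form of input simplification is dropping tokens that carry little signal for the task, such as comments and layout tokens. A minimal sketch with Python's `tokenize` module follows; the pruning criteria here are illustrative, and the paper's actual methods may prune differently:

```python
import io
import tokenize

def prune_tokens(source: str) -> list[str]:
    """Drop comments and structural whitespace tokens before
    feeding a code model, keeping only content-bearing tokens."""
    kept = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in (tokenize.COMMENT, tokenize.NL, tokenize.NEWLINE,
                        tokenize.INDENT, tokenize.DEDENT, tokenize.ENDMARKER):
            continue
        kept.append(tok.string)
    return kept

print(prune_tokens("x = 1  # init counter\ny = x + 2\n"))
# -> ['x', '=', '1', 'y', '=', 'x', '+', '2']
```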
Pre-trained models (PTMs) have succeeded in various software engineering (SE) tasks following the “pre-train then fine-tune” paradigm. As fully fine-tuning all parameters of PTMs can be computationally expensive, a potential solution is parameter-efficient fine-tuning (PEFT), which freezes the pre-trained parameters while introducing extra trainable parameters. Although PEFT methods have been applied to SE tasks, researchers often focus on specific scenarios and lack a comprehensive comparison from different aspects such as field, size,...
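A minimal PyTorch sketch of the PEFT idea: freeze a stand-in backbone and train only a small bottleneck adapter. The adapter design, dimensions, and the toy backbone are illustrative assumptions, not the paper's setup:

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: down-project, non-linearity, up-project, residual."""
    def __init__(self, dim: int, bottleneck: int = 16):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, x):
        return x + self.up(torch.relu(self.down(x)))

# Stand-in for a pre-trained encoder (hypothetical, for illustration only).
backbone = nn.Sequential(nn.Linear(768, 768), nn.Linear(768, 768))

# PEFT step 1: freeze every pre-trained weight ...
for p in backbone.parameters():
    p.requires_grad = False

# ... step 2: attach a small trainable module and train only that.
model = nn.Sequential(backbone, Adapter(768))

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable params: {trainable} / {total}")
```

The printed ratio makes the appeal concrete: only the adapter's few thousand parameters receive gradients, while the frozen backbone dominates the total count.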
While the majority of existing pre-trained models for code learn source-level features such as tokens and abstract syntax trees, some other works focus on learning compiler intermediate representations (IRs). Existing IR-based models typically utilize IR instructions, control and data flow graphs (CDFGs), call graphs, etc. However, these methods confuse variable nodes with instruction nodes in a CDFG, fail to distinguish different types of flows, and the neural networks they use fail to capture long-distance dependencies...
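A toy sketch of the distinction this critique points at, using `networkx` to keep variable and instruction nodes separate and to type each flow edge; the miniature IR and edge labels are made up for illustration:

```python
import networkx as nx

# Toy IR for: a = 1; b = a + 2
g = nx.MultiDiGraph()
g.add_node("i1", kind="instruction", text="a = 1")
g.add_node("i2", kind="instruction", text="b = a + 2")
g.add_node("a", kind="variable")
g.add_node("b", kind="variable")

g.add_edge("i1", "i2", flow="control")  # control flow between instructions
g.add_edge("i1", "a", flow="def")       # i1 defines a
g.add_edge("a", "i2", flow="use")       # i2 uses a
g.add_edge("i2", "b", flow="def")

for u, v, attrs in g.edges(data=True):
    print(u, "->", v, attrs["flow"])
```

Keeping node kinds and flow types as explicit attributes is what a plain CDFG encoding loses when all nodes and edges are treated uniformly.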
To date, over 40 Automated Program Repair (APR) tools have been designed with varying bug-fixing strategies, which have been demonstrated to have complementary performance in terms of being effective for different bug classes. Intuitively, it should be feasible to improve overall APR effectiveness by assembling existing tools. Unfortunately, simply invoking all available tools for a given bug can result in unacceptable costs in execution as well as patch validation (via expensive testing). Therefore, while assembling APR tools is appealing, it requires an...
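One way to make the cost problem concrete is a budget-constrained selection policy. The sketch below greedily schedules tools by fix rate per unit cost; the tool names, statistics, and the greedy policy itself are hypothetical illustrations, not the paper's approach:

```python
# Hypothetical per-tool statistics: historical fix rate and average cost (minutes).
tools = {
    "ToolA": {"fix_rate": 0.30, "cost": 20},
    "ToolB": {"fix_rate": 0.25, "cost": 5},
    "ToolC": {"fix_rate": 0.10, "cost": 2},
}

def schedule(tools: dict, budget: float) -> list[str]:
    """Greedy: try tools in decreasing fix-rate-per-cost order until budget runs out."""
    order = sorted(tools, key=lambda t: tools[t]["fix_rate"] / tools[t]["cost"],
                   reverse=True)
    plan, spent = [], 0.0
    for t in order:
        if spent + tools[t]["cost"] <= budget:
            plan.append(t)
            spent += tools[t]["cost"]
    return plan

print(schedule(tools, budget=10))  # -> ['ToolB', 'ToolC']
```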
Recent years have seen the successful application of large pre-trained models to code representation learning, resulting in substantial improvements on many code-related downstream tasks. But there are issues surrounding their application to SE tasks. First, the majority focus on pre-training only the encoder of the Transformer. For generation tasks that are addressed using models with an encoder-decoder architecture, however, there is no reason why the decoder should be left out during pre-training. Second, existing models, including state-of-the-art...
Recent years have seen the remarkable capabilities of large language models (LLMs) for code generation. Different from existing work that evaluates the correctness of the code generated by LLMs, we propose to further evaluate its efficiency. More efficient code can lead to higher performance and better execution efficiency of programs and software completed with LLM-assisted programming. First, we evaluate LLMs on two benchmarks, HumanEval and MBPP. Then, we choose a set of programming problems from the online judge platform LeetCode to conduct a more difficult evaluation....
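A minimal harness for the kind of efficiency comparison described, timing two hypothetical LLM-generated solutions to the same problem with Python's `timeit`; both snippets are invented for illustration:

```python
import timeit

# Two hypothetical LLM-generated solutions to the same problem:
# summing 1..n naively vs. with the closed-form formula.
naive = "def s(n):\n    return sum(range(1, n + 1))"
closed_form = "def s(n):\n    return n * (n + 1) // 2"

def best_runtime(src: str, n: int = 10**5, repeat: int = 5) -> float:
    """Execute the generated source, then time the resulting function."""
    scope = {}
    exec(src, scope)  # load the generated function into a fresh namespace
    return min(timeit.repeat(lambda: scope["s"](n), number=100, repeat=repeat))

print("naive:      ", best_runtime(naive))
print("closed form:", best_runtime(closed_form))
```

Taking the minimum over repeats is the usual way to reduce timing noise; ranking solutions by such measurements is one simple instance of the efficiency evaluation the abstract proposes.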
Recent years have seen a rise in neural program repair (NPR) systems in the software engineering community, which adopt advanced deep learning techniques to automatically fix bugs. Having a comprehensive understanding of existing systems can facilitate new improvements in this area and provide practical guidance for users. However, we observe two potential weaknesses in the current evaluation of NPR systems: ① published systems are trained with varying data, and ② they are roughly evaluated through the number of totally fixed bugs. Questions such as...
Clinical management of subsolid nodules (SSNs) is defined by the suspicion of tumor invasiveness. We sought to develop an artificial intelligence (AI) algorithm for invasiveness assessment of lung adenocarcinoma manifesting as radiological SSNs, and investigated the performance of this algorithm in the classification of SSNs related to invasiveness. A retrospective chest computed tomography (CT) dataset (n = 1,589) was constructed to train (85%) and internally test (15%) the proposed AI diagnostic tool, SSNet. Diagnostic performance was evaluated on the hold-out set...
Code summarization is to provide a high-level comment for a code snippet that typically describes the function and intent of the given code. Recent years have seen the successful application of data-driven code summarization. To improve model performance, numerous approaches use abstract syntax trees (ASTs) to represent the structural information of code, which is considered by most researchers to be the main factor distinguishing code from natural language. Then, such methods are trained on large-scale labeled datasets...
Recommending APIs is a practical and essential feature of IDEs. Improving the accuracy of API recommendations is an effective way to improve coding efficiency. With the success of deep learning in software engineering, state-of-the-art (SOTA) performance in API recommendation has also been achieved by deep-learning-based approaches. However, existing SOTA approaches either consider only the sequences of code snippets or rely on complex operations for extracting hand-crafted features, all of which carry potential risks of under-encoding the input...
The rapid development of web services provides many opportunities for companies to migrate their business processes to the Internet for wider accessibility and higher collaboration efficiency. However, the open, dynamic, and ever-changing environment also brings challenges in protecting these processes. There are certain process monitoring methods, with recently proposed ones based on state changes of artifacts or places; however, they do not mention defending interactions from outer tampering, where events could be detected by...