- Advanced Graph Neural Networks
- Privacy-Preserving Technologies in Data
- IoT and Edge/Fog Computing
- Stochastic Gradient Optimization Techniques
- Recommender Systems and Techniques
- Optimization and Search Problems
- Graph Theory and Algorithms
- Privacy, Security, and Data Protection
- Cryptography and Data Security
- Reinforcement Learning in Robotics
- Advanced Neural Network Applications
- Neural Networks and Applications
- Reservoir Engineering and Simulation Methods
- Distributed and Parallel Computing Systems
- Ferroelectric and Negative Capacitance Devices
- UAV Applications and Optimization
- Caching and Content Delivery
- Face and Expression Recognition
- Image and Video Quality Assessment
- Robotics and Sensor-Based Localization
- Advanced Vision and Imaging
- Distributed Control Multi-Agent Systems
- Domain Adaptation and Few-Shot Learning
- Parallel Computing and Optimization Techniques
- Brain Tumor Detection and Classification
University of Aizu
2021-2025
Federated learning has attracted much research attention due to its privacy protection in distributed machine learning. However, existing work of federated learning mainly focuses on Convolutional Neural Network (CNN), which cannot efficiently handle graph data that are popular in many applications. Graph Convolutional Network (GCN) has been proposed as one of the most promising techniques for graph learning, but its federated setting has been seldom explored. In this article, we propose FedGraph for federated graph learning among multiple computing clients, each of which holds a subgraph....
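As a hedged illustration of the setting described above, the sketch below trains a one-layer linear GCN federatedly with FedAvg-style weight averaging over clients that each hold a private subgraph. The class and function names are illustrative assumptions, not FedGraph's actual API.

```python
# Minimal federated GCN sketch (FedAvg-style aggregation), illustrative only.
# Each client holds a private subgraph and runs one local gradient step on a
# linear GCN layer; the server averages the resulting weights every round.
import numpy as np

rng = np.random.default_rng(0)

def normalize_adj(a):
    """Symmetrically normalize an adjacency matrix with self-loops."""
    a = a + np.eye(a.shape[0])
    d_inv_sqrt = np.diag(1.0 / np.sqrt(a.sum(axis=1)))
    return d_inv_sqrt @ a @ d_inv_sqrt

class Client:
    def __init__(self, num_nodes, in_dim, out_dim):
        # Random private subgraph: adjacency, node features, node targets.
        a = (rng.random((num_nodes, num_nodes)) < 0.2).astype(float)
        self.a_hat = normalize_adj(np.maximum(a, a.T))
        self.x = rng.normal(size=(num_nodes, in_dim))
        self.y = rng.normal(size=(num_nodes, out_dim))

    def local_step(self, w, lr=0.05):
        """One gradient step of H = A_hat X W with mean-squared-error loss."""
        agg = self.a_hat @ self.x              # neighborhood aggregation
        pred = agg @ w
        grad = 2.0 * agg.T @ (pred - self.y) / self.y.size
        return w - lr * grad

def fed_round(server_w, clients):
    """One federated round: broadcast, local training, average (FedAvg)."""
    local_weights = [c.local_step(server_w.copy()) for c in clients]
    return np.mean(local_weights, axis=0)

in_dim, out_dim = 8, 2
clients = [Client(num_nodes=30, in_dim=in_dim, out_dim=out_dim) for _ in range(4)]
w = rng.normal(scale=0.1, size=(in_dim, out_dim))
for _ in range(10):
    w = fed_round(w, clients)
print("trained weight norm:", np.linalg.norm(w))
```

Only model weights leave each client in this sketch; the raw subgraphs stay local, which is the privacy property the abstract refers to.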
Transformers have dominated the field of natural language processing, attributed to their capability to handle sequential input data. There is a surge of work on computational and networking optimizations aimed at improving the training efficiency of Transformers. However, transformer inference, the cornerstone of myriad AI services, remains relatively underexplored. With the challenge of variable-length inputs, conventional methods adopt padding schemes, resulting in computational waste. Moreover, works on inference often overlook...
As a rising star of social apps, short video apps, e.g., TikTok, have attracted a large number of mobile users by providing fresh contents that highly match their watching preferences. Meanwhile, the booming growth of these apps imposes new technical challenges on the existing computation and communication infrastructure. Traditional solutions maintain all videos in the cloud and stream them to users via content delivery networks or the Internet. However, they incur huge network traffic and long delay that seriously affects users’ experiences....
Transformer has dominated the field of natural language processing because of its strong capability in learning from sequential input data. In recent years, various computing and networking optimizations have been proposed for improving transformer training efficiency. However, transformer inference, as the core of many AI services, has been seldom studied. A key challenge of inference is the variable-length input. In order to align these inputs, existing works adopt batching schemes by padding zeros, which unfortunately introduces...
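To make the zero-padding overhead concrete, here is a small illustrative calculation (not tied to the paper's actual system) comparing padding an entire batch to the longest request versus grouping requests into length-sorted micro-batches; the lengths and bucket size are made-up values.

```python
# Illustration only: computation wasted by zero-padding variable-length
# transformer inference requests, versus length-sorted bucketing.
import random

random.seed(0)
lengths = [random.randint(8, 512) for _ in range(64)]   # token counts per request

def padded_tokens(batch):
    """Tokens actually processed when every sequence is padded to the longest."""
    return len(batch) * max(batch)

useful = sum(lengths)

# Strategy 1: one big batch, pad everything to the global maximum length.
naive = padded_tokens(lengths)

# Strategy 2: sort by length and split into small buckets, padding per bucket.
bucket_size = 8
ordered = sorted(lengths)
buckets = [ordered[i:i + bucket_size] for i in range(0, len(ordered), bucket_size)]
bucketed = sum(padded_tokens(b) for b in buckets)

print(f"useful tokens        : {useful}")
print(f"naive padding        : {naive}  (waste {(naive - useful) / naive:.1%})")
print(f"length-sorted buckets: {bucketed}  (waste {(bucketed - useful) / bucketed:.1%})")
```

The gap between the two strategies is exactly the "waste" both abstracts point at: padded positions consume compute without contributing useful output.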
Cross-silo federated learning becomes popular in various fields due to its great promise in protecting training data. By carefully examining the interaction among distributed nodes, we find that existing work still suffers from security weakness and a network bottleneck during model synchronization. It has no protection on models, which also contain significant private information. In addition, much evidence has shown that synchronization over wide-area networks is slow, bottlenecking the whole training process. To fill this...
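As one hedged illustration of protecting models during synchronization, the toy sketch below uses pairwise additive masks that cancel when the server sums the updates. This is the generic secure-aggregation idea, not the protocol proposed in the paper, and the sizes and mask setup are assumptions.

```python
# Toy pairwise-masking sketch: the server only ever sees masked model updates,
# yet their sum equals the sum of the true updates (masks cancel pairwise).
import numpy as np

rng = np.random.default_rng(1)
n_clients, dim = 4, 6
true_updates = [rng.normal(size=dim) for _ in range(n_clients)]

# Every pair (i, j) with i < j agrees on a shared random mask m_ij.
pair_masks = {(i, j): rng.normal(size=dim)
              for i in range(n_clients) for j in range(i + 1, n_clients)}

def masked_update(i):
    """Client i adds masks shared with higher-indexed peers, subtracts the rest."""
    u = true_updates[i].copy()
    for (a, b), m in pair_masks.items():
        if a == i:
            u += m
        elif b == i:
            u -= m
    return u

server_view = [masked_update(i) for i in range(n_clients)]
aggregate = np.sum(server_view, axis=0)          # masks cancel in the sum
assert np.allclose(aggregate, np.sum(true_updates, axis=0))
print("aggregated update:", np.round(aggregate, 3))
```

Each individual update the server receives looks like noise, but the aggregate it needs for training is exact, which is the kind of model protection the abstract argues is missing.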
Distributed machine learning (DML) has shown great promise in accelerating model training on multiple GPUs. To increase GPU utilization, a common practice is to let multiple jobs share GPU clusters, where the most fundamental and critical challenge is how to efficiently schedule these jobs. However, existing works about DML job scheduling are constrained to settings with homogeneous GPUs. GPU heterogeneity is common in practice, but its influence has been seldom studied. Moreover, DML jobs have internal structures that contain parallelism potentials,...
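The sketch below shows the heterogeneity-aware scheduling problem in miniature: each job is greedily placed on the GPU that would finish it earliest. It is a baseline heuristic for illustration only, not the scheduler studied in the paper, and the speeds and workloads are made-up numbers.

```python
# Illustrative greedy scheduler for a heterogeneous GPU cluster: each DML job
# goes to the GPU that would finish it earliest. Baseline heuristic only.
gpus = {"gpu0": 2.0, "gpu1": 1.0, "gpu2": 1.0}        # relative training speed
free_at = {g: 0.0 for g in gpus}                      # time each GPU becomes free
jobs = [("job-a", 40.0), ("job-b", 10.0), ("job-c", 25.0),
        ("job-d", 60.0), ("job-e", 15.0)]             # (job id, work units)

schedule = []
# Placing longer jobs first usually balances heterogeneous machines better.
for job_id, work in sorted(jobs, key=lambda j: -j[1]):
    # Pick the GPU with the earliest estimated finish time for this job.
    best = min(gpus, key=lambda g: free_at[g] + work / gpus[g])
    start = free_at[best]
    finish = start + work / gpus[best]
    free_at[best] = finish
    schedule.append((job_id, best, round(start, 1), round(finish, 1)))

for row in schedule:
    print(row)
print("makespan:", max(free_at.values()))
```

Swapping the speed numbers shows why heterogeneity matters: the same job set yields a very different makespan when the scheduler is blind to per-GPU speed.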
Dynamic Graph Neural Network (DGNN) has shown a strong capability of learning dynamic graphs by exploiting both spatial and temporal features. Although DGNN has recently received considerable attention from the AI community and various models have been proposed, building a distributed system for efficient DGNN training is still challenging. It is well recognized that how to partition the dynamic graph and assign workloads to multiple GPUs plays a critical role in training acceleration. Existing works partition the dynamic graph into snapshots or temporal sequences, which only...
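As a hedged illustration of the snapshot-style partitioning mentioned above, the toy code below slices a dynamic graph into fixed time-window snapshots and assigns them to GPUs by a greedy load balance. It is not the partitioner proposed in the paper; the graph, window size, and load proxy are assumptions.

```python
# Toy snapshot-based partitioning for dynamic graph training: slice timestamped
# edges into time windows ("snapshots"), then assign snapshots to GPUs so the
# per-GPU edge load stays balanced. Illustration only.
import random
from collections import defaultdict

random.seed(0)
# A dynamic graph as timestamped edges: (src, dst, time).
edges = [(random.randrange(100), random.randrange(100), random.uniform(0, 10))
         for _ in range(2000)]

def to_snapshots(edges, window):
    """Group edges into fixed-length time windows."""
    snaps = defaultdict(list)
    for u, v, t in edges:
        snaps[int(t // window)].append((u, v))
    return [snaps[k] for k in sorted(snaps)]

def assign(snapshots, num_gpus):
    """Greedy balancing: each snapshot goes to the currently least-loaded GPU."""
    load = [0] * num_gpus
    placement = []
    for i, snap in enumerate(snapshots):
        g = min(range(num_gpus), key=lambda x: load[x])
        load[g] += len(snap)                 # edge count as a load proxy
        placement.append((i, g))
    return placement, load

snapshots = to_snapshots(edges, window=1.0)
placement, load = assign(snapshots, num_gpus=4)
print("snapshot -> GPU:", placement)
print("edges per GPU  :", load)
```

Partitioning purely along the time axis like this captures temporal locality but ignores spatial structure, which is the kind of limitation the abstract alludes to.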
Federated Learning (FL) has emerged as a promising learning approach for data distributed across edge devices. Existing research mainly focuses on single-job FL systems. However, in practical scenarios, multiple FL jobs are often submitted simultaneously. Simply applying single-job optimizations to multi-job systems results in sub-optimal system performance. Specifically, we find considerably low resource utilization on the client side due to device heterogeneity. In this paper, we exploit opportunities to improve system performance by...
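A small illustrative sketch of the client-to-job scheduling issue raised above: heterogeneous devices are interleaved across concurrent FL jobs so that no single job collects all the slow (or all the fast) clients. The device names, speeds, and policy are assumptions for illustration, not the paper's scheduler.

```python
# Illustrative device-to-job assignment for multi-job federated learning.
# A round's duration is dominated by the slowest selected client, so spreading
# fast and slow devices across jobs keeps both jobs' round times reasonable.
clients = {f"dev{i}": s for i, s in
           enumerate([4.0, 3.5, 3.0, 2.0, 1.5, 1.2, 1.0, 0.5])}   # relative speed
jobs = ["job-A", "job-B"]
per_job = 4                                    # clients needed per job per round

# Interleave clients sorted by speed between the two jobs.
ordered = sorted(clients, key=lambda c: -clients[c])
assignment = {j: [] for j in jobs}
for rank, dev in enumerate(ordered):
    assignment[jobs[rank % len(jobs)]].append(dev)

for job, devs in assignment.items():
    selected = devs[:per_job]
    round_time = max(1.0 / clients[d] for d in selected)   # slowest client dominates
    print(f"{job}: {selected}, estimated round time {round_time:.2f}")
```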
Giant models, characterized by their billions or even trillions of parameters, have demonstrated unprecedented capabilities in handling complex tasks on Artificial Intelligence (AI)-driven UAVs, such as disaster relief, aerial navigation, and manipulation. However, there is an open challenge about the mismatch between the massive computation and memory requirements of giant models and the limited resources of UAVs. Existing works either pose privacy concerns with offloading methods or compromise model accuracy...
Mixture-of-Experts (MoE) is an emerging technique for scaling large models with sparse activation. MoE models are typically trained in a distributed manner with an expert parallelism scheme, where the experts of each layer are placed across multiple GPUs. However, the default scheme suffers from a heavy network burden due to the all-to-all intermediate data exchange among GPUs before and after each expert run. Some existing works have proposed to reduce these exchanges by transferring expert loads, which, however, would decrease the parallelism level of expert execution and make computation...
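To make the all-to-all exchange concrete, the sketch below simulates expert-parallel token dispatch in a single process: each "GPU" routes its tokens to the GPU hosting the selected expert and receives the outputs back, which corresponds to the two all-to-all steps per MoE layer. It is an illustration under assumed sizes, not the communication stack described in the paper; real systems use collectives such as all-to-all in a distributed framework.

```python
# Single-process simulation of MoE expert parallelism: GPU g owns expert g, and
# each layer needs one all-to-all to send tokens to their routed expert's GPU
# and a second all-to-all to return the expert outputs. Illustration only.
import numpy as np

rng = np.random.default_rng(0)
num_gpus, tokens_per_gpu, dim = 4, 8, 16

experts = [rng.normal(scale=0.1, size=(dim, dim)) for _ in range(num_gpus)]
tokens = [rng.normal(size=(tokens_per_gpu, dim)) for _ in range(num_gpus)]
routes = [rng.integers(0, num_gpus, size=tokens_per_gpu) for _ in range(num_gpus)]  # top-1 gating, simulated

# --- first all-to-all: every GPU sends each token to the owner of its expert ---
inbox = [[] for _ in range(num_gpus)]      # inbox[e] = tokens routed to expert e
origin = [[] for _ in range(num_gpus)]     # remember where each token came from
for g in range(num_gpus):
    for i, e in enumerate(routes[g]):
        inbox[e].append(tokens[g][i])
        origin[e].append((g, i))

# --- local expert computation on each GPU ---
outputs = [np.stack(batch) @ experts[e] if batch else np.empty((0, dim))
           for e, batch in enumerate(inbox)]

# --- second all-to-all: results go back to the GPU that owns each token ---
result = [np.zeros((tokens_per_gpu, dim)) for _ in range(num_gpus)]
for e in range(num_gpus):
    for out_row, (g, i) in zip(outputs[e], origin[e]):
        result[g][i] = out_row

bytes_moved = sum(len(b) for b in inbox) * dim * 8 * 2   # dispatch + return, fp64
print("simulated all-to-all traffic per layer:", bytes_moved, "bytes")
```

Every token crosses the interconnect twice per MoE layer in this scheme, which is why the abstract calls the default expert parallelism a heavy network burden.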
Connected autonomous vehicles (CAVs) are promising to improve road safety, thanks to various on-board sensors, such as LiDAR, radars, and stereo cameras. However, their perception view could be significantly limited due to occlusions, extreme weather, and far-away objects. To address these challenges, in this paper, we propose an efficient edge-assisted sharing scheme, which enables CAVs to exchange the information about their sensed environment for driving safety. We formulate an online optimization problem, with the objective of...
The advent of deep learning applications for data collection has elicited concerns pertaining to privacy, particularly regarding the potential data exposure through various vulnerabilities, such as membership inference attacks. In response to these concerns, several machine unlearning techniques have been proposed, which can effectively eliminate specific data from a trained model. However, it is important to note that existing methods primarily concentrate on Euclidean data, leaving non-Euclidean...
Federated learning has attracted much research attention due to its privacy protection in distributed machine learning. However, existing work of federated learning mainly focuses on Convolutional Neural Network (CNN), which cannot efficiently handle graph data that are popular in many applications. Graph Convolutional Network (GCN) has been proposed as one of the most promising techniques for graph learning, but its federated setting has been seldom explored. In this paper, we propose FedGraph for federated graph learning among multiple computing clients, each of which holds a subgraph. FedGraph provides...
In the past several years, pervasive surveillance cameras have generated massive video records continually, which can be used for many applications (e.g., tracking and object detection). Machine Learning (ML), especially Deep Learning, is a powerful method extensively used for video analytics. Typically, cameras do not have enough computational power to analyze videos locally. A commonly adopted strategy is gathering videos from cameras and processing them in the cloud or an edge server. However, the limited bandwidth between cameras and the server results in inefficiency of existing...
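As a hedged illustration of coping with the limited camera-to-server bandwidth, the sketch below drops near-duplicate frames at the camera with a simple inter-frame difference filter before uploading. The stream, threshold, and frame sizes are assumed values; this is not the paper's actual pipeline, just a common filtering idea.

```python
# Illustration only: filter near-duplicate frames at the camera before
# uploading them for server-side analytics, reducing traffic on the limited
# camera-to-server link.
import numpy as np

rng = np.random.default_rng(0)

def synthetic_stream(n_frames, h=120, w=160):
    """A fake camera stream: a static scene with an occasional moving region."""
    base = rng.integers(0, 256, size=(h, w), dtype=np.uint8)
    for i in range(n_frames):
        frame = base.copy()
        if i % 25 == 0:                        # something changes every 25 frames
            frame[40:80, 60:100] = rng.integers(0, 256, size=(40, 40))
        yield frame

def should_upload(prev, cur, threshold=2.0):
    """Upload only if the mean absolute pixel difference exceeds the threshold."""
    diff = np.abs(cur.astype(np.int16) - prev.astype(np.int16)).mean()
    return diff > threshold

prev, uploaded, total = None, 0, 0
for frame in synthetic_stream(500):
    total += 1
    if prev is None or should_upload(prev, frame):
        uploaded += 1                          # the frame would be sent to the edge/cloud here
        prev = frame
print(f"uploaded {uploaded}/{total} frames ({uploaded / total:.1%} of raw traffic)")
```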