- Distributed Systems and Fault Tolerance
- Parallel Computing and Optimization Techniques
- Complex Network Analysis Techniques
- IoT and Edge/Fog Computing
- Privacy-Preserving Technologies in Data
- Age of Information Optimization
- Distributed and Parallel Computing Systems
- Opportunistic and Delay-Tolerant Networks
- Mobile Crowdsensing and Crowdsourcing
- Cloud Computing and Resource Management
- Blockchain Technology Applications and Security
- Caching and Content Delivery
- Advanced Neural Network Applications
- Domain Adaptation and Few-Shot Learning
- Software System Performance and Reliability
- Opinion Dynamics and Social Influence
- Human Mobility and Location-Based Analysis
- Advanced Data Storage Technologies
- Real-Time Systems Scheduling
- Wireless Communication Security Techniques
- Energy Harvesting in Wireless Networks
- Adversarial Robustness in Machine Learning
- Vehicular Ad Hoc Networks (VANETs)
- Context-Aware Activity Recognition Systems
- Auction Theory and Applications
Tusimple (United States), 2022
Nanyang Technological University, 2010-2018
Dallas County, 2013
To enable the large-scale and efficient deployment of Artificial Intelligence (AI), the confluence of AI and Edge Computing has given rise to Edge Intelligence, which leverages the computation and communication capabilities of end devices and edge servers to process data closer to where it is produced. One of the enabling technologies is a privacy-preserving machine learning paradigm known as Federated Learning (FL), which enables data owners to conduct model training without having to transmit their raw data to third-party servers. However, the FL network...
Amid growing concerns over data privacy, Federated Learning (FL) has emerged as a promising privacy-preserving distributed machine learning paradigm. Given that the FL network is expected to be implemented at scale, several studies have proposed system architectures aimed at improving scalability and efficiency. Specifically, Hierarchical FL (HFL) utilizes cluster heads, e.g., base stations, for the intermediate aggregation and relay of model parameters. Serverless FL has also been proposed recently, in which data owners, i.e.,...
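As a rough illustration of the intermediate aggregation that cluster heads perform in HFL, the sketch below (all names and numbers are hypothetical, in the style of FedAvg) averages client parameters within each cluster, weighted by local sample counts, and then averages the cluster models globally:

```python
# Minimal sketch of two-level (hierarchical) FedAvg-style aggregation.
# Models are plain lists of floats; all values are illustrative.

def weighted_average(models, weights):
    """Average parameter vectors, weighted by each client's sample count."""
    total = sum(weights)
    dim = len(models[0])
    return [sum(w * m[i] for m, w in zip(models, weights)) / total
            for i in range(dim)]

def hierarchical_aggregate(clusters):
    """clusters: list of (client_models, client_sample_counts), one per
    cluster head. Each head aggregates its own clients; the server then
    aggregates the cluster models weighted by total cluster samples."""
    cluster_models, cluster_sizes = [], []
    for models, counts in clusters:
        cluster_models.append(weighted_average(models, counts))
        cluster_sizes.append(sum(counts))
    return weighted_average(cluster_models, cluster_sizes)

# Two clusters, each holding 2-parameter models.
clusters = [
    ([[1.0, 0.0], [3.0, 2.0]], [10, 10]),  # head 1 aggregates to [2.0, 1.0]
    ([[5.0, 4.0]], [20]),                  # head 2 relays [5.0, 4.0]
]
global_model = hierarchical_aggregate(clusters)
print(global_model)  # -> [3.5, 2.5]
```

The same `weighted_average` is reused at both levels, which is exactly what makes the cluster-head tier a drop-in relay between clients and the server.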
For future Internet of Things (IoT) systems, data-driven and dynamic spectrum-sharing schemes can significantly improve spectrum utilization efficiency. However, the conventional centralized architecture of such IoT systems is often considered nontransparent, costly, and vulnerable to potential attacks and single points of failure. To address the aforementioned issues, a blockchain-based scheme is proposed and investigated in this work, which aims at enhancing the system by providing desirable features, such as...
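The transparency property such a scheme relies on can be hinted at with a toy hash-chained ledger of spectrum-access records (field names and values below are made up for illustration): each block commits to the previous block's hash, so altering any earlier record invalidates every later link.

```python
import hashlib
import json

def block_hash(block):
    """Deterministic hash of a block's canonical JSON form."""
    return hashlib.sha256(json.dumps(block, sort_keys=True).encode()).hexdigest()

def append_record(chain, record):
    """Append a spectrum-access record, linking it to the previous block."""
    prev = block_hash(chain[-1]) if chain else "0" * 64
    chain.append({"prev": prev, "record": record})
    return chain

def verify(chain):
    """Check that every block still matches its successor's back-link."""
    for i in range(1, len(chain)):
        if chain[i]["prev"] != block_hash(chain[i - 1]):
            return False
    return True

chain = []
append_record(chain, {"device": "iot-17", "channel": 3, "slot": 42})
append_record(chain, {"device": "iot-08", "channel": 1, "slot": 43})
print(verify(chain))               # True: untampered chain
chain[0]["record"]["channel"] = 9  # tamper with an earlier grant
print(verify(chain))               # False: later back-links no longer match
```

A real deployment would of course add consensus and smart contracts; this only shows the tamper-evidence that removes the need to trust a single central record keeper.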
Internet of Things (IoT) technology enables various physical devices to collect, process, and exchange information. Market-oriented models have become important for IoT systems to efficiently utilize information, as network nodes operate in a highly distributed and autonomous manner. In this work, we propose a three-player game-theoretic market model for information trading, considering direct and indirect externalities among participants. In the model, an information service provider collects and processes data and then delivers the processed...
Deep learning has revolutionized computer vision and other fields since its big bang in 2012. However, it is challenging to deploy Deep Neural Networks (DNNs) in real-world applications due to their high computational complexity. Binary Neural Networks (BNNs) dramatically reduce this complexity by replacing most arithmetic operations with bitwise operations. Existing implementations of BNNs have focused on GPUs or FPGAs, using the conventional image-to-column method, which does not perform well for binary convolution...
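The bitwise substitution BNNs rely on can be shown in a few lines. With weights and activations constrained to ±1 and encoded as bits (+1 as 1, -1 as 0), an n-element dot product reduces to an XNOR followed by a popcount; the sketch below is a scalar illustration of that identity, not any particular implementation:

```python
# XNOR-popcount identity behind binary convolution: for ±1 vectors packed
# into n-bit integers, dot(a, b) = 2 * popcount(xnor(a, b)) - n.

def pack(signs):
    """Pack a list of +1/-1 values into an integer bit mask (+1 -> bit 1)."""
    bits = 0
    for i, s in enumerate(signs):
        if s == 1:
            bits |= 1 << i
    return bits

def binary_dot(a_bits, b_bits, n):
    """Dot product of two packed ±1 vectors using only bitwise ops."""
    xnor = ~(a_bits ^ b_bits) & ((1 << n) - 1)  # 1 where signs agree
    return 2 * bin(xnor).count("1") - n

a = [1, -1, 1, 1]
b = [1, 1, -1, 1]
print(binary_dot(pack(a), pack(b), len(a)))  # 0, same as sum(x*y for x, y ...)
```

A convolution then becomes many such dot products, which is why replacing multiply-accumulate with XNOR-popcount cuts the arithmetic cost so sharply.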
In cloud computing, services can be allocated to users upon request on an on-demand basis. Heterogeneous service providers may join the system to serve various types of users. Cloud services can be complementary or substitutable. For complementary services, users requesting a bundle, e.g., CPU and storage, gain a higher benefit than from requesting them alone. Substitutable services have similar functionalities for users, e.g., different database services, so obtaining one can replace another. Furthermore, users also influence each other because of externalities,...
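The two service relationships can be made concrete with a toy valuation function (all service names and numbers below are hypothetical): a bundle of complements is worth more than its parts, while a bundle of substitutes is worth less than the sum of its parts.

```python
# Toy user valuations over cloud-service bundles (illustrative only).
v = {
    frozenset(["cpu"]): 3,
    frozenset(["storage"]): 2,
    frozenset(["cpu", "storage"]): 8,  # complements: bundle beats 3 + 2
    frozenset(["db_a"]): 4,
    frozenset(["db_b"]): 4,
    frozenset(["db_a", "db_b"]): 5,    # substitutes: second adds little
}

def is_complementary(x, y):
    """True if the bundle is worth strictly more than the singletons."""
    return v[frozenset([x, y])] > v[frozenset([x])] + v[frozenset([y])]

print(is_complementary("cpu", "storage"))  # True
print(is_complementary("db_a", "db_b"))    # False: substitutable
```

Externalities would add a further term that depends on what other users hold, which is what makes allocation in such markets game-theoretic rather than per-user.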
Simulations provide a flexible and valuable method for studying the behaviors of information propagation over complex social networks. High Performance Computing (HPC) is a technology that allows the implementation of efficient algorithms on powerful new hardware resources. With the increased computing resource usage of large-scale network-based simulations, it is therefore attractive to apply emerging HPC techniques to improve simulation performance. This paper describes optimized strategies and algorithmic adaptations...
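For context, a minimal serial baseline of such a propagation simulation is sketched below using the independent cascade model (the graph and probability are made-up toy inputs); this is the kind of kernel the HPC optimizations would accelerate:

```python
import random

def independent_cascade(graph, seeds, p, rng):
    """graph: node -> list of neighbors. Each newly activated node gets one
    chance to activate each inactive neighbor with probability p."""
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        next_frontier = []
        for u in frontier:
            for w in graph.get(u, []):
                if w not in active and rng.random() < p:
                    active.add(w)
                    next_frontier.append(w)
        frontier = next_frontier
    return active

graph = {0: [1, 2], 1: [3], 2: [3], 3: [4]}
spread = independent_cascade(graph, seeds=[0], p=1.0, rng=random.Random(1))
print(sorted(spread))  # with p=1.0 every reachable node activates: [0, 1, 2, 3, 4]
```

Per-node activation trials are independent within a frontier, which is precisely the data parallelism that HPC adaptations of such simulations exploit.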
Distributed artificial intelligence (AI) is becoming an efficient approach to fulfilling the high and diverse requirements of future vehicular networks. However, the distributed tasks generated by vehicles often require multi-dimensional resources. A customized resource provisioning scheme is required to improve the utilization of multi-dimensional resources. In this work, a slice selection-based online offloading (SSOO) algorithm is proposed. First, the response time and energy consumption are reduced by processing tasks locally on vehicles. Then,...
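The local-versus-offload trade-off underlying such schemes can be sketched as a back-of-the-envelope comparison (all parameter values below are hypothetical, not the SSOO algorithm itself): local execution pays compute time and compute energy, while offloading pays transmission time plus edge compute time, with the vehicle only spending transmit energy.

```python
def local_cost(cycles, f_local_hz, power_w):
    """Response time (s) and energy (J) of on-vehicle execution."""
    t = cycles / f_local_hz
    return t, power_w * t

def offload_cost(data_bits, rate_bps, cycles, f_edge_hz, tx_power_w):
    """Response time and vehicle-side energy when offloading to an edge
    server (result download assumed negligible)."""
    t_tx = data_bits / rate_bps
    t = t_tx + cycles / f_edge_hz
    return t, tx_power_w * t_tx

cycles, data = 2e9, 8e6
t_l, e_l = local_cost(cycles, f_local_hz=1e9, power_w=4.0)
t_o, e_o = offload_cost(data, rate_bps=20e6, cycles=cycles,
                        f_edge_hz=10e9, tx_power_w=1.0)
print("local:   %.2f s, %.2f J" % (t_l, e_l))  # 2.00 s, 8.00 J
print("offload: %.2f s, %.2f J" % (t_o, e_o))  # 0.60 s, 0.40 J
```

With these toy numbers offloading wins on both metrics; an online algorithm must make this decision per task as rates and server load vary.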
A new approach, named bilateral motion data fusion, is proposed for the analysis of movement symmetry; it takes advantage of the cross-information between both sides of the body and processes the unilateral data at the same time. This is accomplished using canonical correlation analysis and joint independent component analysis. It should be noted that human movements fall into many categories, which cannot be enumerated one by one. Therefore, the gait rhythm fluctuations of healthy subjects and patients with neurodegenerative diseases were employed as...
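The full canonical-correlation and joint-ICA pipeline is beyond a snippet, but the "cross-information between both sides" idea starts from how strongly the left- and right-side series co-vary. A minimal sketch, assuming hypothetical stride-interval data, is a plain Pearson correlation between the two sides:

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Hypothetical stride intervals (seconds); a symmetric gait keeps both
# sides fluctuating together, so the correlation stays near 1.
left = [1.02, 1.05, 0.98, 1.01, 1.04]
right = [1.03, 1.06, 0.97, 1.00, 1.05]
print(round(pearson(left, right), 3))
```

CCA generalizes this by finding the linear combinations of multichannel left and right data that maximize exactly such a correlation.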
In mobile and edge intelligence systems, federated learning (FL) enables local data training and model sharing without obtaining the actual data from users, who are the data owners. Data processing is performed at the user side, with only the trained gradients passed to an aggregator, i.e., a server. The server continually trains and updates the corresponding models by collecting these gradients. The updated models are then delivered back to users for improved results. Despite the advantages of FL in preserving privacy, the training process consumes a considerable amount of energy...
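Where that energy goes can be sketched with a rough per-round accounting for one participant (every number below is a hypothetical placeholder, not a measurement): local training energy scales with the training workload, and gradient upload adds a communication term.

```python
def round_energy(train_flops, flops_per_joule, grad_bits, rate_bps, tx_power_w):
    """Per-round device energy: local compute + gradient upload (J)."""
    e_compute = train_flops / flops_per_joule
    e_upload = tx_power_w * (grad_bits / rate_bps)
    return e_compute + e_upload

# Illustrative: 1M 32-bit gradients uploaded at 10 Mbit/s after local training.
e = round_energy(train_flops=5e11, flops_per_joule=1e10,
                 grad_bits=32 * 1e6, rate_bps=10e6, tx_power_w=1.0)
print(round(e, 1))  # 53.2 J: 50.0 J compute + 3.2 J upload
```

Even in this toy accounting local computation dominates, which is why energy-aware FL work targets both the training schedule and gradient compression.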
General Purpose Graphics Processing Units (GPGPUs) have been used in high performance computing platforms to accelerate the execution of scientific applications such as simulations. With the increased resources required for large-scale network simulation, one GPU device may not have enough memory and computation capacity. It is therefore necessary to enhance system scalability by introducing multiple GPU devices, and it is also attractive to investigate Multi-GPU techniques. This paper describes the simulation of information propagation on...
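A central Multi-GPU concern is how the network is split across devices, since every edge crossing a device boundary implies inter-GPU communication. The toy comparison below (graph and strategies are illustrative, not the paper's method) counts cross-device edges under two naive placements:

```python
def round_robin(nodes, k):
    """Assign node n to device n mod k."""
    return {n: n % k for n in nodes}

def blocked(nodes, k):
    """Assign contiguous node ranges to each of k devices."""
    n = len(nodes)
    return {v: v * k // n for v in nodes}

def cut(edges, place):
    """Number of edges whose endpoints land on different devices."""
    return sum(1 for u, v in edges if place[u] != place[v])

nodes = list(range(6))
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (0, 3)]
print(cut(edges, round_robin(nodes, 2)))  # 6: every edge crosses devices
print(cut(edges, blocked(nodes, 2)))      # 2: far less inter-GPU traffic
```

For this mostly chain-like graph, blocked placement cuts communication by 3x, which is why partitioning strategy matters as much as raw device count.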
With the significant advances in AI technology, intelligent robotic systems have achieved remarkable development and profound effects. To enable massive data transmission in an efficient and reliable way, both high performance and high reliability should be taken into account in system design. However, the conventional communication middleware used in the majority of autonomous systems is based on socket-based methods, which always lead to high latency. Moreover, some sophisticated middleware utilizes shared memory upon ring...
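The ring-buffer structure such shared-memory middleware is built on can be sketched in plain Python (the class and messages below are a didactic toy, not any real middleware): a fixed-capacity buffer with wrapping read/write indices, where a full buffer forces the producer to wait or drop.

```python
class RingBuffer:
    """Minimal single-producer / single-consumer ring buffer."""

    def __init__(self, capacity):
        self.buf = [None] * capacity
        self.head = 0   # next slot to read
        self.tail = 0   # next slot to write
        self.size = 0

    def push(self, item):
        if self.size == len(self.buf):
            return False  # full: back-pressure on the producer
        self.buf[self.tail] = item
        self.tail = (self.tail + 1) % len(self.buf)
        self.size += 1
        return True

    def pop(self):
        if self.size == 0:
            return None   # empty: consumer must wait
        item = self.buf[self.head]
        self.head = (self.head + 1) % len(self.buf)
        self.size -= 1
        return item

rb = RingBuffer(2)
rb.push("scan-1")
rb.push("scan-2")
print(rb.push("scan-3"))  # False: buffer full
print(rb.pop())           # scan-1 (FIFO order)
```

Real middleware places such a buffer in shared memory so messages move between processes without socket serialization and copying.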
In many large-scale scientific applications, there may be a compute-intensive kernel that largely determines the overall performance of the application. Sometimes algorithmic variations are available, and a benefit can then be gained by choosing the optimal variant at runtime. However, it is sometimes difficult to choose the most efficient variant, as algorithms have varying performance under different execution conditions. This paper shows how to construct a set of models to explore and analyze the bottleneck of an application. Furthermore, based on these models, theoretical...
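Runtime variant selection from such models can be sketched as follows (the cost coefficients are hypothetical stand-ins for values one would fit from profiling runs): each variant gets an analytic cost model, and the dispatcher picks the predicted-cheaper one for the input at hand.

```python
import math

def cost_variant_a(n):
    """Model for a quadratic kernel with a tiny constant factor."""
    return 2e-9 * n * n

def cost_variant_b(n):
    """Model for an n log n kernel with a larger constant factor."""
    return 5e-7 * n * math.log2(max(n, 2))

def pick_variant(n):
    """Choose whichever variant the models predict to be cheaper."""
    return "A" if cost_variant_a(n) <= cost_variant_b(n) else "B"

for n in (100, 100_000):
    print(n, pick_variant(n))  # small input -> A, large input -> B
```

The crossover point between the two models is exactly the kind of threshold the paper's theoretical analysis would derive.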
Simulation has become an important method that is widely used to study propagation behaviors during the process of viral advertisement diffusion. With the increased computing and memory resources required for large-scale network processing, General Purpose Graphics Processing Units (GPGPUs) have been used in high performance platforms to accelerate simulation performance. In this paper, we show optimized strategies for simulating diffusion on a Multi-GPU system. Using our proposed strategies, we examine the spread...
The fusion of the Internet of Things (IoT) with Sixth-Generation (6G) technology has significant potential to revolutionize the IoT landscape. With the ultra-reliable and low-latency communication capabilities of 6G, 6G-IoT networks can transmit high-quality and diverse data to enhance edge learning. Artificial Intelligence-Generated Content (AIGC) harnesses advanced AI algorithms to automatically generate various types of content. The emergence of AIGC integrates with 6G-IoT networks, facilitating the real-time provision of customized...
Multi-core architectures are widely used to execute large-scale scientific applications with shared-memory parallelism. The locking policy of critical sections is used to protect the data state when multiple threads access it simultaneously. Factors such as the portion of the critical section within the parallel code and the degree of concurrency may affect multi-threaded processing, which can then become a bottleneck constraining overall performance. This paper discusses the impact of locking on performance, both in the task pattern and the data pattern. According...
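A minimal demonstration of a locked critical section (the workload is a toy shared counter, not the paper's benchmark) shows the policy in action: the lock keeps the shared state consistent, at the cost of serializing every thread through that section, which is why the locked portion bounds the achievable speedup.

```python
import threading

lock = threading.Lock()
counter = 0

def worker(iterations):
    """Increment the shared counter; the lock serializes the update."""
    global counter
    for _ in range(iterations):
        with lock:          # critical section protecting shared state
            counter += 1

threads = [threading.Thread(target=worker, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)  # 40000: no lost updates with the lock held
```

Shrinking the locked region (e.g., batching increments into thread-local counters merged at the end) is the standard way to push the bottleneck back out of the critical section.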
There has been a remarkable increase in the speed of AI development over the past few years. Artificial intelligence and deep learning techniques are blooming and expanding in all forms to every sector possible. With emerging intelligent autonomous navigation systems, both memory allocation and data movement are becoming the main bottlenecks of inter-process communication procedures, especially when supporting various types of messages between multiple programming languages. To reduce this significant cost, we propose a novel...
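The data-movement cost being targeted can be contrasted with a shared-memory alternative in a short sketch (using Python's standard `multiprocessing.shared_memory` module, with a made-up message; the reader side would normally live in another process): the reader attaches to the named block rather than receiving a serialized copy over a socket.

```python
from multiprocessing import shared_memory

payload = b"pose:x=1.5,y=0.2,theta=0.78"  # hypothetical robot message

# Writer side: create a named block and copy the message in once.
shm = shared_memory.SharedMemory(create=True, size=len(payload))
shm.buf[:len(payload)] = payload

# Reader side: attach by name; no socket send or deserialization occurs.
reader = shared_memory.SharedMemory(name=shm.name)
message = bytes(reader.buf[:len(payload)])
print(message.decode())

reader.close()
shm.close()
shm.unlink()  # release the block once all readers are done
```

Because the block is addressed by name, processes written in different languages can map the same memory, which is what makes this attractive for cross-language message passing.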
GPUs have been widely adopted to speed up various throughput-oriented applications running on HPC platforms, where typically a number of tasks share a GPU to maximize its utilization. To facilitate such sharing, vendors provide tools that allow multiple processes to use GPUs concurrently. For example, Nvidia provides MPS (Multi-Process Service) to manage all processes and achieve high throughput by fully exploiting hardware resources. However, such a tool leads to an undesired single point of failure for the processes,...