NFDI4DS | UHH-SEMS - Publication Details

Gangyong Jia

ORCID: 0000-0002-0284-1685

About

Contact & Profiles

A5102908258

Research Areas

Parallel Computing and Optimization Techniques
Advanced Data Storage Technologies
Cloud Computing and Resource Management
Interconnection Networks and Systems
Distributed and Parallel Computing Systems
Low-power high-performance VLSI design
Embedded Systems Design Techniques
South Asian Studies and Diaspora
Asian Geopolitics and Ethnography
Privacy-Preserving Technologies in Data
Infrastructure Maintenance and Monitoring
Machine Learning in Healthcare
Video Surveillance and Tracking Methods
Advanced Neural Network Applications
Recommender Systems and Techniques
Ferroelectric and Negative Capacitance Devices
IoT and Edge/Fog Computing

Hangzhou Dianzi University
2013-2025

Ministry of Education of the People's Republic of China
2015

University of Science and Technology of China
2011-2014

Suzhou Research Institute
2011-2012

Dynamic Adaptive Replacement Policy in Shared Last-Level Cache of DRAM/PCM Hybrid Memory for Big Data Storage

OPENALEX - Publications

Gangyong Jia Guangjie Han Jinfang Jiang Li Liu

The increasing demand on main memory capacity is one of the big data challenges. Dynamic random access memory (DRAM) is not the best choice for main memory, due to its high power consumption and low density. However, nonvolatile memory, such as phase-change memory (PCM), is an additional choice because of its high-density characteristic. Nevertheless, its latency and limited write endurance have so far prevented PCM from replacing DRAM. Therefore, hybrid memory, which combines both DRAM and PCM, has become a good alternative to traditional memory. Both...

10.1109/tii.2016.2645941 article EN IEEE Transactions on Industrial Informatics 2016-12-29

Vertical federated learning based on data subset representation for healthcare application

Yukun Shi Jilin Zhang Meiting Xue Yan Zeng Gangyong Jia and 2 more

10.1016/j.cmpb.2025.108623 article EN Computer Methods and Programs in Biomedicine 2025-02-12

Dynamic Resource Partitioning for Heterogeneous Multi-Core-Based Cloud Computing in Smart Cities

Gangyong Jia Guangjie Han Jinfang Jiang Ning Sun Kun Wang

As smart cities emerge to offer more comfortable urban spaces, services such as health, transportation, and so on need to be promoted. In addition, cloud computing provides flexible allocation, migration, and better security isolation; therefore, it is the infrastructure for smart cities. Single instruction-set architecture (ISA) heterogeneous multi-core processors have higher performance per watt than their symmetric counterparts and are popular in current processors. Heterogeneous multi-core-based cloud computing, which integrates a few fast...

10.1109/access.2015.2507576 article EN cc-by-nc-nd IEEE Access 2015-12-10

PARS: A scheduling of periodically active rank to optimize power efficiency for main memory

Gangyong Jia Guangjie Han Jinfang Jiang Joel J. P. C. Rodrigues

10.1016/j.jnca.2015.08.001 article EN Journal of Network and Computer Applications 2015-08-15

Task Scheduling Strategy Based on Resource Constraint in Edge Computing System

Qing Ren Huanle Rao Gangyong Jia Youqing Xu Wei Wang and 1 more

10.1109/icps59941.2024.10639950 article EN 2024-05-12

Memory Affinity: Balancing Performance, Power, Thermal and Fairness for Multi-core Systems

Gangyong Jia Xi Li Chao Wang Xuehai Zhou Zongwei Zhu

Main memory is expected to grow significantly in both speed and capacity, for it is a major shared resource among the cores of a multi-core system, which will lead to increasing power consumption. Therefore, it is critical to address the power issue without seriously decreasing the performance of the memory subsystem. In this paper, we first propose memory affinity, which retains active and low-power ranks as long as possible to avoid frequent switching between states, and then present memory affinity aware scheduling (MAS) to balance performance, power, thermal, and fairness for multi-core systems....

10.1109/cluster.2012.33 article EN 2012-09-01

Coordinate page allocation and thread group for improving main memory power efficiency

Gangyong Jia Xi Li Jian Wan Liang Shi Chao Wang

Main memory is responsible for a large and increasing fraction of the energy consumed by multi-core systems. Therefore, it is critical to address the power issue in the memory subsystem. In this paper, we present a solution to improve memory power efficiency through coordinating page allocation and thread group scheduling (CAS). All threads are partitioned into different thread groups; with the proposed page allocation, threads in the same group occupy the same rank. By adjusting the default Linux CFS, we implement thread group scheduling. The CAS alternates active partial periodically...

10.1145/2525526.2525851 article EN 2013-10-30

Coordinate Channel-Aware Page Mapping Policy and Memory Scheduling for Reducing Memory Interference Among Multimedia Applications

Gangyong Jia Guangjie Han Aohan Li Jaime Lloret

In a modern multicore system, memory is shared among more and more concurrently running multimedia applications. Therefore, memory contention and interference are serious, inducing significant system performance degradation, different degradation for each thread, unfairness in resource sharing, priority inversion, and even starvation. In this paper, we propose an approach coordinating channel-aware page mapping policy and memory scheduling (CCPS) to reduce inter-multimedia-application interference in the memory system. The idea is to map the data of different threads...

10.1109/jsyst.2015.2430522 article EN IEEE Systems Journal 2015-06-03

Dynamic Time-slice Scaling for Addressing OS Problems Incurred by Main Memory DVFS in Intelligent System

Gangyong Jia Guangjie Han Jinfang Jiang Aohan Li

10.1007/s11036-015-0587-2 article EN Mobile Networks and Applications 2015-03-10

Phase Detection for Loop-Based Programs on Multicore Architectures

Chao Wang Xi Li Dong Dai Gangyong Jia Xuehai Zhou

Phase detection and behavior analysis have been major concerns for improving performance as well as system throughput. However, for distributed acceleration engines, execution among different phases is much more difficult to analyze, especially for loop-based programs. With respect to tasks in loop iterations, how to efficiently detect phases belonging to the same iteration or even across iterations poses a significant challenge. In this paper, we propose a phase detection method for loop-based programs on multiprocessor...

10.1109/cluster.2012.73 article EN 2012-09-01

Combine thread with memory scheduling for maximizing performance in multi-core systems

Gangyong Jia Guangjie Han Liang Shi Jian Wan Dong Dai

The growing gap between microprocessor speed and DRAM speed is a major problem that computer designers are facing. In order to narrow the gap, it is necessary to improve DRAM's throughput. Moreover, on multi-core platforms, memory shared by all cores usually suffers from the memory contention and interference problem, which can cause serious performance degradation and unfairness among parallel running threads. To address these problems, this paper proposes techniques to take advantage of both partitioning cores, threads...

10.1109/padsw.2014.7097821 article EN 2014-12-01

Share memory aware scheduler

Xi Li Gangyong Jia Yun Chen Zongwei Zhu Xuehai Zhou

Optimizing system performance through scheduling has received a lot of attention. However, none of the existing approaches can balance performance improvement and a fair share of CPU time among threads. In this paper, we present the share memory aware scheduler (SMAS). The key idea is to adopt thread group scheduling, which partitions threads based on address space, to reduce switching overhead and give each thread a chance to occupy CPU time. There are three main contributions: 1) SMAS does well in balancing performance and fairness among all threads; 2) to our knowledge, it is the first...

10.1145/2206781.2206852 article EN Proceedings of the Great Lakes Symposium on VLSI 2012-05-03

Cache Promotion Policy Using Re-reference Interval Prediction

Gangyong Jia Xi Li Chao Wang Xuehai Zhou Zongwei Zhu

The last-level cache (LLC) mitigates the long latencies of memory access in today's chip multi-core processors (CMPs). The promotion policy of the LLC largely affects cache efficiency, while an inappropriate promotion policy may lead useless blocks to remain in the cache longer than necessary, which in turn results in inefficiency. Current state-of-the-art promotion policies are unaware of the re-reference interval of cache accesses. Applications that exhibit a perform poorly with these policies. In this paper, we propose a cache promotion policy that uses re-reference interval prediction (RRIP) information. Such...

10.1109/cluster.2012.32 article EN 2012-09-01

Impacts of Memory Address Mapping Scheme on Reducing DRAM Self-Refresh Power for Mobile Computing Devices

Zongwei Zhu Jing Cao Xi Li Junneng Zhang Youqing Xu and 1 more

With the growth of the Internet of Things (IoT), more and more computing tasks are implemented on power-sensitive mobile devices, causing an energy consumption bottleneck. Most mobile devices consume considerable power in standby mode, during which the self-refresh power of capacitive DRAM cells, which is used to preserve data integrity, accounts for a large part. To address this issue, strategies from both hardware and software perspectives have been proposed, yet existing methods usually have a high cost. Software...

10.1109/access.2018.2885064 article EN cc-by-nc-nd IEEE Access 2018-01-01

DTS: Using Dynamic Time-Slice Scaling to Address the OS Problem Incurred by DVFS

Gangyong Jia Xuhong Gao Xi Li Chao Wang Xuehai Zhou

Dynamic voltage and frequency scaling (DVFS) has been the most useful technology to reduce power consumption, but it causes unpredictable decreases in program performance and unfair sharing among threads, which may render performance analysis, optimization, and isolation extremely difficult and lead to thread starvation and priority inversion. This paper first proposes an OS scheduler based on dynamic time-slice scaling (DTS) to address the OS problem incurred by DVFS. The DTS dynamically allocates each thread a time-slice according to the threads' behavior...

10.1109/clusterw.2012.12 article EN 2012-09-01

FlexibleCP: A data augmentation strategy for traffic sign detection

Jingyi Shi Huanle Rao Qinyang Jing Ziqiang Wen Gangyong Jia

In the field of traffic sign detection, effective data augmentation can improve the model's detection capacity, enabling the model to distinguish and locate traffic signs more precisely and enhancing driving safety. However, due to the small size and low representation of traffic signs in the dataset, standard and common data augmentation techniques are not suitable for traffic sign detection. To address this issue, a novel data augmentation strategy called flexible cut and paste (FlexibleCP) is proposed. The overall enhancement approach is shifted from multi-image fusion to target cropping...

10.1049/ipr2.13204 article EN cc-by-nc-nd IET Image Processing 2024-09-18

PseudoNUMA for reducing memory interference in multi-core systems

Gangyong Jia Xi Li Youwei Yuan Jian Wan Congfeng Jiang and 1 more

The growing gap between microprocessor speed and DRAM speed is a major problem that computer designers are facing. In order to narrow the gap, it is necessary to improve DRAM's throughput. Moreover, on multi-core platforms, memory shared by all cores usually suffers from the memory contention and interference problem, which can cause serious performance degradation and unfairness of the overall system. To address these problems, this paper proposes techniques to take advantage of partitioning cores, threads, and banks into groups to form...

10.5555/2663510.2663516 article EN High Performance Computing Symposium 2014-04-13

Analyzing Parallelization and Program Performance in Heterogeneous MPSoCs

Chao Wang Xi Li Junneng Zhang Gangyong Jia Peng Chen and 1 more

In this paper, we extend and analyze Amdahl's law for the general heterogeneous MPSoC era and find out how the speedup is affected by parameters, including the amount of microprocessors and accelerators, as well as task partition characteristics. We also present theoretical results about how the extended Amdahl's law can be applied to leverage load balancing of a heterogeneous MPSoC without the abstract limitation of base core equivalents (BCEs). A prototype on FPGA is constructed with MicroBlaze processors and JPEG hardware accelerators. The experimental results demonstrate that our...

10.1109/mascots.2012.61 article EN 2012-08-01

Behavior Aware Data Locality for Caches

Gangyong Jia Xi Li Chao Wang Xuehai Zhou Zongwei Zhu

Optimizing cache performance through improving data locality has been receiving a lot of attention. However, none of the existing approaches can combine each task's behavior to optimize data locality for caches. We present behavior aware data locality (BADL) in this paper. The key idea is to add task behavior when allocating memory, which takes advantage of different task behavior to improve performance. There are five main contributions: 1. to the best of our knowledge, this is the first attempt to improve data locality by combining task behavior, 2. BADL detailed analyzes low derived from internal line, more...

10.1109/icpads.2012.76 article EN 2012-12-01

A Memory Partition Policy for Mitigating Contention

Gangyong Jia Xi Li Jian Wan Chao Wang Dai Dong

10.7544/issn1000-1239.2015.20140706 article EN Journal of Computer Research and Development 2015-11-01

PUMA: From Simultaneous to Parallel for Shared Memory System in Multi-core

Gangyong Jia Liang Shi Xi Li Dong Dai

10.1007/s11265-015-1015-3 article EN Journal of Signal Processing Systems 2015-06-29

Using FOM predicting method for scheduling on Chip Multi-Processor

Gangyong Jia Sheng Wei Wenbo Dai Xi Li

On a Chip Multi-Processor (CMP) architecture, cache sharing impacts threads non-uniformly: some threads may be slowed down significantly, while others are not. This may cause severe performance problems such as decreased throughput and cache thrashing. This paper proposes a new model for predicting inter-thread cache contention, FOM (Frequency of Miss), and schedules threads based on the predicted results on the CMP architecture. The input to our model is the number of L2 cache misses of each thread. The output is the number of extra L2 cache misses for each thread due to cache sharing. We use the model to guide scheduling....

10.1109/iccsn.2011.6013973 article EN 2011-05-01

Pseudo Share: Bring Shared to Exclusive for Main Memory in Multi-core Systems

Xiaolin Meng Gangyong Jia Jian Wan Jilin Zhang

In a modern multi-core system, memory is shared among more and more concurrently running threads. Therefore, memory contention and interference are serious, which induces uneven performance degradation, unfairness in resource sharing, priority inversion, and even starvation. In this paper, we first analyze the problems induced by shared memory in detail, and then propose the pseudo share framework, which brings shared memory to exclusive in a multi-core system. The framework contains three steps: 1) Partition threads into thread groups respectively, each group runs on one core occupying...

10.1109/cse.2014.347 article EN 2014-12-01
