NFDI4DS | UHH-SEMS - Publication Details

Qiong Wang

ORCID: 0000-0003-3755-6267

About

Contact & Profiles

OpenAlex: A5100417401

Research Areas

Parallel Computing and Optimization Techniques
Advanced Data Storage Technologies
Cloud Computing and Resource Management
Advanced Image Processing Techniques
Image Processing Techniques and Applications
Advanced Vision and Imaging
Algorithms and Data Compression
Cryptography and Data Security
Cryptography and Residue Arithmetic
Software System Performance and Reliability
Privacy-Preserving Technologies in Data
Error Correcting Code Techniques
Distributed and Parallel Computing Systems
Coding theory and cryptography
Internet Traffic Analysis and Secure E-voting
Low-power high-performance VLSI design
Machine Learning in Materials Science
Genomics and Phylogenetic Studies
Interconnection Networks and Systems
Embedded Systems Design Techniques
Network Security and Intrusion Detection
Graph Theory and Algorithms
DNA and Biological Computing
Advanced Memory and Neural Computing
Advanced Computing and Algorithms

National University of Defense Technology
2013-2024

Changsha University
2021

PLA Army Engineering University
2009

Towards an Efficient Privacy-Preserving Decision Tree Evaluation Service in the Internet of Things

OPENALEX - Publications

Lin Liu, Jinshu Su, Baokang Zhao, Qiong Wang, Jinrong Chen, and 1 more

With the fast development of Internet of Things (IoT) technology, normal people and organizations can produce massive data every day. Due to a lack of data mining expertise and computation resources, most of them choose to use data mining services. Unfortunately, directly sending query data to the cloud may violate their privacy. In this work, we mainly consider designing a scheme that enables the cloud to provide an efficient privacy-preserving decision tree evaluation service for resource-constrained clients in the IoT. To design such a scheme, a new...

10.3390/sym12010103 article EN Symmetry 2020-01-06

A statistic approach for power analysis of integrated GPU

Qiong Wang, Ning Li, Li Shen, Zhiying Wang

10.1007/s00500-017-2786-1 article EN Soft Computing 2017-08-17

Super-Resolution Model Quantized in Multi-Precision

Jingyu Liu, Qiong Wang, Dunbo Zhang, Li Shen

Deep learning has achieved outstanding results in various machine learning tasks, against the background of a rapid increase in the computing capacity of equipment. However, while achieving higher performance and effects, the model size is larger, the training and inference time is longer, the memory and storage occupancy is increasing, the efficiency is shrinking, and the energy consumption is growing. Consequently, it is difficult to run these models on edge devices such as micro and mobile devices. Model compression technology is gradually emerging...

10.3390/electronics10172176 article EN Electronics 2021-09-06

Accelerating Weeder: A DNA Motif Search Tool Using the Micron Automata Processor and FPGA

Qiong Wang, Mohamed El-Hadedy, Kevin Skadron, Ke Wang

Motif searching, i.e., identifying meaningful patterns from biological data, has been studied extensively due to its importance in the biomedical sciences. In this work, we seek to improve the performance of Weeder, a widely-used tool for automatic de novo motif searching. Weeder consists of several functions, among which we find that the function oligo_scan, which handles the pattern matching, is the bottleneck, especially when dealing with large datasets. Motivated by this observation, we adopt the Micron Automata Processor (AP)...

10.1587/transinf.2017edp7051 article EN IEICE Transactions on Information and Systems 2017-01-01

Co-designing the Topology/Algorithm to Accelerate Distributed Training

Xianghui Hou, Rui Xu, Sheng Ma, Qiong Wang, Wei Jiang, and 1 more

With the development of Deep Learning (DL), Deep Neural Network (DNN) models have become more complex. At the same time, the development of the Internet makes it easy to obtain large data sets for DL training. Large-scale model parameters and training data enhance the level of AI by improving the accuracy of DNN models. On the other hand, they also present severe challenges to the hardware platform, because training a large model needs a lot of computing and memory resources that can easily exceed the capacity of a single processor. In this context, integrating processors hierarchical...

10.1109/ispa-bdcloud-socialcom-sustaincom52081.2021.00141 article EN 2021-09-01

A SOM-Based of Fault Diagnosis for WAN

Zhisong Pan, Qiong Wang, Guiqiang Ni, Guyu Hu

As computer networks continue to grow in size and complexity, fault management in today's high speed telecommunications networks is becoming ever more difficult. As a kernel aspect of network management, fault diagnosis by performance data is a process of deducing the exact source of a failure from a set of symptoms. In this paper, some existing approaches for fault diagnosis are first discussed. The paper concentrates on analyzing alarm propagation in the wide area network, and a fault diagnosis model is then proposed. The model is composed of two parts: Self-organizing maps training historical...

10.1109/iis.2009.114 article EN International Conference on Industrial and Information Systems 2009-04-01

Customizing Super-Resolution Framework According to Image Features

Ninghui Yuan, Jingyu Liu, Qiong Wang, Li Shen

As a popular research field of computer vision, super resolution (SR) has received more and more attention in recent years. Although deep learning methods have achieved good results in SR, there are still some problems. For example, previous models are often based on a single depth mechanism. This means that the SR reconstruction problem of all images is regarded as being of equal complexity. We also found that images rich in details are suited to recovery by complex models, while other images, those with less texture information, are suited to simple models. At the same time,...

10.1109/ispa-bdcloud-socialcom-sustaincom51426.2020.00177 article EN 2020-12-01

Reducing TLB Miss Penalty on GPUs via Unified Multi-level PWB and PWC

Lin Yang, Dunbo Zhang, Chaoyang Jia, Qiong Wang, Li Shen

Recently, GPUs have come to be used across a broad range of domains. To support virtual memory, which is required by most applications at present, the address translation process is introduced to the GPU side. However, many applications demonstrate that an irregular memory access pattern, in which accesses are poorly structured and often data dependent, makes the performance worse, especially with virtual-to-physical address translations. The modern memory management unit (MMU) employs caching, e.g. the page walk buffer (PWB) and page walk cache (PWC), and scheduling...

10.1109/paap54281.2021.9720477 article EN 2021-12-10

A Survey of Power Consumption Modeling for GPU Architecture

Qiong Wang, Ning Li, Li Shen, Zhiying Wang

GPUs are of increasing interest in the multi-core era due to their high computing power. However, the power consumption caused by the rising performance has been a general concern. As a consequence, it is becoming an imperative demand to optimize GPU power consumption, for which power consumption estimation is one of the important and useful solutions. In this work, we give a survey of power consumption modeling for the GPU. We first introduce the current development of heterogeneous architectures and then summarize the existing techniques for modeling power consumption. The main two types...

10.23977/cpcs.2016.11006 article EN cc-by Computing Performance and Communication systems 2016-01-01

A Unified Page Walk Buffer and Page Walk Cache

Dunbo Zhang, Chaoyang Jia, Qiong Wang, Li Shen

The GPU enables shared virtual memory (SVM) to eliminate complex data transfer in programming for programmers. However, SVM brings an expensive overhead of address translation due to the large number of address translation requests that are generated simultaneously in the GPU. The Memory Management Unit (MMU) is designed to handle address translation. The page walk buffer (PWB) and page walk cache (PWC) hold lots of redundant information and offer limited improvement in performance. To address these problems, we propose a unified PWB and PWC. The unified PWB and PWC abandon the traditional linear table...

10.1109/ispa-bdcloud-socialcom-sustaincom51426.2020.00038 article EN 2020-12-01

Optimizing Stencil Codes with Exploiting Data Reuse

Xu Chang, Li Shen, Qiong Wang

Stencil code is widely used in the field of scientific computing. Currently, researchers are focusing on performance optimization for stencil applications by data-level parallelism or thread-level parallelism. Using vector/SIMD instructions, which are commonly used to achieve data-level parallelism, could effectively improve the performance of computation with a large number of repetitive operations, but the gain is usually limited by memory access bandwidth and by data and control dependencies. The Scalable Vector Extension (SVE), Vector-Length...

10.1109/iceert53919.2021.00018 article EN 2021-10-01
