NFDI4DS | UHH-SEMS - Publication Details

Tigr

OPENALEX - Publications

Amir Hossein Nodehi Sabet Junqiao Qiu Zhijia Zhao

Graph analytics delivers deep knowledge by processing large volumes of highly connected data. In real-world graphs, the degree distribution tends to follow power law -- a small portion nodes own number neighbors. The high irregularity acts as major barrier their efficient on GPU architectures, which are primarily designed for accelerating computations regular data with SIMD executions. Existing solutions inefficiency GPU-based graph either modify programming abstraction or rely changes...

10.1145/3173162.3173180 article EN 2018-03-19

More Apps, Faster Hot-Launch on Mobile Devices via Fore/Background-aware GC-Swap Co-design

OPENALEX - Publications

Jiacheng Huang Yunmo Zhang Junqiao Qiu Yu Liang Rachata Ausavarungnirun and 2 more

Faster app launching is crucial for the user experience on mobile devices. Apps launched from a background cached state, called hot-launching, have much better performance than apps scratch. To increase number of hot-launches, leading vendors now cache more in by enabling swap. Recent work also proposed reducing Java heap to apps. However, this paper found that existing methods deteriorate hot-launch while increasing simultaneously improve and performance, proposes Fleet,...

10.1145/3620666.3651377 article EN 2024-04-24

MicroSpec

OPENALEX - Publications

Junqiao Qiu Zhijia Zhao Bin Ren

Finite state machines (FSMs) are basic computation models that play essential roles in many applications. Enabling efficient parallel FSM execution is critical to the performance of these However, they very challenging parallelize due their inherent data dependencies occur at each step computations.

10.1145/2967938.2967965 article EN 2016-08-31

Tigr

OPENALEX - Publications

Amir Hossein Nodehi Sabet Junqiao Qiu Zhijia Zhao

Graph analytics delivers deep knowledge by processing large volumes of highly connected data. In real-world graphs, the degree distribution tends to follow power law -- a small portion nodes own number neighbors. The high irregularity acts as major barrier their efficient on GPU architectures, which are primarily designed for accelerating computations regular data with SIMD executions. Existing solutions inefficiency GPU-based graph either modify programming abstraction or rely changes...

10.1145/3296957.3173180 article EN ACM SIGPLAN Notices 2018-03-19

Enabling scalability-sensitive speculative parallelization for FSM computations

OPENALEX - Publications

Junqiao Qiu Zhijia Zhao Bo Wu Abhinav Vishnu Shuaiwen Leon Song

Finite state machines (FSMs) are the backbone of many applications, but difficult to parallelize due their inherent dependencies. Speculative FSM parallelization has shown promise on multicore with up eight cores. However, as hardware parallelism grows (e.g., Xeon Phi 288 logical cores), a fundamental question raises: How does speculative scale number cores increases? Without answering this question, existing methods for simply choose use all available cores, which might not only waste...

10.1145/3079079.3079082 article EN 2017-05-31

GSpecPal: Speculation-Centric Finite State Machine Parallelization on GPUs

OPENALEX - Publications

Yu Guang Wang Robbie Watling Junqiao Qiu Zhenlin Wang

Finite State Machine (FSM) plays a critical role in many real-world applications, ranging from pattern matching to network security. In recent years, significant research efforts have been made accelerate FSM computations on different parallel platforms, including multicores, GPUs, and DRAM-based accelerators. A popular direction is the speculation-centric parallelization. Despite their abundance promising results, benefits of parallelization GPUs heavily depend high speculation accuracy are...

10.1109/ipdps53621.2022.00053 article EN 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2022-05-01

Scalable structural index construction for JSON analytics

OPENALEX - Publications

Lin Jiang Junqiao Qiu Zhijia Zhao

JavaScript Object Notation (JSON) and its variants have gained great popularity in recent years. Unfortunately, the performance of their analytics is often dragged down by expensive JSON parsing. To address this, work has shown that building bitwise indices on data, called structural , can greatly accelerate querying. Despite promise, existing index construction does not scale well as records become larger more complex, due to (inherently) sequential process involvement costly memory copies...

10.14778/3436905.3436926 article EN Proceedings of the VLDB Endowment 2020-12-01

Scalable FSM parallelization via path fusion and higher-order speculation

OPENALEX - Publications

Junqiao Qiu Xiaofan Sun Amir Hossein Nodehi Sabet Zhijia Zhao

Finite-state machine (FSM) is a fundamental computation model used by many applications. However, FSM execution known to be "embarrassingly sequential" due the state dependences among transitions. Existing solutions leverage enumerative or speculative parallelization break dependences. efficiency of both schemes highly depends on properties and its inputs. For those exhibiting unfavorable properties, former suffers from overhead maintaining multiple paths, while latter bottlenecked serial...

10.1145/3445814.3446705 article EN 2021-04-11

Two numerical algorithms and numerical experiments for efficiently solving inequality-and-bound constrained QP

OPENALEX - Publications

Junqiao Qiu Weibing Li Yunong Zhang Senbo Fu Hong‐Zhou Tan

This paper presents and investigates two new numerical algorithms (i.e., E47 algorithm 94LVI algorithm) for solving the quadratic programming (QP) problem subject to inequality bound constraints. Such a constrained QP is firstly converted equivalently into linear variational (LVI), then piecewise-linear projection equation (PLPE). The are employed solve resultant PLPE, thus optimal solution obtained readily. In this paper, we analyze computational complexities present global convergence of...

10.1109/icsai.2014.7009256 article EN 2014-11-01

Challenging Sequential Bitstream Processing via Principled Bitwise Speculation

OPENALEX - Publications

Junqiao Qiu Lin Jiang Zhijia Zhao

Many performance-critical applications traverse bitstreams with bitwise computations for better performance or higher space efficiency, such as multimedia processing and bitmap indexing. However, when these carry dependences, the entire bitstream traversal becomes serial, fundamentally limiting scalability. In this work, we show that bitstream-carried dependences are actually "breakable" in many cases, adoption of a systematic treatment - principled speculation (PBS). The core idea PBS stems...

10.1145/3373376.3378461 article EN 2020-03-09

Population projection of the Indian subcontinent using TP-aided WASD neuronet

OPENALEX - Publications

Yunong Zhang Li Wan Liangyu He Junqiao Qiu Hong‐Zhou Tan

The population projection of the Indian subcontinent, which is closely related to future development this region and even whole world, has catched great attention among sociologists as well scientists. However, most former researches are just based on fertility, mortality or other individual quantifiable factors by using some traditional statistical models thus may lack all-sidedness in their results. historical data comprehensive reflection under influence all factors. Based over 2000...

10.1109/fskd.2016.7603145 article EN 2016-08-01

Reliability Analysis for Unreliable FSM Computations

OPENALEX - Publications

Amir Hossein Nodehi Sabet Junqiao Qiu Zhijia Zhao Sriram Krishnamoorthy

Finite State Machines (FSMs) are fundamental in both hardware design and software development. However, the reliability of FSM computations remains poorly understood. Existing analyses mainly designed for generic unaware special error tolerance characteristics computations. This work introduces RelyFSM -- a state-level analysis framework By modeling behaviors unreliable executions qualitatively reasoning about transition structures, can precisely capture inherent Our evaluation with...

10.1145/3377456 article EN ACM Transactions on Architecture and Code Optimization 2020-05-29

Northern American population data recovery from 1500AD to 1950AD as well as prediction using WASD neuronet with 513-year data

OPENALEX - Publications

Yunong Zhang Li Wan Binbin Qiu Junqiao Qiu Hong‐Zhou Tan

The recovery and prediction of Northern American population data, which are closely related to the future development America even whole world, have become significant subjects captured great attention among sociologists as well scientists. However, most relevant researches just based on fertility, mortality or other individual quantifiable factors by traditional statistical models thus lack all-sidedness in their results. As we know, historical data comprehensive reflection under influence...

10.1109/cac.2015.7382466 article EN 2015-11-01

Transforming query sequences for high-throughput B+ tree processing on many-core processors

OPENALEX - Publications

Ruiqin Tian Junqiao Qiu Zhijia Zhao Xu Liu Bin Ren

The throughput of B+ tree query processing is critical to many databases, file systems, and cloud applications. Based on bulk synchronous parallel (BSP), latch-free has shown promise by queries in small batches avoiding the use locks. As number cores CPUs increases, it becomes possible process larger without adding any extra delays. In this work, we argue that as batch size there will be more optimization opportunities exposed beyond parallelism, especially when distributions are highly...

10.5555/3314872.3314887 article EN Symposium on Code Generation and Optimization 2019-02-16

Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-Core Processors

OPENALEX - Publications

Ruiqin Tian Junqiao Qiu Zhijia Zhao Xu Liu Bin Ren

The throughput of B+ tree query processing is critical to many databases, file systems, and cloud applications. Based on bulk synchronous parallel (BSP), latch-free has shown promise by queries in small batches avoiding the use locks. As number cores CPUs increases, it becomes possible process larger without adding any extra delays. In this work, we argue that as batch size there will be more optimization opportunities exposed beyond parallelism, especially when distributions are highly...

10.1109/cgo.2019.8661166 article EN 2019-02-01

Exploring Scalable Parallelization for Edit Distance-Based Motif Search

OPENALEX - Publications

Junqiao Qiu Ali Ebnenasir

Motif Searching is an important problem that can reveal crucial information from biological data. Since the general motif searching NP-hard and volume of data growing exponentially in recent years, there a pressing need for developing time space-efficient algorithms to find motifs. In this paper, we explore scalable parallelization Edit Distance-Based Search (EMS). We introduce two parallel designs, recursEMS which integrates existing EMS solver into recursion tree running multiple...

10.1109/tcbb.2022.3208867 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022-09-23

Challenging Sequential Bitstream Processing via Principled Bitwise Speculation

OPENALEX - Publications

Junqiao Qiu

10.5281/zenodo.3563497 article EN cc-by Zenodo (CERN European Organization for Nuclear Research) 2019-12-05

Challenging Sequential Bitstream Processing via Principled Bitwise Speculation

OPENALEX - Publications

Junqiao Qiu

10.5281/zenodo.3565136 article EN cc-by Zenodo (CERN European Organization for Nuclear Research) 2019-12-05

Challenging Sequential Bitstream Processing via Principled Bitwise Speculation

OPENALEX - Publications

Junqiao Qiu

10.5281/zenodo.3563465 article EN Zenodo (CERN European Organization for Nuclear Research) 2019-12-05

Challenging Sequential Bitstream Processing via Principled Bitwise Speculation

OPENALEX - Publications

Junqiao Qiu

10.5281/zenodo.3610556 article EN Zenodo (CERN European Organization for Nuclear Research) 2020-01-16