Wei Niu

ORCID: 0000-0002-2697-7042
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Fault Detection and Control Systems
  • Domain Adaptation and Few-Shot Learning
  • Advanced Image and Video Retrieval Techniques
  • Anomaly Detection Techniques and Applications
  • Advanced Image Processing Techniques
  • Advanced Vision and Imaging
  • Human Pose and Action Recognition
  • Advanced Sensor and Control Systems
  • Adversarial Robustness in Machine Learning
  • Grey System Theory Applications
  • Recommender Systems and Techniques
  • Robotics and Sensor-Based Localization
  • Icing and De-icing Technologies
  • Machine Fault Diagnosis Techniques
  • Advanced Algorithms and Applications
  • Video Surveillance and Tracking Methods
  • Image Processing Techniques and Applications
  • Machine Learning and Data Classification
  • Autonomous Vehicle Technology and Safety
  • Topic Modeling
  • Advanced Measurement and Detection Methods
  • IoT and Edge/Fog Computing
  • CCD and CMOS Imaging Sensors
  • Hydraulic and Pneumatic Systems

Xi'an Aeronautical University
2016-2025

University of Georgia
2024-2025

William & Mary
2019-2024

China Southern Power Grid (China)
2021-2024

Guizhou University
2013-2024

Beihang University
2024

Beijing Jiaotong University
2023

Williams (United States)
2020-2022

Guizhou Electric Power Design and Research Institute
2019-2022

Institute of Software
2022

With the emergence of a spectrum high-end mobile devices, many applications that formerly required desktop-level computation capability are being transferred to these devices. However, executing Deep Neural Networks (DNNs) inference is still challenging considering high and storage demands, specifically, if real-time performance with accuracy needed. Weight pruning DNNs proposed, but existing schemes represent two extremes in design space: non-structured fine-grained, accurate, not hardware...

10.1145/3373376.3378534 preprint EN 2020-03-09

Model compression techniques on Deep Neural Network (DNN) have been widely acknowledged as an effective way to achieve acceleration a variety of platforms, and DNN weight pruning is straightforward method. There are currently two mainstreams methods representing extremes regularity: non-structured, fine-grained can high sparsity accuracy, but not hardware friendly; structured, coarse-grained exploits hardware-efficient structures in pruning, suffers from accuracy drop when the rate high. In...

10.1609/aaai.v34i04.5954 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

We present a framework for detecting and recognizing human activities outdoor video surveillance applications. Our research makes the following contributions: For activity detection tracking, we improve robustness by providing intelligent control fail-over mechanisms, built on top of low-level motion algorithms such as frame differencing feature correlation. recognition, propose an efficient representation that enables recognition different interaction patterns among group people based...

10.1109/icme.2004.1394293 article EN 2005-03-21

We propose a new model toward improving the quality of image recommendations in social sharing communities like Pinterest, Flickr, and Instagram. Concretely, we Neural Personalized Ranking (NPR) -- personalized pairwise ranking over implicit feedback datasets that is inspired by Bayesian (BPR) recent advances neural networks. further build an enhanced augmenting basic NPR with multiple contextual preference clues including user tags, geographic features, visual factors. In our experiments...

10.1145/3159652.3159728 article EN 2018-02-02

Deep Neural Networks (DNNs) have emerged as the core enabler of many major applications on mobile devices. To achieve high accuracy, DNN models become increasingly deep with hundreds or even thousands operator layers, leading to memory and computational requirements for inference. Operator fusion (or kernel/layer fusion) is key optimization in state-of-the-art execution frameworks, such TensorFlow, TVM, MNN, that aim improve efficiency However, these frameworks usually adopt approaches based...

10.1145/3453483.3454083 article EN 2021-06-18

The rapid development and wide utilization of object detection techniques have aroused attention on both accuracy speed detectors. However, the current state-of-the-art works are either accuracy-oriented using a large model but leading to high latency or speed-oriented lightweight sacrificing accuracy. In this work, we propose YOLObile framework, real-time mobile devices via compression-compilation co-design. A novel block-punched pruning scheme is proposed for any kernel size. To improve...

10.1609/aaai.v35i2.16179 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Recurrent neural networks (RNNs) based automatic speech recognition has nowadays become promising and important on mobile devices such as smart phones. However, previous RNN compression techniques either suffer from hardware performance overhead due to irregularity or significant accuracy loss the preserved regularity for friendliness. In this work, we propose RTMobile that leverages both a novel block-based pruning approach compiler optimizations accelerate inference devices. Our proposed...

10.1109/dac18072.2020.9218499 article EN 2020-07-01

Artificial General Intelligence (AGI), possessing the capacity to comprehend, learn, and execute tasks with human cognitive abilities, engenders significant anticipation intrigue across scientific, commercial, societal arenas. This fascination extends particularly Internet of Things (IoT), a landscape characterized by interconnection countless devices, sensors, systems, collectively gathering sharing data enable intelligent decision-making automation. research embarks on an exploration...

10.48550/arxiv.2309.07438 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Complementary item recommendation finds products that go well with one another (e.g., a camera and specific lens). While complementary items are ubiquitous, the dimensions by which together can vary both product category, making it difficult to detect at scale. Moreover, in practice, user preferences for be complex combinations of quality evidence complementarity. Hence, we propose new neural recommender Encore jointly learn relationships preferences. Specifically, (i) effectively combines...

10.1145/3240323.3240368 article EN 2018-09-27

Though recent years have witnessed remarkable progress in single image super-resolution (SISR) tasks with the prosperous development of deep neural networks (DNNs), learning methods are confronted computation and memory consumption issues practice, especially for resource-limited platforms such as mobile devices. To overcome challenge facilitate real-time deployment SISR on mobile, we combine architecture search pruning propose an automatic framework that derives sparse (SR) models high...

10.1109/iccv48922.2021.00478 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

As deep convolutional neural networks (DNNs) are widely used in various fields of computer vision, leveraging the overfitting ability DNN to achieve video resolution upscaling has become a new trend modern delivery system. By dividing videos into chunks and over-fitting each chunk with super-resolution model, server encodes before transmitting them clients, thus achieving better quality transmission efficiency. However, large number expected ensure good quality, which substantially increases...

10.1109/cvpr52729.2023.00989 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

10.1049/icp.2024.3107 article EN IET conference proceedings. 2025-01-01

The engine reverse thrust control is of great significance to the landing deceleration aircraft. action time inverse overcurrent protection changes with size current, and upper lower levels has a high adaptive ability, which can quickly cut out fault near power side. Compared stage current protection, it better meet dual requirements quick movement selectivity distribution network, when system operation mode changes, its sensitivity will not change greatly. Aiming at electrical...

10.1049/icp.2024.2846 article EN IET conference proceedings. 2025-01-01

Fine-tuning helps large language models (LLM) recover degraded information and enhance task performance.Although Low-Rank Adaptation (LoRA) is widely used effective for fine-tuning, we have observed that its scaling factor can limit or even reduce performance as the rank size increases. To address this issue, propose RoRA (Rank-adaptive Reliability Optimization), a simple yet method optimizing LoRA's factor. By replacing $\alpha/r$ with $\alpha/\sqrt{r}$, ensures improved Moreover, enhances...

10.48550/arxiv.2501.04315 preprint EN arXiv (Cornell University) 2025-01-08

10.1109/icassp49660.2025.10889613 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

We propose a lightweight system for (i) semi-automatically discovering and tracking bias themes associated with opposing sides of topic; (ii) identifying strong partisans who drive the online discussion; (iii) inferring opinion "regular" participants. By taking just two hand-picked seeds to characterize topic-space (e.g., "pro-choice" "pro-life") as weak labels, we develop an efficient optimization-based propagation method over social/information network. show how this approach leads 20%...

10.1145/2806416.2806573 article EN 2015-10-17

Recently, a new trend of exploring sparsity for accelerating neural network training has emerged, embracing the paradigm on edge. This paper proposes novel Memory-Economic Sparse Training (MEST) framework targeting accurate and fast execution edge devices. The proposed MEST consists enhancements by Elastic Mutation (EM) Soft Memory Bound (&S) that ensure superior accuracy at high ratios. Different from existing works sparse training, this current work reveals importance schemes performance...

10.48550/arxiv.2110.14032 preprint EN cc-by arXiv (Cornell University) 2021-01-01

In the radar cross section (RCS) prediction of complex target, intensive computational burden occurs while calculating multiple scattering effect. order to overcome large computing, we present program executing on graphics processing units (GPUs). this paper, analyze properties satellite, which antennas are described as cubes and columns, by employing GPU-based combinational method geometrical optics (GO) physical (PO) together with kd-tree technique. Furthermore, due distinctive treatment,...

10.1109/tap.2012.2207679 article EN IEEE Transactions on Antennas and Propagation 2012-07-10

Recently, Vision Transformer (ViT) has continuously established new milestones in the computer vision field, while high computation and memory cost makes its propagation industrial production difficult. Pruning, a traditional model compression paradigm for hardware efficiency, been widely applied various DNN structures. Nevertheless, it stays ambiguous on how to perform exclusive pruning ViT structure. Considering three key points: structural characteristics, internal data pattern of ViTs,...

10.48550/arxiv.2112.13890 preprint EN cc-by arXiv (Cornell University) 2021-01-01

With the increasing demand to efficiently deploy DNNs on mobile edge devices, it becomes much more important reduce unnecessary computation and increase execution speed. Prior methods towards this goal, including model compression network architecture search (NAS), are largely performed independently, do not fully consider compiler-level optimizations which is a must-do for acceleration. In work, we first propose (i) general category of fine-grained structured pruning applicable various DNN...

10.1109/cvpr46437.2021.01403 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

User-generated item lists are a popular feature of many different platforms. Examples include books on Goodreads, playlists Spotify and YouTube, collections images Pinterest, answers question-answer sites like Zhihu. Recommending is critical for increasing user engagement connecting users to new items, but approaches designed the item-based recommendation, without careful consideration complex relationships between items lists. Hence, in this paper, we propose novel user-generated list...

10.1145/3357384.3358030 preprint EN 2019-11-03
Coming Soon ...