NFDI4DS | UHH-SEMS - Publication Details

Forward and Backward Information Retention for Accurate Binary Neural Networks

OPENALEX - Publications

Haotong Qin Ruihao Gong Xianglong Liu Mingzhu Shen Ziran Wei and 2 more

Weight and activation binarization is an effective approach to deep neural network compression and can accelerate inference by leveraging bitwise operations. Although many binarization methods have improved the accuracy of the model by minimizing the quantization error in forward propagation, there remains a noticeable performance gap between the binarized model and the full-precision one. Our empirical study indicates that the quantization brings information loss in both forward and backward propagation, which is the bottleneck of training accurate binary neural networks. To address these issues,...

10.1109/cvpr42600.2020.00232 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Yangguang Li Bin Huang Zeren Chen Yufeng Cui Feng Liang and 6 more

Recently, the perception task based on Bird's-Eye View (BEV) representation has drawn more and more attention, and BEV representation is promising as the foundation for next-generation Autonomous Vehicle (AV) perception. However, most existing BEV solutions either require considerable resources to execute on-vehicle inference or suffer from modest performance. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing faster BEV perception on on-vehicle chips. Towards this goal, we first empirically find that...

10.1109/tpami.2024.3414835 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-01-01

Searching for Accurate Binary Neural Architectures

Mingzhu Shen Kai Han Chunjing Xu Yunhe Wang

Binary neural networks have attracted tremendous attention due to the efficiency of deploying them on mobile devices. Because of the weak expression ability of binary weights and features, their accuracy is usually much lower than that of full-precision (i.e. 32-bit) models. Here we present a new framework for automatically searching for compact but accurate binary neural networks. In practice, the number of channels in each layer will be encoded into the search space and optimized using the evolutionary algorithm. Experiments conducted...

10.1109/iccvw.2019.00256 preprint EN 2019-10-01

Biochar combined with organic and inorganic fertilizers promoted the rapeseed nutrient uptake and improved the purple soil quality

Ming Liu Linna Cholidah Shumin Ma Qun Ma Wenfeng Song and 5 more

Biochar is a kind of organic matter that can be added into soil to improve soil quality. To study the effect of biochar combined with organic and inorganic fertilizers on rapeseed growth, purple soil fertility and microbial community, a completely randomized block design was designed with three levels of biochar (B0: no biochar, B1: low-rate biochar, B2: high-rate biochar); two levels of inorganic fertilizer (F1: low-rate inorganic fertilizer; F2: high-rate inorganic fertilizer); and two levels of organic fertilizer (M1: no organic fertilizer; M2: organic fertilizer). All combinations were repeated three times. The application of biochar could improve soil pH and microbial community richness: the pH of B1F2M1 increased by 0.41...

10.3389/fnut.2022.997151 article EN cc-by Frontiers in Nutrition 2022-09-14

Balanced Binary Neural Networks with Gated Residual

Mingzhu Shen Xianglong Liu Ruihao Gong Kai Han

Binary neural networks have attracted considerable attention in recent years. However, mainly due to the information loss stemming from biased binarization, how to preserve the accuracy of networks still remains a critical issue. In this paper, we attempt to maintain the information propagated in the forward process and propose Balanced Binary Neural Networks with Gated Residual (BBG for short). First, a weight balanced binarization is introduced, and thus the informative binary weights can capture more information contained in the activations. Second, for binary activations, a gated...

10.1109/icassp40776.2020.9054599 article EN ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020-04-09

Once Quantization-Aware Training: High Performance Extremely Low-bit Architecture Search

Mingzhu Shen Feng Liang Ruihao Gong Yuhang Li Chuming Li and 4 more

Quantization Neural Networks (QNN) have attracted a lot of attention due to their high efficiency. To enhance the quantization accuracy, prior works mainly focus on designing advanced quantization algorithms but still fail to achieve satisfactory results under the extremely low-bit case. In this work, we take an architecture perspective to investigate the potential of high-performance QNN. Therefore, we propose to combine Network Architecture Search methods with quantization to enjoy the merits of the two sides. However, a naive combination inevitably...

10.1109/iccv48922.2021.00529 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Cached Multi-Lora Composition for Multi-Concept Image Generation

Xiaoqiang Zou Mingzhu Shen Christos-Savvas Bouganis Yiren Zhao

Low-Rank Adaptation (LoRA) has emerged as a widely adopted technique in text-to-image models, enabling precise rendering of multiple distinct elements, such as characters and styles, in multi-concept image generation. However, current approaches face significant challenges when composing these LoRAs for multi-concept image generation, resulting in diminished generated image quality. In this paper, we initially investigate the role of LoRAs in the denoising process through the lens of the Fourier frequency domain. Based on the hypothesis that applying multiple LoRAs could...

10.48550/arxiv.2502.04923 preprint EN arXiv (Cornell University) 2025-02-07

Dynamic Model Merging with Mixture of Weights

Hancheng Ye Chenyu Huang Mingzhu Shen Tao Chen Yongqi Huang and 1 more

10.1109/tcsvt.2025.3544680 article EN IEEE Transactions on Circuits and Systems for Video Technology 2025-01-01

MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing

Yuhang Li Feng Zhu Ruihao Gong Mingzhu Shen Xin Dong and 3 more

User data confidentiality protection is becoming a rising challenge in present deep learning research. Without access to data, conventional data-driven model compression faces a higher risk of performance degradation. Recently, some works propose to generate images from a specific pretrained model to serve as training data. However, the inversion process only utilizes biased feature statistics stored in one model and is from low-dimension to high-dimension. As a consequence, it inevitably encounters the difficulties of generalizability...

10.1109/iccv48922.2021.00437 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Yuhang Li Mingzhu Shen Jian Ma Yan Ren Mingxin Zhao and 4 more

Model quantization has emerged as an indispensable technique to accelerate deep learning inference. While researchers continue to push the frontier of quantization algorithms, existing quantization work is often unreproducible and undeployable. This is because researchers do not choose consistent training pipelines and ignore the requirements for hardware deployments. In this work, we propose Model Quantization Benchmark (MQBench), a first attempt to evaluate, analyze, and benchmark the reproducibility and deployability of model quantization algorithms. We choose multiple different...

10.48550/arxiv.2111.03759 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Bin Huang Yangguang Li Enze Xie Feng Liang Luya Wang and 5 more

Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes expensive Lidar sensors, making it a feasible solution for economical autonomous driving. However, most existing BEV solutions either suffer from modest performance or require considerable resources to execute on-vehicle inference. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing real-time BEV perception on on-vehicle chips. Towards this goal, we first empirically find that the BEV representation can be...

10.48550/arxiv.2301.07870 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Yangguang Li Bin Huang Zeren Chen Yufeng Cui Feng Liang and 6 more

Recently, the perception task based on Bird's-Eye View (BEV) representation has drawn more and more attention, and BEV representation is promising as the foundation for next-generation Autonomous Vehicle (AV) perception. However, most existing BEV solutions either require considerable resources to execute on-vehicle inference or suffer from modest performance. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing faster BEV perception on on-vehicle chips. Towards this goal, we first empirically find that...

10.48550/arxiv.2301.12511 preprint EN other-oa arXiv (Cornell University) 2023-01-01

NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database

Liang Liu Mingzhu Shen Ruihao Gong Fengwei Yu Hailong Yang

Deep neural networks (DNNs) are widely used in various applications. Accurate latency feedback is essential for model design and deployment. In this work, we attempt to alleviate the cost of model latency acquisition from two aspects: latency query and latency prediction. To ease the difficulty of acquiring model latency on multiple platforms, our latency query system can automatically convert a DNN model into the corresponding executable format and measure latency on the target hardware. Powered by this, latency queries can be fulfilled with a simple interface call. For efficient utilization...

10.1145/3545008.3545051 article EN 2022-08-29

$\Delta$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers

Pengtao Chen Mingzhu Shen Peng Ye Jianjian Cao Chongjun Tu and 3 more

Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of the DiT structure on generation, as well as the absence of an acceleration framework tailored to the DiT architecture. To tackle these challenges, we conduct an investigation into the correlation between...

10.48550/arxiv.2406.01125 preprint EN arXiv (Cornell University) 2024-06-03

Searching for Accurate Binary Neural Architectures

Mingzhu Shen Kai Han Chunjing Xu Yunhe Wang

Binary neural networks have attracted tremendous attention due to the efficiency of deploying them on mobile devices. Because of the weak expression ability of binary weights and features, their accuracy is usually much lower than that of full-precision (i.e. 32-bit) models. Here we present a new framework for automatically searching for compact but accurate binary neural networks. In practice, the number of channels in each layer will be encoded into the search space and optimized using the evolutionary algorithm. Experiments conducted...

10.48550/arxiv.1909.07378 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Balanced Binary Neural Networks with Gated Residual

Mingzhu Shen Xianglong Liu Ruihao Gong Kai Han

Binary neural networks have attracted considerable attention in recent years. However, mainly due to the information loss stemming from biased binarization, how to preserve the accuracy of networks still remains a critical issue. In this paper, we attempt to maintain the information propagated in the forward process and propose Balanced Binary Neural Networks with Gated Residual (BBG for short). First, a weight balanced binarization is introduced to maximize the information entropy of binary weights, and thus the informative binary weights can capture more information contained in the activations....

10.48550/arxiv.1909.12117 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin Ruihao Gong Xianglong Liu Mingzhu Shen Ziran Wei and 2 more

Weight and activation binarization is an effective approach to deep neural network compression and can accelerate inference by leveraging bitwise operations. Although many binarization methods have improved the accuracy of the model by minimizing the quantization error in forward propagation, there remains a noticeable performance gap between the binarized model and the full-precision one. Our empirical study indicates that the quantization brings information loss in both forward and backward propagation, which is the bottleneck of training accurate binary neural networks. To address these issues,...

10.48550/arxiv.1909.10788 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Biochar Combined with Organic and Inorganic Fertilizers Promoted the Rapeseed Growth and Improved the Soil Quality in Purple Soil

Ming Liu Linna Cholidah Shumin Ma Qun Ma Wenfeng Song and 5 more

Background: Biochar is one kind of organic matter that can be added into soil as a soil amendment to improve its quality. To study the effect of biochar addition combined with organic and inorganic fertilizers on growth, soil fertility and microbial community in purple soil, a completely randomized block design was designed with three levels of biochar [B0: no biochar, B1: low-rate biochar (35 t/ha), B2: high-rate biochar (50 t/ha)]; two levels of inorganic fertilizer [F1: low-rate inorganic fertilizer (30 kg/ha N, 87.5 kg/ha P₂O₅, 60 kg/ha K₂O); F2: high-rate inorganic fertilizer (60 kg/ha N, 175 kg/ha P₂O₅, 120 kg/ha K₂O)]; and two levels of organic fertilizer [M1: no organic fertilizer; M2: organic fertilizer (4.5 t/ha)]....

10.21203/rs.3.rs-1883568/v1 preprint EN cc-by Research Square (Research Square) 2022-07-29

Hybrid Graph: A Unified Graph Representation with Datasets and Benchmarks for Complex Graphs

Zehui Li Xiangyu Zhao Mingzhu Shen Guy‐Bart Stan Píetro Lió and 1 more

Graphs are widely used to encapsulate a variety of data formats, but real-world networks often involve complex node relations beyond only being pairwise. While hypergraphs and hierarchical graphs have been developed and employed to account for the complex node relations, they cannot fully represent these complexities in practice. Additionally, though many Graph Neural Networks (GNNs) have been proposed for representation learning on higher-order graphs, they are usually evaluated on simple graph datasets. Therefore, there is a need...

10.48550/arxiv.2306.05108 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Merging Vision Transformers from Different Tasks and Domains

Peng Ye Chenyu Huang Mingzhu Shen Tao Chen Yongqi Huang and 2 more

This work aims to merge various Vision Transformers (ViTs) trained on different tasks (i.e., datasets with different object categories) or domains (i.e., datasets with the same categories but different environments) into one unified model, still yielding good performance on each task or domain. Previous model merging works focus on either CNNs or NLP models, leaving ViT merging research untouched. To fill this gap, we first explore and find that existing model merging methods cannot handle the merging of whole ViT models well and still have room for improvement. To enable the merging of the whole ViT, we propose a...

10.48550/arxiv.2312.16240 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Status Prediction for Questions Post on Technical Forums

Mingzhu Shen Hui Liu

10.7544/issn1000-1239.2020.20190625 article EN Journal of Computer Research and Development 2020-03-01

MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing

Yuhang Li Feng Zhu Ruihao Gong Mingzhu Shen Xin Dong and 3 more

User data confidentiality protection is becoming a rising challenge in present deep learning research. Without access to data, conventional data-driven model compression faces a higher risk of performance degradation. Recently, some works propose to generate images from a specific pretrained model to serve as training data. However, the inversion process only utilizes biased feature statistics stored in one model and is from low-dimension to high-dimension. As a consequence, it inevitably encounters the difficulties of generalizability...

10.48550/arxiv.2011.09899 preprint EN cc-by-nc-nd arXiv (Cornell University) 2020-01-01