NFDI4DS | UHH-SEMS - Publication Details

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

OPENALEX - Publications

Xiangyu Zhang Xinyu Zhou Mengxiao Lin Jian Sun

We introduce an extremely computation-efficient CNN architecture named ShuffleNet, which is designed specially for mobile devices with very limited computing power (e.g., 10-150 MFLOPs). The new utilizes two operations, pointwise group convolution and channel shuffle, to greatly reduce computation cost while maintaining accuracy. Experiments on ImageNet classification MS COCO object detection demonstrate the superior performance of ShuffleNet over other structures, e.g. lower top-1 error...

10.1109/cvpr.2018.00716 preprint EN 2018-06-01

DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients

OPENALEX - Publications

Shuchang Zhou Yuxin Wu Zekun Ni Xinyu Zhou He Wen and 1 more

We propose DoReFa-Net, a method to train convolutional neural networks that have low bitwidth weights and activations using parameter gradients. In particular, during backward pass, gradients are stochastically quantized numbers before being propagated layers. As convolutions forward/backward passes can now operate on activations/gradients respectively, DoReFa-Net use bit convolution kernels accelerate both training inference. Moreover, as be efficiently implemented CPU, FPGA, ASIC GPU,...

10.48550/arxiv.1606.06160 preprint EN other-oa arXiv (Cornell University) 2016-01-01

EAST: An Efficient and Accurate Scene Text Detector

OPENALEX - Publications

Xinyu Zhou Cong Yao He Wen Yuzhi Wang Shuchang Zhou and 2 more

Previous approaches for scene text detection have already achieved promising performances across various benchmarks. However, they usually fall short when dealing with challenging scenarios, even equipped deep neural network models, because the overall performance is determined by interplay of multiple stages and components in pipelines. In this work, we propose a simple yet powerful pipeline that yields fast accurate natural scenes. The directly predicts words or lines arbitrary...

10.1109/cvpr.2017.283 preprint EN 2017-07-01

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

OPENALEX - Publications

Xiangyu Zhang Xinyu Zhou Mengxiao Lin Jian Sun

We introduce an extremely computation-efficient CNN architecture named ShuffleNet, which is designed specially for mobile devices with very limited computing power (e.g., 10-150 MFLOPs). The new utilizes two operations, pointwise group convolution and channel shuffle, to greatly reduce computation cost while maintaining accuracy. Experiments on ImageNet classification MS COCO object detection demonstrate the superior performance of ShuffleNet over other structures, e.g. lower top-1 error...

10.48550/arxiv.1707.01083 preprint EN other-oa arXiv (Cornell University) 2017-01-01

DPGN: Distribution Propagation Graph Network for Few-Shot Learning

OPENALEX - Publications

L. Yang Liangliang Li Zilun Zhang Xinyu Zhou Erjin Zhou and 1 more

Most graph-network-based meta-learning approaches model instance-level relation of examples. We extend this idea further to explicitly the distribution-level one example all other examples in a 1-vs-N manner. propose novel approach named distribution propagation graph network (DPGN) for few-shot learning. It conveys both relations and each learning task. To combine examples, we construct dual complete which consists point with node standing an example. Equipped architecture, DPGN propagates...

10.1109/cvpr42600.2020.01340 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Scene Text Detection via Holistic, Multi-Channel Prediction

OPENALEX - Publications

Cong Yao Xiang Bai Nong Sang Xinyu Zhou Shuchang Zhou and 1 more

Recently, scene text detection has become an active research topic in computer vision and document analysis, because of its great importance significant challenge. However, vast majority the existing methods detect within local regions, typically through extracting character, word or line level candidates followed by candidate aggregation false positive elimination, which potentially exclude effect wide-scope long-range contextual cues scene. To take full advantage rich information available...

10.48550/arxiv.1606.09002 preprint EN other-oa arXiv (Cornell University) 2016-01-01

ERV-Net: An efficient 3D residual neural network for brain tumor segmentation

OPENALEX - Publications

Xinyu Zhou Xuanya Li Kai Hu Yuan Zhang Zhineng Chen and 1 more

10.1016/j.eswa.2021.114566 article EN Expert Systems with Applications 2021-01-09

EAST: An Efficient and Accurate Scene Text Detector

OPENALEX - Publications

Xinyu Zhou Cong Yao He Wen Yuzhi Wang Shuchang Zhou and 2 more

Previous approaches for scene text detection have already achieved promising performances across various benchmarks. However, they usually fall short when dealing with challenging scenarios, even equipped deep neural network models, because the overall performance is determined by interplay of multiple stages and components in pipelines. In this work, we propose a simple yet powerful pipeline that yields fast accurate natural scenes. The directly predicts words or lines arbitrary...

10.48550/arxiv.1704.03155 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Effective Quantization Methods for Recurrent Neural Networks

OPENALEX - Publications

Qinyao He He Wen Shuchang Zhou Yuxin Wu Cong Yao and 2 more

Reducing bit-widths of weights, activations, and gradients a Neural Network can shrink its storage size memory usage, also allow for faster training inference by exploiting bitwise operations. However, previous attempts quantization RNNs show considerable performance degradation when using low bit-width weights activations. In this paper, we propose methods to quantize the structure gates interlinks in LSTM GRU cells. addition, balanced further reduce degradation. Experiments on PTB IMDB...

10.48550/arxiv.1611.10176 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Determination of beam oscillating pattern for tailoring melt flow and microstructural characteristics of laser welded Ti–6Al–4V alloy

OPENALEX - Publications

Yunhao Liu Yue Li Xinyu Zhou Chao Ma Yanqiu Zhao and 2 more

10.1016/j.ijthermalsci.2024.109156 article EN International Journal of Thermal Sciences 2024-05-17

An integer encoding grey wolf optimizer for virtual network function placement

OPENALEX - Publications

Huanlai Xing Xinyu Zhou Xinhan Wang Shouxi Luo Penglin Dai and 2 more

10.1016/j.asoc.2018.12.037 article EN Applied Soft Computing 2019-01-04

Structure and mechanical property modification of a Ti-based metallic glass by ion irradiation

OPENALEX - Publications

Yongjiang Huang Hongbo Fan Xinyu Zhou Peng Xue Zhiliang Ning and 3 more

10.1016/j.scriptamat.2015.03.002 article EN Scripta Materialia 2015-03-25

Exploiting Local Structures with the Kronecker Layer in Convolutional Networks

OPENALEX - Publications

Shuchang Zhou Jianan Wu Yuxin Wu Xinyu Zhou

In this paper, we propose and study a technique to reduce the number of parameters computation time in convolutional neural networks. We use Kronecker product exploit local structures within convolution fully-connected layers, by replacing large weight matrices combinations multiple products smaller matrices. Just as is generalization outer from vectors matrices, our method low rank approximation for also introduce different shapes increase modeling capacity. Experiments on SVHN, scene text...

10.48550/arxiv.1512.09194 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Feature Space Singularity for Out-of-Distribution Detection

OPENALEX - Publications

Haiwen Huang Zhihan Li Lulu Wang Sishuo Chen Bin Dong and 1 more

Out-of-Distribution (OoD) detection is important for building safe artificial intelligence systems. However, current OoD methods still cannot meet the performance requirements practical deployment. In this paper, we propose a simple yet effective algorithm based on novel observation: in trained neural network, samples with bounded norms well concentrate feature space. We call center of features Feature Space Singularity (FSS), and denote distance sample to FSS as FSSD. Then, can be...

10.48550/arxiv.2011.14654 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Judicial Waves, Ethical Shifts: Bankruptcy Courts and Corporate ESG Performance

OPENALEX - Publications

Xinyu Zhou Zixun Zhou

10.5465/amproc.2024.20169abstract article EN Academy of Management Proceedings 2024-07-09

Resource Allocation for the Training of Image Semantic Communication Networks

OPENALEX - Publications

Yang Li Xinyu Zhou Jun Zhao

Semantic communication is a new paradigm that aims at providing more efficient for the next-generation wireless network. It focuses on transmitting extracted, meaningful information instead of raw data. However, deep learning-enabled image semantic models often require significant amount time and energy training, which unacceptable, especially mobile devices. To solve this challenge, our paper first introduces distributed system where base station local devices will collaboratively train...

10.48550/arxiv.2501.04408 preprint EN arXiv (Cornell University) 2025-01-08

Adaptive Global Dense Nested Reasoning Network into Small Target Detection in Large-Scale Hyperspectral Remote Sensing Image

OPENALEX - Publications

Siyu Zhan Yuxuan Yang Mingrui Zhong Guoming Lu Xinyu Zhou

Small and dim target detection is a critical challenge in hyperspectral remote sensing, particularly complex, large-scale scenes where spectral variability across diverse land cover types complicates the process. In this paper, we propose novel reasoning algorithm named Adaptive Global Dense Nested Reasoning Network (AGDNR). This integrates spatial, spectral, domain knowledge to enhance accuracy of small targets environments simultaneously enables about categories. The proposed method...

10.3390/rs17060948 article EN cc-by Remote Sensing 2025-03-07

MFT-Reasoning RCNN: A Novel Multi-Stage Feature Transfer Based Reasoning RCNN for Synthetic Aperture Radar (SAR) Ship Detection

OPENALEX - Publications

Siyu Zhan Manli Zhong Yuxuan Yang Guoming Lu Xinyu Zhou

Conventional ship detection using synthetic aperture radar (SAR) is typically limited to fully focused spatial features of the target in SAR images. In this paper, we propose a multi-stage feature transfer (MFT)-based reasoning RCNN (MFT-Reasoning RCNN) detect ships This algorithm can MFT strategy and adaptive global module over all object regions by exploiting diverse knowledge between its surrounding elements. Specifically, first calculate probability simultaneous occurrence environmental...

10.3390/rs17071170 article EN cc-by Remote Sensing 2025-03-26

Learning to Deblur Polarized Images

OPENALEX - Publications

Chu Zhou Minggui Teng Xinyu Zhou Chao Xu Imari Sato and 1 more

10.1007/s11263-025-02459-7 article EN International Journal of Computer Vision 2025-05-19

Deep Low-Rank and Sparse Patch-Image Network for Infrared Dim and Small Target Detection

OPENALEX - Publications

Xinyu Zhou Peng Li Ye Zhang Xin Lu Yue Hu

Detection of infrared dim and small targets with diverse cluttered background plays a significant role in many applications. In this paper, we propose deep low-rank sparse patch-image network, termed as Deep-LSP-Net, to effectively detect single image. Specifically, by using the local patch construction scheme, first transform original image into patch-image, which can be decomposed superposition component target component. The detection is thus formulated an optimization problem...

10.1109/tgrs.2023.3288574 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

ICDAR 2015 Text Reading in the Wild Competition

OPENALEX - Publications

Xinyu Zhou Shuchang Zhou Cong Yao Zhimin Cao Qi Yin

Recently, text detection and recognition in natural scenes are becoming increasing popular the computer vision community as well document analysis community. However, majority of existing ideas, algorithms systems specifically designed for English. This technical report presents final results ICDAR 2015 Text Reading Wild (TRW 2015) competition, which aims at establishing a benchmark assessing devised both Chinese English scripts providing playground researchers from In this article, we...

10.48550/arxiv.1506.03184 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Learning Delicate Local Representations for Multi-Person Pose Estimation

OPENALEX - Publications

Yuanhao Cai Zhicheng Wang Zhengxiong Luo Binyi Yin Angang Du and 5 more

In this paper, we propose a novel method called Residual Steps Network (RSN). RSN aggregates features with the same spatial size (Intra-level features) efficiently to obtain delicate local representations, which retain rich low-level information and result in precise keypoint localization. Additionally, observe output contribute differently final performance. To tackle problem, an efficient attention mechanism - Pose Refine Machine (PRM) make trade-off between global representations further...

10.48550/arxiv.2003.04030 preprint EN other-oa arXiv (Cornell University) 2020-01-01