NFDI4DS | UHH-SEMS - Publication Details

Searching Central Difference Convolutional Networks for Face Anti-Spoofing

OPENALEX - Publications

Zitong Yu Chenxu Zhao Zezheng Wang Yunxiao Qin Zhuo Su and 3 more

Face anti-spoofing (FAS) plays a vital role in face recognition systems. Most state-of-the-art FAS methods 1) rely on stacked convolutions and expert-designed network, which is weak describing detailed fine-grained information easily being ineffective when the environment varies (e.g., different illumination), 2) prefer to use long sequence as input extract dynamic features, making them difficult deploy into scenarios need quick response. Here we propose novel frame level method based...

10.1109/cvpr42600.2020.00534 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Pixel Difference Networks for Efficient Edge Detection

OPENALEX - Publications

Zhuo Su Wenzhe Liu Zitong Yu Dewen Hu Qing Liao and 3 more

Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract representation capacities. However, high of CNN based is achieved a large pretrained backbone, which memory energy consuming. In addition, it surprising that previous wisdom from traditional detectors, such as Canny, Sobel, LBP are rarely investigated rapid-developing learning era. To address these issues, we propose simple, lightweight yet effective...

10.1109/iccv48922.2021.00507 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Lightweight Pixel Difference Networks for Efficient Visual Representation Learning

OPENALEX - Publications

Zhuo Su Jiehua Zhang Longguang Wang Hua Zhang Zhen Liu and 2 more

Recently, there have been tremendous efforts in developing lightweight Deep Neural Networks (DNNs) with satisfactory accuracy, which can enable the ubiquitous deployment of DNNs edge devices. The core challenge compact and efficient lies how to balance competing goals achieving high accuracy efficiency. In this paper we propose two novel types convolutions, dubbed \emph{Pixel Difference Convolution (PDC) Binary PDC (Bi-PDC)} enjoy following benefits: capturing higher-order local differential...

10.1109/tpami.2023.3300513 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-08-01

Highly Efficient and Unsupervised Framework for Moving Object Detection in Satellite Videos

OPENALEX - Publications

Chao Xiao Wei An Yifan Zhang Zhuo Su Miao Li and 3 more

Moving object detection in satellite videos (SVMOD) is a challenging task due to the extremely dim and small target characteristics. Current learning-based methods extract spatio-temporal information from multi-frame dense representation with labor-intensive manual labels tackle SVMOD, which needs high annotation costs contains tremendous computational redundancy severe imbalance between foreground background regions. In this paper, we propose highly efficient unsupervised framework for...

10.1109/tpami.2024.3409824 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-06-05

Enhancing Information Maximization With Distance-Aware Contrastive Learning for Source-Free Cross-Domain Few-Shot Learning

OPENALEX - Publications

Huali Xu Li Liu Shuaifeng Zhi Shaojing Fu Zhuo Su and 2 more

Existing Cross-Domain Few-Shot Learning (CDFSL) methods require access to source domain data train a model in the pre-training phase. However, due increasing concerns about privacy and desire reduce transmission training costs, it is necessary develop CDFSL solution without accessing data. For this reason, paper explores Source-Free (SF-CDFSL) problem, which addressed through use of existing pretrained models instead with data, avoiding lack we face two key challenges: effectively tackling...

10.1109/tip.2024.3374222 article EN IEEE Transactions on Image Processing 2024-01-01

Dynamic Binary Neural Network by Learning Channel-Wise Thresholds

OPENALEX - Publications

Jiehua Zhang Zhuo Su Yanghe Feng Xin Lü Matti Pietikäinen and 1 more

Binary neural networks (BNNs) constrain weights and activations to +1 or -1 with limited storage computational cost, which is hardware-friendly for portable devices. Recently, BNNs have achieved remarkable progress been adopted into various fields. However, the performance of sensitive activation distribution. The existing utilized Sign function predefined learned static thresholds binarize activations. This process limits representation capacity since different samples may adapt unequal...

10.1109/icassp43922.2022.9747328 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

OPENALEX - Publications

Xiaozheng Zheng Chao Wen Zhuo Su Zeran Xu Zhaohu Li and 2 more

10.1109/cvpr52733.2024.00082 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Deep ladder reconstruction-classification network for unsupervised domain adaptation

OPENALEX - Publications

Wanxia Deng Zhuo Su Qiang Qiu Lingjun Zhao Gangyao Kuang and 3 more

10.1016/j.patrec.2021.10.009 article EN Pattern Recognition Letters 2021-10-13

SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation

OPENALEX - Publications

Zhuo Su Max Welling Matti Pietikäinen Li Liu

Efficiency and robustness are increasingly needed for applications on 3D point clouds, with the ubiquitous use of edge devices in scenarios like autonomous driving robotics, which often demand real-time reliable responses. The paper tackles challenge by designing a general framework to construct learning architectures SO(3) equivariance network binarization. However, naive combination equivariant networks binarization either causes sub-optimal computational efficiency or geometric ambiguity....

10.1109/3dv57658.2022.00084 article EN 2021 International Conference on 3D Vision (3DV) 2022-09-01

Research and Implementation of Personalized Clothing Recommendation Algorithm

OPENALEX - Publications

Qianqian Deng Ruomei Wang Zixiao Gong Guifeng Zheng Zhuo Su

Research of the clothing recommendation algorithm is important that can be used to provide a more efficient method for consumers select their expected clothing. Considering characteristics product, in this paper, personalized based on fine-grained attributes reported. In method, are established image. And preference model each user combining with and personal parameters built. This an application system client/server framework mobile phone software Android platform.

10.1109/icdh.2018.00046 article EN 2018-11-01

Searching Central Difference Convolutional Networks for Face Anti-Spoofing

OPENALEX - Publications

Zitong Yu Chenxu Zhao Zezheng Wang Yunxiao Qin Zhuo Su and 3 more

Face anti-spoofing (FAS) plays a vital role in face recognition systems. Most state-of-the-art FAS methods 1) rely on stacked convolutions and expert-designed network, which is weak describing detailed fine-grained information easily being ineffective when the environment varies (e.g., different illumination), 2) prefer to use long sequence as input extract dynamic features, making them difficult deploy into scenarios need quick response. Here we propose novel frame level method based...

10.48550/arxiv.2003.04092 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Pixel Difference Networks for Efficient Edge Detection

OPENALEX - Publications

Zhuo Su Wenzhe Liu Zitong Yu Dewen Hu Qing Liao and 3 more

Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract representation capacities. However, high of CNN based is achieved a large pretrained backbone, which memory energy consuming. In addition, it surprising that previous wisdom from traditional detectors, such as Canny, Sobel, LBP are rarely investigated rapid-developing learning era. To address these issues, we propose simple, lightweight yet effective...

10.48550/arxiv.2108.07009 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Beyond Vanilla Convolution: Random Pixel Difference Convolution for Face Perception

OPENALEX - Publications

Wenzhe Liu Zhuo Su Li Liu

Face perception is an essential and significant problem in pattern recognition, concretely including Recognition (FR), Facial Expression (FER), Race Categorization (RC). Though handcrafted features perform well on face images, Deep Convolutional Neural Networks (DCNNs) have brought new vitality to this field recently. Vanilla DCNNs are powerful at learning high-level semantic features, but weak capturing low-level image characteristic changes illumination, intensity,and texture regarded as...

10.1109/access.2021.3117955 article EN cc-by IEEE Access 2021-01-01

Dynamic Group Convolution for Accelerating Convolutional Neural Networks

OPENALEX - Publications

Zhuo Su Linpu Fang Wenxiong Kang Dewen Hu Matti Pietikäinen and 1 more

Replacing normal convolutions with group can significantly increase the computational efficiency of modern deep convolutional networks, which has been widely adopted in compact network architecture designs. However, existing undermine original structures by cutting off some connections permanently resulting significant accuracy degradation. In this paper, we propose dynamic convolution (DGC) that adaptively selects part input channels to be connected within each for individual samples on...

10.48550/arxiv.2007.04242 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Binary Neural Network for Automated Visual Surface Defect Detection

OPENALEX - Publications

Wenzhe Liu Jiehua Zhang Zhuo Su Zhongzhu Zhou Li Liu

As is well-known, defects precisely affect the lives and functions of machines in which they occur, even cause potentially catastrophic casualties. Therefore, quality assessment before mounting an indispensable requirement for factories. Apart from recognition accuracy, current networks suffer excessive computing complexity, making it great difficulty to deploy manufacturing process. To address these issues, this paper introduces binary into area surface defect detection first time, reason...

10.3390/s21206868 article EN cc-by Sensors 2021-10-16

Boosting Convolutional Neural Networks With Middle Spectrum Grouped Convolution

OPENALEX - Publications

Zhuo Su Jiehua Zhang Tianpeng Liu Zhen Liu Shuanghui Zhang and 2 more

This article proposes a novel module called middle spectrum grouped convolution (MSGC) for efficient deep convolutional neural networks (DCNNs) with the mechanism of convolution. It explores broad "middle spectrum" area between channel pruning and conventional Compared pruning, MSGC can retain most information from input feature maps due to group mechanism; compared convolution, benefits learnability, core constructing its topology, leading better division. The is unfolded along four...

10.1109/tnnls.2024.3355489 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-02-08

HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

OPENALEX - Publications

Panwang Pan Zhuo Su Chenguo Lin Zhen Fan Yongjie Zhang and 4 more

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications broader scenarios. To tackle these issues, we present HumanSplat which predicts 3D Gaussian Splatting properties of any from a single input image generalizable manner. In particular, comprises 2D multi-view diffusion model and latent transformer with structure priors that adeptly...

10.48550/arxiv.2406.12459 preprint EN arXiv (Cornell University) 2024-06-18

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

OPENALEX - Publications

Xiaozheng Zheng Chao Wen Zhecheng Li Weiyi Zhang Zhuo Su and 7 more

In this paper, we present a novel 3D head avatar creation approach capable of generalizing from few-shot in-the-wild data with high-fidelity and animatable robustness. Given the underconstrained nature problem, incorporating prior knowledge is essential. Therefore, propose framework comprising learning phases. The phase leverages priors derived large-scale multi-view dynamic dataset, applies these for personalization. Our effectively captures by utilizing Gaussian Splatting-based...

10.48550/arxiv.2408.06019 preprint EN arXiv (Cornell University) 2024-08-12

Autonomous Navigation of Soft Rolling Microrobots under a Helmholtz Coil System across Fields of View using Image Stitching

OPENALEX - Publications

Lijun Fang Min Jun Kim Zhuo Su U Kei Cheang

10.1109/lra.2024.3504234 article EN IEEE Robotics and Automation Letters 2024-01-01

An Efficient and Privacy-Preserving Framework for Cloud-Assisted Mobile Deep Learning with Filter Pruning and Adversarial Mechanisms

OPENALEX - Publications

Panpan Zheng Zhuo Su Yongquan Xue Feng Zhou Chong Peng

10.2139/ssrn.5070623 preprint EN 2024-01-01

Learning Transmission Filtering Network for Image-Based Pm2.5 Estimation

OPENALEX - Publications

Yinghong Liao Bin Qiu Zhuo Su Ruomei Wang Xiangjian He

PM2.5 is an important indicator of the severity air pollution and its level can be predicted through hazy photographs caused by degradation. Image-based estimation thus extensively employed in various multimedia applications but challenging because ill-posed property. In this paper, we convert it to problem estimating PM2.5-relevant haze transmission propose a learning model called filtering network. Different from most methods that generate map directly image, our takes coarse derived dark...

10.1109/icme.2019.00054 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2019-07-01

A Study on Brake Noise Using the Complex Modal Analysis Method

OPENALEX - Publications

Ye Tao Zhuo Su Sheng Bao Lu

The paper analyzed the influence of friction factor theoretically on brake system to produce noise, through complex modal analysis method, established finite element model air disc analyze and forecast noise get noises frequency a certain test conditions. Through multiple sets under different coefficient, it is concluded that increase coefficient has promoting effect noise.

10.4028/www.scientific.net/amm.494-495.42 article EN Applied Mechanics and Materials 2014-02-06