Zhuo Su

ORCID: 0000-0002-6448-0651
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Domain Adaptation and Few-Shot Learning
  • Video Surveillance and Tracking Methods
  • COVID-19 diagnosis using AI
  • Face recognition and analysis
  • Remote-Sensing Image Classification
  • 3D Shape Modeling and Analysis
  • Adversarial Robustness in Machine Learning
  • Human Pose and Action Recognition
  • Image Enhancement Techniques
  • Biometric Identification and Security
  • User Authentication and Security Systems
  • Medical Image Segmentation Techniques
  • Advanced Image Fusion Techniques
  • Face and Expression Recognition
  • Industrial Vision Systems and Defect Detection
  • Brain Tumor Detection and Classification
  • Human Motion and Animation
  • Neural Networks and Applications
  • Urban Heat Island Mitigation
  • Micro and Nano Robotics
  • Soft Robotics and Applications
  • Multimodal Machine Learning Applications
  • Visual Attention and Saliency Detection

Nankai University
2023-2024

University of Oulu
2019-2024

Southern University of Science and Technology
2024

University of Amsterdam
2022

Wuhan University of Technology
2014-2019

Sun Yat-sen University
2016-2019

China Southern Power Grid (China)
2018

Face anti-spoofing (FAS) plays a vital role in face recognition systems. Most state-of-the-art FAS methods 1) rely on stacked convolutions and expert-designed network, which is weak describing detailed fine-grained information easily being ineffective when the environment varies (e.g., different illumination), 2) prefer to use long sequence as input extract dynamic features, making them difficult deploy into scenarios need quick response. Here we propose novel frame level method based...

10.1109/cvpr42600.2020.00534 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract representation capacities. However, high of CNN based is achieved a large pretrained backbone, which memory energy consuming. In addition, it surprising that previous wisdom from traditional detectors, such as Canny, Sobel, LBP are rarely investigated rapid-developing learning era. To address these issues, we propose simple, lightweight yet effective...

10.1109/iccv48922.2021.00507 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Recently, there have been tremendous efforts in developing lightweight Deep Neural Networks (DNNs) with satisfactory accuracy, which can enable the ubiquitous deployment of DNNs edge devices. The core challenge compact and efficient lies how to balance competing goals achieving high accuracy efficiency. In this paper we propose two novel types convolutions, dubbed \emph{Pixel Difference Convolution (PDC) Binary PDC (Bi-PDC)} enjoy following benefits: capturing higher-order local differential...

10.1109/tpami.2023.3300513 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-08-01

Moving object detection in satellite videos (SVMOD) is a challenging task due to the extremely dim and small target characteristics. Current learning-based methods extract spatio-temporal information from multi-frame dense representation with labor-intensive manual labels tackle SVMOD, which needs high annotation costs contains tremendous computational redundancy severe imbalance between foreground background regions. In this paper, we propose highly efficient unsupervised framework for...

10.1109/tpami.2024.3409824 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-06-05

Existing Cross-Domain Few-Shot Learning (CDFSL) methods require access to source domain data train a model in the pre-training phase. However, due increasing concerns about privacy and desire reduce transmission training costs, it is necessary develop CDFSL solution without accessing data. For this reason, paper explores Source-Free (SF-CDFSL) problem, which addressed through use of existing pretrained models instead with data, avoiding lack we face two key challenges: effectively tackling...

10.1109/tip.2024.3374222 article EN IEEE Transactions on Image Processing 2024-01-01

Binary neural networks (BNNs) constrain weights and activations to +1 or -1 with limited storage computational cost, which is hardware-friendly for portable devices. Recently, BNNs have achieved remarkable progress been adopted into various fields. However, the performance of sensitive activation distribution. The existing utilized Sign function predefined learned static thresholds binarize activations. This process limits representation capacity since different samples may adapt unequal...

10.1109/icassp43922.2022.9747328 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

10.1109/cvpr52733.2024.00082 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Efficiency and robustness are increasingly needed for applications on 3D point clouds, with the ubiquitous use of edge devices in scenarios like autonomous driving robotics, which often demand real-time reliable responses. The paper tackles challenge by designing a general framework to construct learning architectures SO(3) equivariance network binarization. However, naive combination equivariant networks binarization either causes sub-optimal computational efficiency or geometric ambiguity....

10.1109/3dv57658.2022.00084 article EN 2021 International Conference on 3D Vision (3DV) 2022-09-01

Research of the clothing recommendation algorithm is important that can be used to provide a more efficient method for consumers select their expected clothing. Considering characteristics product, in this paper, personalized based on fine-grained attributes reported. In method, are established image. And preference model each user combining with and personal parameters built. This an application system client/server framework mobile phone software Android platform.

10.1109/icdh.2018.00046 article EN 2018-11-01

Face anti-spoofing (FAS) plays a vital role in face recognition systems. Most state-of-the-art FAS methods 1) rely on stacked convolutions and expert-designed network, which is weak describing detailed fine-grained information easily being ineffective when the environment varies (e.g., different illumination), 2) prefer to use long sequence as input extract dynamic features, making them difficult deploy into scenarios need quick response. Here we propose novel frame level method based...

10.48550/arxiv.2003.04092 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Recently, deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection with the rich and abstract representation capacities. However, high of CNN based is achieved a large pretrained backbone, which memory energy consuming. In addition, it surprising that previous wisdom from traditional detectors, such as Canny, Sobel, LBP are rarely investigated rapid-developing learning era. To address these issues, we propose simple, lightweight yet effective...

10.48550/arxiv.2108.07009 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Face perception is an essential and significant problem in pattern recognition, concretely including Recognition (FR), Facial Expression (FER), Race Categorization (RC). Though handcrafted features perform well on face images, Deep Convolutional Neural Networks (DCNNs) have brought new vitality to this field recently. Vanilla DCNNs are powerful at learning high-level semantic features, but weak capturing low-level image characteristic changes illumination, intensity,and texture regarded as...

10.1109/access.2021.3117955 article EN cc-by IEEE Access 2021-01-01

Replacing normal convolutions with group can significantly increase the computational efficiency of modern deep convolutional networks, which has been widely adopted in compact network architecture designs. However, existing undermine original structures by cutting off some connections permanently resulting significant accuracy degradation. In this paper, we propose dynamic convolution (DGC) that adaptively selects part input channels to be connected within each for individual samples on...

10.48550/arxiv.2007.04242 preprint EN other-oa arXiv (Cornell University) 2020-01-01

As is well-known, defects precisely affect the lives and functions of machines in which they occur, even cause potentially catastrophic casualties. Therefore, quality assessment before mounting an indispensable requirement for factories. Apart from recognition accuracy, current networks suffer excessive computing complexity, making it great difficulty to deploy manufacturing process. To address these issues, this paper introduces binary into area surface defect detection first time, reason...

10.3390/s21206868 article EN cc-by Sensors 2021-10-16

This article proposes a novel module called middle spectrum grouped convolution (MSGC) for efficient deep convolutional neural networks (DCNNs) with the mechanism of convolution. It explores broad "middle spectrum" area between channel pruning and conventional Compared pruning, MSGC can retain most information from input feature maps due to group mechanism; compared convolution, benefits learnability, core constructing its topology, leading better division. The is unfolded along four...

10.1109/tnnls.2024.3355489 article EN IEEE Transactions on Neural Networks and Learning Systems 2024-02-08

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications broader scenarios. To tackle these issues, we present HumanSplat which predicts 3D Gaussian Splatting properties of any from a single input image generalizable manner. In particular, comprises 2D multi-view diffusion model and latent transformer with structure priors that adeptly...

10.48550/arxiv.2406.12459 preprint EN arXiv (Cornell University) 2024-06-18

In this paper, we present a novel 3D head avatar creation approach capable of generalizing from few-shot in-the-wild data with high-fidelity and animatable robustness. Given the underconstrained nature problem, incorporating prior knowledge is essential. Therefore, propose framework comprising learning phases. The phase leverages priors derived large-scale multi-view dynamic dataset, applies these for personalization. Our effectively captures by utilizing Gaussian Splatting-based...

10.48550/arxiv.2408.06019 preprint EN arXiv (Cornell University) 2024-08-12

PM2.5 is an important indicator of the severity air pollution and its level can be predicted through hazy photographs caused by degradation. Image-based estimation thus extensively employed in various multimedia applications but challenging because ill-posed property. In this paper, we convert it to problem estimating PM2.5-relevant haze transmission propose a learning model called filtering network. Different from most methods that generate map directly image, our takes coarse derived dark...

10.1109/icme.2019.00054 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2019-07-01

The paper analyzed the influence of friction factor theoretically on brake system to produce noise, through complex modal analysis method, established finite element model air disc analyze and forecast noise get noises frequency a certain test conditions. Through multiple sets under different coefficient, it is concluded that increase coefficient has promoting effect noise.

10.4028/www.scientific.net/amm.494-495.42 article EN Applied Mechanics and Materials 2014-02-06
Coming Soon ...