NFDI4DS | UHH-SEMS - Publication Details

Lizhuang Ma

ORCID: 0000-0003-1653-4341

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5084218062

Research Areas

Advanced Vision and Imaging
3D Shape Modeling and Analysis
Computer Graphics and Visualization Techniques
Face recognition and analysis
Image Enhancement Techniques
Advanced Neural Network Applications
Human Pose and Action Recognition
Advanced Image and Video Retrieval Techniques
Advanced Image Processing Techniques
Generative Adversarial Networks and Image Synthesis
Domain Adaptation and Few-Shot Learning
3D Surveying and Cultural Heritage
Visual Attention and Saliency Detection
Human Motion and Animation
Advanced Numerical Analysis Techniques
Biometric Identification and Security
Multimodal Machine Learning Applications
Video Surveillance and Tracking Methods
Anomaly Detection Techniques and Applications
Video Analysis and Summarization
Robotics and Sensor-Based Localization
Face and Expression Recognition
Digital Media Forensic Detection
Remote Sensing and LiDAR Applications
Traditional Chinese Medicine Studies

Shanghai Jiao Tong University
2016-2025

East China Normal University
2017-2025

Shanghai Normal University
2025

Shanghai University
2013-2024

Chongqing Normal University
2024

Motion Control (United States)
2021

ETH Zurich
2020

Shanghai University of Traditional Chinese Medicine
2008-2013

National Rehabilitation Center
2013

Zhejiang Ocean University
2013

Contrastive Learning for Compact Single Image Dehazing

OPENALEX - Publications

Haiyan Wu Yanyun Qu Shaohui Lin Jian Zhou Ruizhi Qiao and 3 more

Single image dehazing is a challenging ill-posed problem due to the severe information degeneration. However, existing deep learning based methods only adopt clear images as positive samples guide training of network while negative unexploited. Moreover, most them focus on strengthening with an increase depth and width, leading significant requirement computation memory. In this paper, we propose novel contrastive regularization (CR) built upon exploit both hazy samples, respectively. CR...

10.1109/cvpr46437.2021.01041 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

DMT: Dynamic mutual training for semi-supervised learning

OPENALEX - Publications

Zhengyang Feng Qianyu Zhou Qiqi Gu Xin Tan Guangliang Cheng and 3 more

10.1016/j.patcog.2022.108777 article EN Pattern Recognition 2022-05-11

Rethinking Efficient Lane Detection via Curve Modeling

OPENALEX - Publications

Zhengyang Feng Shaohua Guo Xin Tan Ke Xu Min Wang and 1 more

This paper presents a novel parametric curve-based method for lane detection in RGB images. Unlike state-of-the-art segmentation-based and point detection-based methods that typically require heuristics to either decode predictions or formulate large sum of anchors, the can learn holistic representations naturally. To handle optimization difficulties existing poly-nomial curve methods, we propose exploit Bézier due its ease computation, stability, high freedom degrees transformations. In...

10.1109/cvpr52688.2022.01655 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

OPENALEX - Publications

Junshu Tang Tengfei Wang Bo Zhang Ting Zhang Ran Yi and 2 more

In this work, we investigate the problem of creating high-fidelity 3D content from only a single image. This is inherently challenging: it essentially involves estimating underlying geometry while simultaneously hallucinating unseen textures. To address challenge, leverage prior knowledge well-trained 2D diffusion model to act as 3D-aware supervision for creation. Our approach, Make-It-3D, employs two-stage optimization pipeline: first stage optimizes neural radiance field by incorporating...

10.1109/iccv51070.2023.02086 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers

OPENALEX - Publications

Qianyu Zhou Xiangtai Li Lu H Yibo Yang Guangliang Cheng and 3 more

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors. However, their on Video Object (VOD) has not well explored. In this paper, we present TransVOD, first end-to-end video system based simple yet effective spatial-temporal architectures. The goal of paper is streamline pipeline current VOD, effectively removing feature...

10.1109/tpami.2022.3223955 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-11-24

Context-Aware Mixup for Domain Adaptive Semantic Segmentation

OPENALEX - Publications

Qianyu Zhou Zhengyang Feng Qiqi Gu Jiangmiao Pang Guangliang Cheng and 3 more

Unsupervised domain adaptation (UDA) aims to adapt a model of the labeled source an unlabeled target domain. Existing UDA-based semantic segmentation approaches always reduce shifts in pixel level, feature and output level. However, almost all them largely neglect contextual dependency, which is generally shared across different domains, leading less-desired performance. In this paper, we propose novel Context-Aware Mixup (CAMix) framework for adaptive segmentation, exploits important clue...

10.1109/tcsvt.2022.3206476 article EN IEEE Transactions on Circuits and Systems for Video Technology 2022-09-14

Glass Makes Blurs: Learning the Visual Blurriness for Glass Surface Detection

OPENALEX - Publications

F. Qi Xin Tan Zhizhong Zhang Mingang Chen Yuan Xie and 1 more

Glass surface detection is challenging as glass normally borrows similar visual appearances from the arbitrary objects/scenes behind it. Although some methods have been proposed to address this problem, they may fail if reference objects are nonexistent or additional annotations missing. This article aims problem by utilizing intrinsic properties without and annotations. We observe makes blurs naturally. Based on investigation of blurriness cue, we propose a novel aggregation module model...

10.1109/tii.2024.3352232 article EN IEEE Transactions on Industrial Informatics 2024-01-19

Color transfer in correlated color space

OPENALEX - Publications

Xuezhong Xiao Lizhuang Ma

In this paper we present a process called color transfer which can borrow one image's characteristics from another. Recently Reinhard and his colleagues reported pioneering work of transfer. Their technology produce very believable results, but has to transform pixel values RGB lαβ. Inspired by their work, advise an approach directly deal with the in any 3D space.From view statistics, consider pixel's value as three-dimension stochastic variable image set samples, so correlations between...

10.1145/1128923.1128974 article EN 2006-06-14

Facial Action Unit Detection Using Attention and Relation Learning

OPENALEX - Publications

Zhiwen Shao Zhilei Liu Jianfei Cai Yunsheng Wu Lizhuang Ma

Attention mechanism has recently attracted increasing attentions in the field of facial action unit (AU) detection. By finding region interest each AU with attention mechanism, AU-related local features can be captured. Most existing based detection works use prior knowledge to predefine fixed or refine predefined within a small range, which limits their capacity model various AUs. In this paper, we propose an end-to-end deep learning and relation framework for only labels, not been explored...

10.1109/taffc.2019.2948635 article EN IEEE Transactions on Affective Computing 2019-10-24

Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification

OPENALEX - Publications

Xudong Tian Zhizhong Zhang Shaohui Lin Yanyun Qu Yuan Xie and 1 more

The Information Bottleneck (IB) provides an information theoretic principle for representation learning, by retaining all relevant predicting label while minimizing the redundancy. Though IB has been applied to a wide range of applications, its optimization remains challenging problem which heavily relies on accurate estimation mutual information. In this paper, we present new strategy, Variational Self-Distillation (VSD), scalable, flexible and analytic solution essentially fitting but...

10.1109/cvpr46437.2021.00157 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

NTIRE 2020 Challenge on NonHomogeneous Dehazing

OPENALEX - Publications

Codruta O. Ancuti Cosmin Ancuţi Florin-Alexandru Vasluianu Radu Timofte Jing Liu and 47 more

This paper reviews the NTIRE 2020 Challenge on Non-Homogeneous Dehazing of images (restoration rich details in hazy image). We focus proposed solutions and their results evaluated NH-Haze, a novel dataset consisting 55 pairs real haze free nonhomogeneous recorded outdoor. NH-Haze is first realistic that provides ground truth images. The has been produced using professional generator imitates conditions scenes. 168 participants registered challenge 27 teams competed final testing phase. gauge...

10.1109/cvprw50498.2020.00253 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

Joint Deep Multi-View Learning for Image Clustering

OPENALEX - Publications

Yuan Xie Bingqian Lin Yanyun Qu Cuihua Li Wensheng Zhang and 3 more

In this paper, a novel <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">D eep xmlns:xlink="http://www.w3.org/1999/xlink">M ulti-view xmlns:xlink="http://www.w3.org/1999/xlink">J oint xmlns:xlink="http://www.w3.org/1999/xlink">C lustering ( xmlns:xlink="http://www.w3.org/1999/xlink">DMJC ) framework is proposed, where multiple deep embedded features, multi-view fusion mechanism, and clustering assignments can be learned...

10.1109/tkde.2020.2973981 article EN IEEE Transactions on Knowledge and Data Engineering 2020-02-14

Spatiotemporal Inconsistency Learning for DeepFake Video Detection

OPENALEX - Publications

Zhihao Gu Yang Chen Taiping Yao Shouhong Ding Jilin Li and 2 more

The rapid development of facial manipulation techniques has aroused public concerns in recent years. Following the success deep learning, existing methods always formulate DeepFake video detection as a binary classification problem and develop frame-based video-based solutions. However, little attention been paid to capturing spatial-temporal inconsistency forged videos. To address this issue, we term task Spatial-Temporal Inconsistency Learning (STIL) process instantiate it into novel STIL...

10.1145/3474085.3475508 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Night-Time Scene Parsing With a Large Real Dataset

OPENALEX - Publications

Xin Tan Ke Xu Ying Cao Yiheng Zhang Lizhuang Ma and 1 more

Although huge progress has been made on scene analysis in recent years, most existing works assume the input images to be day-time with good lighting conditions. In this work, we aim address night-time parsing (NTSP) problem, which two main challenges: 1) labeled data are scarce, and 2) over- under-exposures may co-occur not explicitly modeled pipelines. To tackle scarcity of data, collect a novel dataset, named NightCity, 4,297 real ground truth pixel-level semantic annotations. our...

10.1109/tip.2021.3122004 article EN IEEE Transactions on Image Processing 2021-01-01

Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing

OPENALEX - Publications

ShuBao Liu Ke-Yue Zhang Taiping Yao Mingwei Bi Shouhong Ding and 3 more

With various face presentation attacks arising under unseen scenarios, anti-spoofing (FAS) based on domain generalization (DG) has drawn growing attention due to its robustness. Most existing methods utilize DG frameworks align the features seek a compact and generalized feature space. However, little been paid extraction process for FAS task, especially influence of normalization, which also great impact learned representation. To address this issue, we propose novel perspective that...

10.1145/3474085.3475279 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

End-to-End Video Object Detection with Spatial-Temporal Transformers

OPENALEX - Publications

Lu H Qianyu Zhou Xiangtai Li Li Niu Guangliang Cheng and 5 more

Recently, DETR and Deformable have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors. However, their on Video Object Detection (VOD) has not well explored. In this paper, we present TransVOD, an end-to-end video model based a spatial-temporal Transformer architecture. The goal of paper is streamline pipeline VOD, effectively removing feature aggregation, e.g., optical flow,...

10.1145/3474085.3475285 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Trident Dehazing Network

OPENALEX - Publications

Jing Liu Haiyan Wu Yuan Xie Yanyun Qu Lizhuang Ma

Most existing dehazing methods are not robust to nonhomogeneous haze. Meanwhile, the information of dense haze region is usually unknown and hard estimate, leading blurry in dehaze result for those regions. Focusing on these two issues, we propose a novel coarse-to-fine model, namely Trident Dehazing Network (TDN), learn hazy hazy- free image mapping with automatic density recognition. In detail, TDN composed three sub-nets: EncoderDecoder Net (EDN) main net reconstruct coarse hazy-free...

10.1109/cvprw50498.2020.00223 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

Robust Kernelized Multiview Self-Representation for Subspace Clustering

OPENALEX - Publications

Yuan Xie Jinyan Liu Yanyun Qu Dacheng Tao Wensheng Zhang and 2 more

In this article, we propose a multiview self-representation model for nonlinear subspaces clustering. By assuming that the heterogeneous features lie within union of multiple linear subspaces, recent subspace learning methods aim to capture complementary and consensus from views boost performance. However, in real-world applications, data feature usually resides leading undesirable results. To end, kernelized version tensor-based clustering, which is referred as Kt-SVD-MSC, jointly learn...

10.1109/tnnls.2020.2979685 article EN IEEE Transactions on Neural Networks and Learning Systems 2020-04-10

Low Rank Matrix Approximation for 3D Geometry Filtering

OPENALEX - Publications

Xuequan Lu Scott Schaefer Jun Luo Lizhuang Ma Ying He

We propose a robust normal estimation method for both point clouds and meshes using low rank matrix approximation algorithm. First, we compute local isotropic structure each find its similar, non-local structures that organize into matrix. then show algorithm can robustly estimate normals meshes. Furthermore, provide new filtering cloud data to smooth the position fit estimated normals. applications of our filtering, set upsampling, surface reconstruction, mesh denoising, geometric texture...

10.1109/tvcg.2020.3026785 article EN IEEE Transactions on Visualization and Computer Graphics 2020-10-01

Dual Reweighting Domain Generalization for Face Presentation Attack Detection

OPENALEX - Publications

Shubao Liu Ke-Yue Zhang Taiping Yao Kekai Sheng Shouhong Ding and 4 more

Face anti-spoofing approaches based on domain generalization (DG) have drawn growing attention due to their robustness for unseen scenarios. Previous methods treat each sample from multiple domains indiscriminately during the training process, and endeavor extract a common feature space improve generalization. However, complex biased data distribution, directly treating them equally will corrupt ability. To settle issue, we propose novel Dual Reweighting Domain Generalization (DRDG)...

10.24963/ijcai.2021/120 article EN 2021-08-01

Multi-site clustering and nested feature extraction for identifying autism spectrum disorder with resting-state fMRI

OPENALEX - Publications

Nan Wang Dongren Yao Lizhuang Ma Mingxia Liu

10.1016/j.media.2021.102279 article EN Medical Image Analysis 2021-10-22

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization

OPENALEX - Publications

Mengtian Li Yuan Xie Yunhang Shen Bo Ke Ruizhi Qiao and 3 more

To address the huge labeling cost in large-scale point cloud semantic segmentation, we propose a novel hybrid contrastive regularization (HybridCR) framework weakly-supervised setting, which obtains competitive performance compared to its fully-supervised counterpart. Specifically, HybridCR is first leverage both consistency and employ with pseudo an end-to-end manner. Fundamentally, explicitly effectively considers similarity between local neighboring points global characteristics of 3D...

10.1109/cvpr52688.2022.01451 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Uncertainty-aware consistency regularization for cross-domain semantic segmentation

OPENALEX - Publications

Qianyu Zhou Zhengyang Feng Qiqi Gu Guangliang Cheng Xuequan Lu and 2 more

10.1016/j.cviu.2022.103448 article EN Computer Vision and Image Understanding 2022-05-21

Coming Soon ...