- Domain Adaptation and Few-Shot Learning
- Advanced Neural Network Applications
- Sparse and Compressive Sensing Techniques
- Stochastic Gradient Optimization Techniques
- Multimodal Machine Learning Applications
- Natural Language Processing Techniques
- Topic Modeling
- Advanced Image and Video Retrieval Techniques
- Machine Learning and Data Classification
- Advanced Graph Neural Networks
- Visual Attention and Saliency Detection
- Music and Audio Processing
- Reconstructive Surgery and Microvascular Techniques
- Adversarial Robustness in Machine Learning
- Generative Adversarial Networks and Image Synthesis
- Neural Networks and Applications
- Speech Recognition and Synthesis
- Human Pose and Action Recognition
- Machine Learning and ELM
- Tensor Decomposition and Applications
- Speech and Audio Processing
- Text and Document Classification Technologies
- Anomaly Detection Techniques and Applications
- Speech and Dialogue Systems
- Wound Healing and Treatments
Huazhong University of Science and Technology
2018-2025
Tianjin Medical University
2025
Singapore Management University
2023-2025
The Fourth People's Hospital of Ningxia Hui Autonomous Region
2020-2024
Chongqing University
2024
University of Chinese Academy of Sciences
2021-2024
Shenzhen Institutes of Advanced Technology
2021-2024
Shanghai Institute of Optics and Fine Mechanics
2023
Shanghai Institute of Technical Physics
2023
Union Hospital
2023
Transformers have shown great potential in computer vision tasks. A common belief is that their attention-based token mixer module contributes most to their competence. However, recent works show that the attention modules in transformers can be replaced by spatial MLPs and the resulting models still perform quite well. Based on this observation, we hypothesize that the general architecture of transformers, instead of the specific token mixer module, is more essential to the model's performance. To verify this, we deliberately replace the attention module with an embarrassingly...
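The token-mixer replacement the abstract describes can be illustrated with a minimal sketch. This assumes a PoolFormer-style mixer (windowed average pooling minus the input, so the residual branch carries the identity); the function name `pooling_token_mixer` and the edge-padding choice are illustrative, not the official implementation:

```python
import numpy as np

def pooling_token_mixer(x, pool_size=3):
    """Average-pool each token's spatial neighborhood and subtract the
    input. x has shape (H, W, C); a naive loop version for clarity."""
    H, W, C = x.shape
    pad = pool_size // 2
    padded = np.pad(x, ((pad, pad), (pad, pad), (0, 0)), mode="edge")
    out = np.empty_like(x)
    for i in range(H):
        for j in range(W):
            window = padded[i:i + pool_size, j:j + pool_size]
            out[i, j] = window.mean(axis=(0, 1))
    # Subtracting the input means the module only *mixes* tokens;
    # the surrounding residual connection supplies the identity path.
    return out - x
```

On a constant feature map the mixer outputs zeros, which makes the "mixing only" role of the module easy to check.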
This paper presents Prototypical Contrastive Learning (PCL), an unsupervised representation learning method that addresses the fundamental limitations of instance-wise contrastive learning. PCL not only learns low-level features for the task of instance discrimination, but more importantly, it implicitly encodes the semantic structure of the data into the learned embedding space. Specifically, we introduce prototypes as latent variables to help find the maximum-likelihood estimation of the network parameters in...
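The prototype idea can be sketched with a toy clustering step: cluster normalized embeddings and treat centroids as prototypes. This is a stand-in under stated assumptions (naive spherical k-means in NumPy; PCL itself runs large-scale k-means, and `kmeans_prototypes` is a hypothetical helper):

```python
import numpy as np

def kmeans_prototypes(embeddings, k, iters=10):
    """Cluster L2-normalized embeddings with spherical k-means and
    return the centroids as prototypes plus the cluster assignments."""
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    centers = z[:k].copy()  # naive init; real code would randomize
    assign = np.zeros(len(z), dtype=int)
    for _ in range(iters):
        assign = np.argmax(z @ centers.T, axis=1)  # cosine similarity
        for c in range(k):
            members = z[assign == c]
            if len(members):
                centers[c] = members.mean(axis=0)
                centers[c] /= np.linalg.norm(centers[c])
    return centers, assign
```

Each returned centroid plays the role of a latent prototype that a contrastive loss could pull its cluster members toward.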
Recently, a tensor nuclear norm (TNN) based method was proposed to solve the tensor completion problem, and it has achieved state-of-the-art performance on image and video inpainting tasks. However, it requires computing the tensor singular value decomposition (t-SVD), which costs much computation and thus cannot efficiently handle tensor data, due to its natural large scale. Motivated by TNN, we propose a novel low-rank tensor factorization method for solving the 3-way tensor completion problem. Our method preserves the low-rank structure of a tensor by factorizing it into the product of two tensors of smaller...
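The tensor product underlying both t-SVD and this factorization is the t-product, computable slice-wise in the Fourier domain. A minimal sketch (the function name `t_product` is ours; no optimization or completion logic is shown):

```python
import numpy as np

def t_product(A, B):
    """t-product of 3-way tensors: FFT along the third mode, then a
    matrix product per frequency slice, then an inverse FFT."""
    Af = np.fft.fft(A, axis=2)
    Bf = np.fft.fft(B, axis=2)
    n3 = A.shape[2]
    Cf = np.empty((A.shape[0], B.shape[1], n3), dtype=complex)
    for k in range(n3):
        Cf[:, :, k] = Af[:, :, k] @ Bf[:, :, k]
    return np.real(np.fft.ifft(Cf, axis=2))
```

A quick sanity check: the identity tensor (identity matrix in the first frontal slice, zeros elsewhere) leaves any conformable tensor unchanged under the t-product.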
Most existing subspace clustering methods hinge on self-expression of handcrafted representations and are unaware of potential clustering errors. Thus they perform unsatisfactorily on real data with complex underlying subspaces. To solve this issue, we propose a novel deep adversarial subspace clustering (DASC) model, which learns more favorable sample representations by deep learning for subspace clustering, and more importantly introduces adversarial learning to supervise representation learning and clustering. Specifically, DASC consists of a subspace clustering generator and a quality-verifying discriminator, which learn against...
Background: Severe patients with 2019 novel coronavirus (2019-nCoV) pneumonia progressed rapidly to acute respiratory failure. We aimed to evaluate the definite efficacy and safety of corticosteroid in the treatment of severe COVID-19 pneumonia. Methods: Forty-six patients hospitalized at Wuhan Union Hospital from January 20 to February 25, 2020, were retrospectively reviewed. The patients were divided into two groups based on whether they received corticosteroid treatment. The clinical symptoms and chest computed tomography (CT) results...
Recent studies show that Transformer has strong capability of building long-range dependencies, yet is incompetent in capturing high frequencies that predominantly convey local information. To tackle this issue, we present a novel and general-purpose Inception Transformer, or iFormer for short, that effectively learns comprehensive features with both high- and low-frequency information in visual data. Specifically, we design an Inception mixer to explicitly graft the advantages of convolution and max-pooling for capturing high-frequency...
MetaFormer, the abstracted architecture of Transformer, has been found to play a significant role in achieving competitive performance. In this paper, we further explore the capacity of MetaFormer, again by migrating our focus away from token mixer design: we introduce several baseline models under MetaFormer using the most basic or common mixers, and demonstrate their gratifying performance. We summarize our observations as follows: (1) MetaFormer ensures a solid lower bound of performance. By merely adopting identity mapping as the token mixer, the model, termed...
In deep learning, different kinds of deep networks typically need different optimizers, which have to be chosen after multiple trials, making the training process inefficient. To relieve this issue and consistently improve model training speed across deep networks, we propose the ADAptive Nesterov momentum algorithm, Adan for short. Adan first reformulates the vanilla Nesterov acceleration to develop a new Nesterov momentum estimation (NME) method, which avoids the extra overhead of computing the gradient at the extrapolation point. Then Adan adopts NME to estimate the gradient's first-...
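One Adan-style parameter update can be sketched as follows. This follows our reading of the paper's update rule (moments of the gradient and of the gradient difference, an adaptive step, and decoupled weight decay); the beta values below are illustrative, not the tuned defaults:

```python
import numpy as np

def adan_step(theta, grad, prev_grad, state, lr=1e-3,
              betas=(0.02, 0.08, 0.01), eps=1e-8, wd=0.0):
    """One Adan update on a scalar/array parameter. `state` holds the
    running moments m (gradient), v (gradient difference), n (squared
    update); bias correction is omitted for brevity."""
    b1, b2, b3 = betas
    diff = grad - prev_grad
    state["m"] = (1 - b1) * state["m"] + b1 * grad       # 1st moment
    state["v"] = (1 - b2) * state["v"] + b2 * diff       # difference moment
    update_sq = (grad + (1 - b2) * diff) ** 2
    state["n"] = (1 - b3) * state["n"] + b3 * update_sq  # 2nd moment
    step = lr * (state["m"] + (1 - b2) * state["v"]) / (np.sqrt(state["n"]) + eps)
    return (theta - step) / (1 + lr * wd)                # decoupled decay
```

On a simple quadratic, iterating this step drives the parameter toward the minimum, and with zero gradients the decoupled decay alone shrinks the weights multiplicatively.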
AdamW modifies Adam by adding a decoupled weight decay to decay network weights per training iteration. For adaptive gradient algorithms, this decoupled weight decay does not affect the specific optimization steps, and differs from the widely used ℓ2-regularizer which changes the optimization steps via changing the first- and second-order gradient moments. Despite its great practical...
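The distinction the abstract draws can be made concrete with two single-step variants (a sketch: bias correction and moment-state carryover are omitted, and the function names are ours). With ℓ2 regularization the decay term flows through Adam's adaptive moments; with decoupled decay it bypasses them:

```python
import numpy as np

def adam_moments(grad, m, v, b1=0.9, b2=0.999):
    # Standard exponential moving averages of the gradient and its square.
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    return m, v

def step_l2(theta, grad, m, v, lr, lam, eps=1e-8):
    """l2-regularizer: lam*theta is folded into the gradient, so it is
    rescaled by the adaptive denominator like any other gradient term."""
    m, v = adam_moments(grad + lam * theta, m, v)
    return theta - lr * m / (np.sqrt(v) + eps)

def step_decoupled(theta, grad, m, v, lr, lam, eps=1e-8):
    """AdamW-style decay: applied directly to the weights and never
    entering the moment estimates."""
    m, v = adam_moments(grad, m, v)
    return theta - lr * m / (np.sqrt(v) + eps) - lr * lam * theta
```

With a nonzero decay the two steps land on different iterates; with zero decay they coincide, which is exactly the gap the abstract refers to.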
Multi-way or tensor data analysis has attracted increasing attention recently, with many important applications in practice. This article develops a tensor low-rank representation (TLRR) method, which is the first approach that can exactly recover the clean data of intrinsic low-rank structure and accurately cluster them as well, with provable performance guarantees. In particular, for tensor data with arbitrary sparse corruptions, TLRR can exactly recover the clean data under mild conditions; meanwhile it can verify their true origin tensor subspaces and hence cluster them accurately. The objective...
It is not clear yet why ADAM-alike adaptive gradient algorithms suffer from worse generalization performance than SGD despite their faster training speed. This work aims to provide understandings on this generalization gap by analyzing their local convergence behaviors. Specifically, we observe the heavy tails of gradient noise in these algorithms. This motivates us to analyze these algorithms through their Levy-driven stochastic differential equations (SDEs) because of the similar convergence behaviors of an algorithm and its SDE. Then we establish the escaping time of these SDEs from a...
Graph-level representations are critical in various real-world applications, such as predicting the properties of molecules. But in practice, precise graph annotations are generally very expensive and time-consuming. To address this issue, graph contrastive learning constructs an instance discrimination task which pulls together positive pairs (augmentations of the same graph) and pushes away negative pairs (augmentations of different graphs) for unsupervised representation learning. However, since for a query, its negatives are uniformly sampled...
Low-rank tensor analysis is important for various real applications in computer vision. However, existing methods focus on recovering a low-rank tensor contaminated by Gaussian or gross sparse noise and hence cannot effectively handle outliers that are common in practical data. To solve this issue, we propose an outlier-robust tensor principal component analysis (OR-TPCA) method for simultaneous low-rank tensor recovery and outlier detection. For intrinsically low-rank observations with arbitrary outlier corruption, OR-TPCA is the first method that has provable...
Feature learning plays a central role in pattern recognition. In recent years, many representation-based feature learning methods have been proposed and have achieved great success in applications. However, these methods perform feature learning and subsequent classification in two separate steps, which may not be optimal for recognition tasks. In this paper, we present a supervised low-rank-based approach for learning discriminative features. By integrating latent low-rank representation (LatLRR) with a ridge regression-based classifier, our approach combines...
Wearable sensors-based gait recognition is an effective method to recognize people's identity by recognizing the unique way they walk. Recently, the adoption of deep learning networks for gait recognition has achieved significant performance improvement and has become a new promising trend. However, most existing studies mainly focused on improving accuracy while ignoring model complexity, which makes them unsuitable for wearable devices. In this study, we proposed a lightweight attention-based Convolutional Neural Networks...
We propose to perform video question answering (VideoQA) in a Contrastive manner via a Video Graph Transformer model (CoVGT). CoVGT's uniqueness and superiority are three-fold: 1) It proposes a dynamic graph transformer module which encodes video by explicitly capturing the visual objects, their relations and dynamics, for complex spatio-temporal reasoning. 2) It designs separate video and text transformers for contrastive learning between the video and text to perform QA, instead of a multi-modal transformer for answer classification. Fine-grained video-text...
Recently, there has been increasing interest in the challenge of how to discriminatively vectorize graphs. To address this, we propose a method called Iterative Graph Self-Distillation (IGSD) which learns graph-level representations in an unsupervised manner through instance discrimination, using a self-supervised contrastive learning approach. IGSD involves a teacher-student distillation process that uses graph diffusion augmentations and constructs the teacher model as an exponential moving average of the student...
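The exponential-moving-average teacher the abstract mentions reduces to a one-line update per parameter. A minimal sketch (flat parameter lists stand in for the student and teacher networks; `ema_update` is an illustrative name):

```python
def ema_update(teacher, student, decay=0.99):
    """Blend each teacher parameter toward its student counterpart:
    teacher <- decay * teacher + (1 - decay) * student."""
    return [decay * t + (1 - decay) * s for t, s in zip(teacher, student)]
```

A higher decay makes the teacher a slower, smoother average of past student weights, which is what stabilizes the distillation targets.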