NFDI4DS | UHH-SEMS - Publication Details

SimVP: Simpler yet Better Video Prediction

OPENALEX - Publications

Zhangyang Gao Cheng Tan Lirong Wu Stan Z. Li

From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies. We admire this progress but are confused about the necessity: is there a simple method that can perform comparably well? This paper proposes SimVP, a prediction model completely built upon CNN and trained by MSE loss in an end-to-end fashion. Without introducing any additional tricks and complicated strategies,...

10.1109/cvpr52688.2022.00317 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Self-Supervised Learning on Graphs: Contrastive, Generative, or Predictive

Lirong Wu Haitao Lin Cheng Tan Zhangyang Gao Stan Z. Li

Deep learning on graphs has recently achieved remarkable success on a variety of tasks, while such success relies heavily on massive and carefully labeled data. However, precise annotations are generally very expensive and time-consuming. To address this problem, self-supervised learning (SSL) is emerging as a new paradigm for extracting informative knowledge through well-designed pretext tasks without relying on manual labels. In this survey, we extend the concept of SSL, which first emerged in the fields of computer vision and natural...

10.1109/tkde.2021.3131584 article EN IEEE Transactions on Knowledge and Data Engineering 2021-12-01

A Survey on Generative Diffusion Models

Hanqun Cao Cheng Tan Zhangyang Gao Yilun Xu Guangyong Chen and 2 more

Deep generative models have unlocked another profound realm of human creativity. By capturing and generalizing patterns within data, we have entered the epoch of all-encompassing Artificial Intelligence for General Creativity (AIGC). Notably, diffusion models, recognized as one of the paramount generative models, materialize ideation into tangible instances across diverse domains, encompassing imagery, text, speech, biology, and healthcare. To provide advanced and comprehensive insights into diffusion, this survey comprehensively...

10.1109/tkde.2024.3361474 article EN IEEE Transactions on Knowledge and Data Engineering 2024-02-02

Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning

Cheng Tan Zhangyang Gao Lirong Wu Yongjie Xu Jun Xia and 2 more

Spatiotemporal predictive learning aims to generate future frames by learning from historical frames. In this paper, we investigate existing methods and present a general framework of spatiotemporal predictive learning, in which the spatial encoder and decoder capture intra-frame features and the middle temporal module catches inter-frame correlations. While mainstream methods employ recurrent units to capture long-term dependencies, they suffer from low computational efficiency due to their unparallelizable architectures. To parallelize the temporal module,...

10.1109/cvpr52729.2023.01800 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules

Jun Xia Chengshuai Zhao Bozhen Hu Zhangyang Gao Cheng Tan and 3 more

Recent years have witnessed the prosperity of pre-training graph neural networks (GNNs) for molecules. Typically, atom types as node attributes are randomly masked and GNNs are then trained to predict the masked types, as in AttrMask (Hu et al., 2020), following the Masked Language Modeling (MLM) task of BERT (Devlin et al., 2019). However, unlike MLM, where the vocabulary is large, AttrMask does not learn informative molecular representations due to its small and unbalanced atom 'vocabulary'. To amend this problem, we propose a variant...

10.26434/chemrxiv-2023-dngg4 preprint EN cc-by-nc-nd 2023-04-13

A Survey on Generative Diffusion Model

Hanqun Cao Cheng Tan Zhangyang Gao Guangyong Chen Pheng‐Ann Heng and 1 more

Deep generative models have unlocked another profound realm of human creativity. By capturing and generalizing patterns within data, we have entered the epoch of all-encompassing Artificial Intelligence for General Creativity (AIGC). Notably, diffusion models, recognized as one of the paramount generative models, materialize ideation into tangible instances across diverse domains, encompassing imagery, text, speech, biology, and healthcare. To provide advanced and comprehensive insights into diffusion, this survey comprehensively...

10.48550/arxiv.2209.02646 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Conditional Local Convolution for Spatio-Temporal Meteorological Forecasting

Haitao Lin Zhangyang Gao Yongjie Xu Lirong Wu Ling Li and 1 more

Spatio-temporal forecasting is challenging owing to the high nonlinearity in temporal dynamics as well as complex location-characterized patterns in spatial domains, especially in fields like weather forecasting. Graph convolutions are usually used for modeling spatial dependency in meteorology to handle the irregular distribution of sensors' locations. In this work, a novel graph-based convolution imitating meteorological flows is proposed to capture local spatial patterns. Based on the assumption of smoothness of location-characterized patterns, we propose...

10.1609/aaai.v36i7.20711 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2022-06-28

MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning

Zhe Li Laurence T. Yang Bocheng Ren Xin Nie Zhangyang Gao and 2 more

10.1109/cvpr52733.2024.01112 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

SimVPv2: Towards Simple yet Powerful Spatiotemporal Predictive Learning

Cheng Tan Zhangyang Gao Siyuan Li Stan Z. Li

10.1109/tmm.2025.3543051 article EN IEEE Transactions on Multimedia 2025-01-01

Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization

Lirong Wu Haitao Lin Yufei Huang Zhangyang Gao Cheng Tan and 3 more

Antibodies are Y-shaped proteins that protect the host by binding to specific antigens, and their binding is mainly determined by the Complementary Determining Regions (CDRs) in the antibody. Despite the great progress made in CDR design, existing computational methods still encounter several challenges: 1) poor capability of modeling complex CDRs with long sequences due to insufficient contextual information; 2) being conditioned on pre-given antigenic epitopes and static interaction with the target antibody; 3) neglect of specificity...

10.1609/aaai.v39i1.32074 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning

Cheng Tan Zhangyang Gao Siyuan Li Stan Z. Li

Recent years have witnessed remarkable advances in spatiotemporal predictive learning, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies. Although impressive, the system complexity of mainstream methods is increasing as well, which may hinder convenient applications. This paper proposes SimVP, a simple baseline model that is completely built upon convolutional networks without recurrent architectures and trained by the common mean squared error loss...

10.48550/arxiv.2211.12509 preprint EN other-oa arXiv (Cornell University) 2022-01-01

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Cheng Tan Siyuan Li Zhangyang Gao Wenfei Guan Zedong Wang and 3 more

Spatio-temporal predictive learning is a paradigm that enables models to learn spatial and temporal patterns by predicting future frames from given past frames in an unsupervised manner. Despite remarkable progress in recent years, a lack of systematic understanding persists due to the diverse settings, complex implementation, and difficult reproducibility. Without standardization, comparisons can be unfair and insights inconclusive. To address this dilemma, we propose OpenSTL, a comprehensive benchmark for...

10.48550/arxiv.2306.11249 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Beyond Homophily and Homogeneity Assumption: Relation-Based Frequency Adaptive Graph Neural Networks

Lirong Wu Haitao Lin Bozhen Hu Cheng Tan Zhangyang Gao and 2 more

Graph neural networks (GNNs) have been playing important roles in various graph-related tasks. However, most existing GNNs are based on the assumption of homophily, so they cannot be directly generalized to heterophily settings where connected nodes may have different features and class labels. Moreover, real-world graphs often arise from highly entangled latent factors, but existing GNNs tend to ignore this and simply denote the heterogeneous relations between nodes as binary-valued homogeneous edges. In this article, we propose a...

10.1109/tnnls.2022.3230417 article EN IEEE Transactions on Neural Networks and Learning Systems 2023-01-06

A Teacher-Free Graph Knowledge Distillation Framework With Dual Self-Distillation

Lirong Wu Haitao Lin Zhangyang Gao Guojiang Zhao Stan Z. Li

Recent years have witnessed great success in handling graph-related tasks with Graph Neural Networks (GNNs). Despite their academic success, Multi-Layer Perceptrons (MLPs) remain the primary workhorse for practical industrial applications. One reason for such an academic-industry gap is the neighborhood-fetching latency incurred by data dependency in GNNs....

10.1109/tkde.2024.3374773 article EN IEEE Transactions on Knowledge and Data Engineering 2024-03-20

PiFold: Toward effective and efficient protein inverse folding

Zhangyang Gao Cheng Tan Stan Z. Li

How can we design protein sequences folding into the desired structures effectively and efficiently? AI methods for structure-based protein design have attracted increasing attention in recent years; however, few methods can simultaneously improve accuracy and efficiency due to the lack of expressive features and the autoregressive sequence decoder. To address these issues, we propose PiFold, which contains a novel residue featurizer and PiGNN layers to generate protein sequences in a one-shot way with improved recovery. Experiments show that PiFold could achieve...

10.48550/arxiv.2209.12643 preprint EN other-oa arXiv (Cornell University) 2022-01-01

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

Lirong Wu Yufei Huang Cheng Tan Zhangyang Gao Bozhen Hu and 3 more

Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery. Existing deep learning-based methods utilize only a single modality of protein sequences or structures and lack co-modeling of the joint distribution of the two modalities, which may lead to significant performance drops in complex real-world scenarios due to various factors, e.g., modality missing and domain shifting. More importantly, these methods model proteins at a fixed scale, neglecting more...

10.1609/aaai.v38i1.27784 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

General Point Model Pretraining with Autoencoding and Autoregressive

Zhe Li Zhangyang Gao Cheng Tan Bocheng Ren Laurence T. Yang and 1 more

10.1109/cvpr52733.2024.01980 article EN 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Hyperspherical Consistency Regularization

Cheng Tan Zhangyang Gao Lirong Wu Siyuan Li Stan Z. Li

Recent advances in contrastive learning have enlightened diverse applications across various semi-supervised fields. Jointly training supervised learning and unsupervised learning with a shared feature encoder has become a common scheme. Though it benefits from taking advantage of both feature-dependent information from self-supervised learning and label-dependent information from supervised learning, this scheme still suffers from bias in the classifier. In this work, we systematically explore the relationship between self-supervised learning and supervised learning, and study how self-supervised learning helps robust data-efficient deep learning....

10.1109/cvpr52688.2022.00710 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

One-shot Cryo-EM Complex Structure Determination with High Accuracy and Ultra-fast Speed

Jue Wang Cheng Tan Zhangyang Gao Guijun Zhang Yang Zhang and 1 more

While cryo-electron microscopy (Cryo-EM) yields high-resolution density maps for complex structures, accurate determination of the corresponding three-dimensional atomic structures still necessitates significant expertise and labor-intensive manual interpretation. Recently, AI-based methods have emerged to streamline this process in the biological community; however, several challenges persist. First, existing methods typically require multi-stage training and inference, causing...

10.21203/rs.3.rs-5776842/v1 preprint EN Research Square (Research Square) 2025-02-05

G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models

Mengdi Liu Zhangyang Gao Hong Chang Stan Z. Li Shiguang Shan and 1 more

Discovering the genotype-phenotype relationship is crucial for genetic engineering, which will facilitate advances in fields such as crop breeding, conservation biology, and personalized medicine. Current research usually focuses on single species and small datasets due to limitations in phenotypic data collection, especially for traits that require visual assessments or physical measurements. Deciphering complex and composite phenotypes, such as morphology, from genetic data at scale remains an open question. To break through...

10.48550/arxiv.2502.04684 preprint EN arXiv (Cornell University) 2025-02-07

FoldToken: Learning Protein Language via Vector Quantization and Beyond

Zhangyang Gao Cheng Tan Jue Wang Yufei Huang Lirong Wu and 1 more

Is there a foreign language describing protein sequences and structures simultaneously? Protein structures, represented by continuous 3D points, have long posed a challenge due to the contrasting modeling paradigms of discrete sequences. We introduce FoldTokenizer to represent protein sequence-structure as discrete symbols. This approach involves projecting residue types and structures into a discrete space, guided by a reconstruction loss for information preservation. We name the learned symbols FoldToken, and the sequence of FoldTokens serves as a new protein language,...

10.1609/aaai.v39i1.31998 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

dyAb: Flow Matching for Flexible Antibody Design with AlphaFold-driven Pre-binding Antigen

Cheng Tan Yijie Zhang Zhangyang Gao Yufei Huang Haitao Lin and 4 more

The development of therapeutic antibodies heavily relies on accurate predictions of how antigens will interact with antibodies. Existing computational methods in antibody design often overlook crucial conformational changes that antigens undergo during the binding process, significantly impacting the reliability of the resulting antibodies. To bridge this gap, we introduce dyAb, a flexible framework that incorporates AlphaFold2-driven predictions to model pre-binding antigen structures and specifically addresses the dynamic nature of antigen conformation...

10.1609/aaai.v39i1.32061 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

USTEP: Spatio-Temporal Predictive Learning under a Unified View

Cheng Tan Jue Wang Zhangyang Gao Siyuan Li Stan Z. Li

Spatio-temporal predictive learning plays a crucial role in self-supervised learning, with wide-ranging applications across a diverse range of fields. Previous approaches for temporal modeling fall into two categories: recurrent-based and recurrent-free methods. The former, while meticulously processing frames one by one, neglect short-term spatio-temporal information redundancies, leading to inefficiencies. The latter naively stack frames sequentially, overlooking the inherent temporal dependencies. In this...

10.1109/tpami.2025.3566420 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2025-01-01