NFDI4DS | UHH-SEMS - Publication Details

Yiheng Zhu

ORCID: 0000-0002-3857-1533

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5052610760

Research Areas

Machine Learning in Bioinformatics
RNA and protein synthesis mechanisms
Genomics and Phylogenetic Studies
Computational Drug Discovery Methods
Protein Structure and Dynamics
Bacterial Genetics and Biotechnology
Antibiotic Resistance in Bacteria
Human Pose and Action Recognition
Advanced Vision and Imaging
Generative Adversarial Networks and Image Synthesis
Bioinformatics and Genomic Networks
Diabetic Foot Ulcer Assessment and Management
Monoclonal and Polyclonal Antibodies Research
Image Enhancement Techniques
Human Motion and Animation
Computer Graphics and Visualization Techniques
Machine Learning in Materials Science
vaccines and immunoinformatics approaches
Multimodal Machine Learning Applications
Circular RNAs in diseases
Respiratory viral infections research
2D Materials and Applications
Gut microbiota and health
Genomics and Chromatin Dynamics
MXene and MAX Phase Materials

Nanjing Agricultural University
2023-2025

Suzhou Municipal Hospital
2024-2025

Shenzhen University
2024-2025

Nanjing University of Science and Technology
2019-2024

First Hospital of Jilin University
2023-2024

Jilin University
2023-2024

Zhejiang University
2024

Zhejiang University of Science and Technology
2023

University of Geneva
2023

Johns Hopkins University
2023

Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition

OPENALEX - Publications

Tianlang Chen Chen Fang Xiaohui Shen Yiheng Zhu Zhili Chen and 1 more

In this work, we propose a new solution to 3D human pose estimation in videos. Instead of directly regressing the joint locations, draw inspiration from skeleton anatomy and decompose task into bone direction prediction length prediction, which locations can be completely derived. Our motivation is fact that lengths remain consistent across time. This promotes us develop effective techniques utilize global information <i>all</i> frames video for high-accuracy prediction. Moreover, network,...

10.1109/tcsvt.2021.3057267 article EN publisher-specific-oa IEEE Transactions on Circuits and Systems for Video Technology 2021-02-05

Integrating unsupervised language model with triplet neural networks for protein gene ontology prediction

OPENALEX - Publications

Yiheng Zhu Chengxin Zhang Dong‐Jun Yu Yang Zhang

Accurate identification of protein function is critical to elucidate life mechanisms and design new drugs. We proposed a novel deep-learning method, ATGO, predict Gene Ontology (GO) attributes proteins through triplet neural-network architecture embedded with pre-trained language models from sequences. The method was systematically tested on 1068 non-redundant benchmarking 3328 targets the third Critical Assessment Protein Function Annotation (CAFA) challenge. Experimental results showed...

10.1371/journal.pcbi.1010793 article EN cc-by PLoS Computational Biology 2022-12-22

ULDNA: integrating unsupervised multi-source language models with LSTM-attention network for high-accuracy protein–DNA binding site prediction

OPENALEX - Publications

Yiheng Zhu Zi Liu Yan Liu Zhiwei Ji Dong‐Jun Yu

Abstract Efficient and accurate recognition of protein–DNA interactions is vital for understanding the molecular mechanisms related biological processes further guiding drug discovery. Although current experimental protocols are most precise way to determine binding sites, they tend be labor-intensive time-consuming. There an immediate need design efficient computational approaches predicting DNA-binding sites. Here, we proposed ULDNA, a new deep-learning model, deduce sites from protein...

10.1093/bib/bbae040 article EN cc-by Briefings in Bioinformatics 2024-01-22

DNAPred: Accurate Identification of DNA-Binding Sites from Protein Sequence by Ensembled Hyperplane-Distance-Based Support Vector Machines

OPENALEX - Publications

Yiheng Zhu Jun Hu Xiaoning Song Dong‐Jun Yu

Accurate identification of protein–DNA binding sites is significant for both understanding protein function and drug design. Machine-learning-based methods have been extensively used the prediction sites. However, data imbalance problem, in which number nonbinding residues (negative-class samples) far larger than that (positive-class samples), seriously restricts performance improvements machine-learning-based predictors. In this work, we designed a two-stage imbalanced learning algorithm,...

10.1021/acs.jcim.8b00749 article EN Journal of Chemical Information and Modeling 2019-04-03

Hexagonal Boron Nitride/Blue Phosphorene Heterostructure as a Promising Anode Material for Li/Na-Ion Batteries

OPENALEX - Publications

Jinna Bao Linsheng Zhu Haochi Wang Shufeng Han Yuhang Jin and 6 more

Blue phosphorene (blue-P), an allotrope of black phosphorene, is prone to oxidize under ambient conditions, which significantly hinders its incorporation in anode for Li/Na ion batteries (LIBs/NIBs). Combining blue-P and hexagonal boron nitride (h-BN) together construct h-BN/blue-P heterostructure (BN/P) can break the limitation restricted properties blue-P. By means first-principles computations, we explored potential using BN/P as material LIBs/NIBs. Our computations show that adsorption...

10.1021/acs.jpcc.8b07062 article EN The Journal of Physical Chemistry C 2018-09-25

TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning

OPENALEX - Publications

Jun Hu Xiaogen Zhou Yiheng Zhu Dong‐Jun Yu Guijun Zhang

Accurately identifying DNA-binding proteins (DBPs) from protein sequence information is an important but challenging task for function annotations. In this paper, we establish a novel computational method, named TargetDBP, accurately targeting DBPs primary sequences. four single-view features, i.e., AAC (Amino Acid Composition), PsePSSM (Pseudo Position-Specific Scoring Matrix), PsePRSA Predicted Relative Solvent Accessibility), and PsePPDBS Probabilities of DNA-Binding Sites), are first...

10.1109/tcbb.2019.2893634 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2019-01-18

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

OPENALEX - Publications

Yujun Cai Yiwei Wang Yiheng Zhu Tat‐Jen Cham Jianfei Cai and 8 more

We present a unified and flexible framework to address the generalized problem of 3D motion synthesis that covers tasks prediction, completion, interpolation, spatial-temporal recovery. Since these have different input constraints various fidelity diversity requirements, most existing approaches only cater specific task or use architectures tasks. Here we propose based on Conditional Variational Auto-Encoder (CVAE), where treat any arbitrary as masked series. Notably, by considering this...

10.1109/iccv48922.2021.01144 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis

OPENALEX - Publications

Xiaoyu Xiang Ding Liu Xiao Yang Yiheng Zhu Xiaohui Shen and 1 more

In this paper, we explore open-domain sketch-to-photo translation, which aims to synthesize a realistic photo from freehand sketch with its class label, even if the sketches of that are missing in training data. It is challenging due lack supervision and large geometric distortion between domains. To absent photos, propose framework jointly learns photo-to-sketch generation. However, generator trained fake might lead unsatisfying results when dealing classes, domain gap synthesized real...

10.1109/wacv51458.2022.00102 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022-01-01

Identifying Protein-Nucleotide Binding Residues via Grouped Multi-task Learning and Pre-trained Protein Language Models

OPENALEX - Publications

Jia‐shun Wu Yan Liu Ying Zhang Xiaoyu Wang Yan He and 3 more

The accurate identification of protein-nucleotide binding residues is crucial for protein function annotation and drug discovery. Numerous computational methods have been proposed to predict these residues, achieving remarkable performance. However, due the limited availability high variability nucleotides, predicting diverse nucleotides remains a significant challenge. To address these, we propose NucGMTL, new grouped deep multi-task learning approach designed all observed in BioLiP...

10.1021/acs.jcim.4c02092 article EN Journal of Chemical Information and Modeling 2025-01-09

Genome-wide characterization of circular RNAs in three rat models of pulmonary hypertension reveals distinct pathological patterns

OPENALEX - Publications

Gaohui Fu Lin Qiu Jun Wang Shujin Li Jinglin Tian and 17 more

Pulmonary hypertension (PH) is a devastating disease marked by elevated pulmonary artery pressure, resulting in right ventricular (RV) failure and mortality. Despite the identification of several dysregulated genes PH, involvement circular RNAs (circRNAs), subset long noncoding RNAs, remains largely unknown. In this study, high-throughput RNA sequencing was performed to analyze genome-wide expression patterns circRNAs arteries from three models PH rats induced hypoxia (Hyp),...

10.1186/s12864-025-11239-z article EN cc-by-nc-nd BMC Genomics 2025-02-10

Vaccination Status and Influencing Factors of Delayed Vaccination in Toddlers Born to Hepatitis B Surface Antigen-Positive Mothers

OPENALEX - Publications

Jintao Gao Lin Luan Yiheng Zhu Jie Zhu Zhiyuan Zhu and 3 more

Background: This study aims to analyze the vaccination status and factors influencing delayed among toddlers born hepatitis B surface antigen (HBsAg)-positive mothers. Methods: Data of HBsAg-positive mothers between 1 January 2021 31 December 2022 were provided by Suzhou Maternal Child Health Care Family Planning Service Center. The records obtained from Jiangsu Province Immunization Management Information System. Logistic regression analysis was used vaccination. Results: A total 4250...

10.3390/vaccines13030286 article EN cc-by Vaccines 2025-03-07

MKFGO: Integrating Multi-Source Knowledge Fusion with Pre-Trained Language Model for High-Accuracy Protein Function Prediction

OPENALEX - Publications

Yiheng Zhu Shenglong Zhu Xuan Yu Yan He Yan Liu and 3 more

Accurately identifying protein functions is essential to understand life mechanisms and thus advance drug discovery. Although biochemical experiments are the gold standard for determining functions, they often time-consuming labor-intensive. Here, we proposed a novel composite deep-learning method, MKFGO, infer Gene Ontology (GO) attributes through integrating five complementary pipelines built on multi-source biological data. MKFGO was rigorously benchmarked 1522 non-redundant proteins,...

10.1101/2025.03.27.645685 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2025-04-01

A Multi‐Objective Molecular Generation Method Based on Pareto Algorithm and Monte Carlo Tree Search

OPENALEX - Publications

Yifei Liu Yiheng Zhu Jike Wang Renling Hu Chao Shen and 8 more

Drug discovery faces increasing challenges in identifying novel drug candidates satisfying multiple stringent objectives, such as binding affinity, protein target selectivity, and drug-likeness. Existing optimization methods struggle with the complexity of handling numerous limiting advancements molecular design most algorithms are only effective for up to four objectives. To overcome these limitations, study introduces Pareto Monte Carlo Tree Search Molecular Generation (PMMG) method,...

10.1002/advs.202410640 article EN cc-by Advanced Science 2025-04-01

Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer

OPENALEX - Publications

Mingze Yin Hanjing Zhou Yiheng Zhu Jialu Wu Wei Wu and 6 more

Antibodies defend our health by binding to antigens with high specificity and potentiality, primarily relying on the Complementarity-Determining Region (CDR). Yet, current experimental methods of discovering new antibody CDRs are heavily time-consuming. Computational design could alleviate this burden; especially, protein language models have proven quite beneficial in many recent studies. However, most existing solely focus potentiality struggle encapsulate diverse range plausible CDR...

10.1609/aaai.v39i21.34370 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins

OPENALEX - Publications

Fang Ge Yiheng Zhu Jian Xu Muhammad Arif Jiangning Song and 1 more

Transmembrane proteins have critical biological functions and play a role in multitude of cellular processes including cell signaling, transport molecules ions across membranes. Approximately 60% transmembrane are considered as drug targets. Missense mutations such can lead to many diverse diseases disorders, neurodegenerative cystic fibrosis. However, there limited studies on proteins. In this work, we first design new feature encoding method, termed weight attenuation position-specific...

10.1016/j.csbj.2021.11.024 article EN cc-by-nc-nd Computational and Structural Biotechnology Journal 2021-01-01

PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

OPENALEX - Publications

Shuhong Chen Kevin Zhang Yichun Shi Heng Wang Yiheng Zhu and 5 more

We propose PAniC-3D, a system to reconstruct stylized 3D character heads directly from illustrated (p)ortraits of (ani)me (c)haracters. Our anime-style domain poses unique challenges single-view reconstruction; compared natural images human heads, portrait illustrations have hair and accessories with more complex diverse geometry, are shaded non-photorealistic contour lines. In addition, there is lack both model illustration data suitable train evaluate this ambiguous reconstruction task....

10.1109/cvpr52729.2023.02018 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

TargetDBP+: Enhancing the Performance of Identifying DNA-Binding Proteins via Weighted Convolutional Features

OPENALEX - Publications

Jun Hu Liang Rao Yiheng Zhu Guijun Zhang Dong‐Jun Yu

Protein–DNA interactions exist ubiquitously and play important roles in the life cycles of living cells. The accurate identification DNA-binding proteins (DBPs) is one key steps to understand mechanisms protein–DNA interactions. Although many DBP methods have been proposed, current performance still unsatisfactory. In this study, a new method, called TargetDBP+, developed further enhance identifying DBPs. five convolutional features are first extracted from feature sources, i.e., amino acid...

10.1021/acs.jcim.0c00735 article EN Journal of Chemical Information and Modeling 2021-01-07

BLAM6A-Merge: Leveraging Attention Mechanisms and Feature Fusion Strategies to Improve the Identification of RNA N6-methyladenosine Sites

OPENALEX - Publications

Yunpeng Xia Ying Zhang Dian Liu Yiheng Zhu Zhikang Wang and 2 more

RNA N6-methyladenosine is a prevalent and abundant type of modification that exerts significant influence on diverse biological processes. To date, numerous computational approaches have been developed for predicting methylation, with most them ignoring the correlations different encoding strategies failing to explore adaptability various attention mechanisms methylation identification. solve above issues, we proposed an innovative framework m6A site, termed BLAM6A-Merge. Specifically, it...

10.1109/tcbb.2024.3418490 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2024-01-01

Comparative study on the epidemiological characteristics and hazards of respiratory syncytial virus and influenza virus infections among elderly people

OPENALEX - Publications

Jiangtao Yu Na Liu Yiheng Zhu Wenyu Wang Xianquan Fan and 4 more

Abstract Objective To investigate the epidemiological characteristics and infections of respiratory syncytial virus (RSV) influenza viruses in hospitalized elderly patients with tract Suzhou City, China, to compare differences clinical economic burden associated these two infections. Methods In this prospective study, pathogenetic testing data for aged 60 years older were collected five hospitals through stratified cluster sampling from December 2023 May 2024. Comparative study on epidemic...

10.1186/s12879-024-10048-1 article EN cc-by BMC Infectious Diseases 2024-10-09

Accurate multistage prediction of protein crystallization propensity using deep-cascade forest with sequence-based features

OPENALEX - Publications

Yiheng Zhu Jun Hu Fang Ge Fuyi Li Jiangning Song and 2 more

Abstract X-ray crystallography is the major approach for determining atomic-level protein structures. Because not all proteins can be easily crystallized, accurate prediction of crystallization propensity provides critical help in guiding experimental design and improving success rate experiments. This study has developed a new machine-learning-based pipeline that uses newly deep-cascade forest (DCF) model with multiple types sequence-based features to predict propensity. Based on pipeline,...

10.1093/bib/bbaa076 article EN Briefings in Bioinformatics 2020-04-13

TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble

OPENALEX - Publications

Fang Ge Jun Hu Yiheng Zhu Muhammad Arif Dong‐Jun Yu

Missense mutation (MM) may lead to various human diseases by disabling proteins. Accurate prediction of MM is important and challenging for both protein function annotation drug design. Although several computational methods yielded acceptable success rates, there still room further enhancing the performance MM.In present study, we designed a new feature extracting method, which considers impact degree residues in microenvironment range site. Stringent cross-validation independent test on...

10.2174/1386207323666201204140438 article EN Combinatorial Chemistry & High Throughput Screening 2020-12-05

Improving protein fold recognition using triplet network and ensemble deep learning

OPENALEX - Publications

Yan Liu Ke Han Yiheng Zhu Ying Zhang Long-Chen Shen and 2 more

Protein fold recognition is a critical step toward protein structure and function prediction, aiming at providing the most likely type of query protein. In recent years, development deep learning (DL) technique has led to massive advances in this important field, accordingly, sensitivity been dramatically improved. Most DL-based methods take an intermediate bottleneck layer as feature representation proteins with new types. However, strategy indirect, inefficient conditional on hypothesis...

10.1093/bib/bbab248 article EN Briefings in Bioinformatics 2021-06-09

MAResNet: predicting transcription factor binding sites by combining multi-scale bottom-up and top-down attention and residual network

OPENALEX - Publications

Ke Han Long-Chen Shen Yiheng Zhu Jian Xu Jiangning Song and 1 more

Accurate identification of transcription factor binding sites is great significance in understanding gene expression, biological development and drug design. Although a variety methods based on deep-learning models large-scale data have been developed to predict DNA sequences, there room for further improvement prediction performance. In addition, effective interpretation greatly desirable. Here we present MAResNet, new method, predicting 690 ChIP-seq datasets. More specifically, MAResNet...

10.1093/bib/bbab445 article EN Briefings in Bioinformatics 2021-09-30

Coming Soon ...