NFDI4DS | UHH-SEMS - Publication Details

Siguo Wang

ORCID: 0000-0002-3244-3629

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5054096753

Research Areas

Machine Learning in Bioinformatics
RNA and protein synthesis mechanisms
Genomics and Chromatin Dynamics
Bioinformatics and Genomic Networks
Computational Drug Discovery Methods
Genomics and Phylogenetic Studies
vaccines and immunoinformatics approaches
RNA Research and Splicing
Single-cell and spatial transcriptomics
Gene expression and cancer classification
Circular RNAs in diseases
Cancer-related molecular mechanisms research
Genetic Mapping and Diversity in Plants and Animals
Metabolomics and Mass Spectrometry Studies
Cancer Genomics and Diagnostics
Graph Theory and Algorithms
Antimicrobial Peptides and Activities
Protein Structure and Dynamics
Machine Learning in Healthcare
Advanced Proteomics Techniques and Applications
Advanced Graph Neural Networks
Extracellular vesicles in disease
Gut microbiota and health
Multimodal Machine Learning Applications
Colorectal Cancer Screening and Detection

Eastern Institute of Technology, Ningbo
2024

Tongji University
2020-2023

Shaanxi Normal University
2017-2019

A survey on deep learning in DNA/RNA motif mining

OPENALEX - Publications

Ying He Zhen Shen Qinhu Zhang Siguo Wang De-Shuang Huang

Abstract DNA/RNA motif mining is the foundation of gene function research. The plays an extremely important role in identifying DNA- or RNA-protein binding site, which helps to understand mechanism regulation and management. For past few decades, researchers have been working on designing new efficient accurate algorithms for motif. These can be roughly divided into two categories: enumeration approach probabilistic method. In recent years, machine learning methods had made great progress,...

10.1093/bib/bbaa229 article EN cc-by-nc Briefings in Bioinformatics 2020-08-26

Locating transcription factor binding sites by fully convolutional neural network

OPENALEX - Publications

Qinhu Zhang Siguo Wang Zhan‐Heng Chen Ying He Qi Liu and 1 more

Abstract Transcription factors (TFs) play an important role in regulating gene expression, thus identification of the regions bound by them has become a fundamental step for molecular and cellular biology. In recent years, increasing number deep learning (DL) based methods have been proposed predicting TF binding sites (TFBSs) achieved impressive prediction performance. However, these mainly focus on sequence specificity TF-DNA binding, which is equivalent to sequence-level binary...

10.1093/bib/bbaa435 article EN cc-by-nc Briefings in Bioinformatics 2020-12-30

Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture

OPENALEX - Publications

Siguo Wang Qinhu Zhang Zhen Shen Ying He Zhan‐Heng Chen and 2 more

The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure nucleotides plays an important role improving accuracy and interpretability transcription factor binding sites (TFBSs). Although several computational methods have been designed to take both DNA sequence shape features into consideration simultaneously, how design efficient model intractable topic. In this paper, we proposed a...

10.1016/j.omtn.2021.02.014 article EN cc-by-nc-nd Molecular Therapy — Nucleic Acids 2021-02-21

Base-resolution prediction of transcription factor binding signals by a deep learning framework

OPENALEX - Publications

Qinhu Zhang Ying He Siguo Wang Zhan‐Heng Chen Zhen-Hao Guo and 3 more

Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at base-resolution level (named as FCNsignal). The proposed FCNsignal can simultaneously achieve following tasks: (i) modeling regions; (ii)...

10.1371/journal.pcbi.1009941 article EN cc-by PLoS Computational Biology 2022-03-09

A Brief Survey of Deep Learning-based Models for CircRNA-Protein Binding Sites Prediction

OPENALEX - Publications

Zhen Shen Lin Yuan Wenzheng Bao Siguo Wang Qinhu Zhang and 1 more

10.1016/j.neucom.2025.129637 article EN cc-by Neurocomputing 2025-02-01

Cross‐Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide‐Level Deep Neural Network

OPENALEX - Publications

Qinhu Zhang Siguo Wang Zhipeng Li Yijie Pan De‐Shuang Huang

Cross-species prediction of TF binding remains a major challenge due to the rapid evolutionary turnover individual sites, resulting in cross-species predictive performance being consistently worse than within-species performance. In this study, novel Nucleotide-Level Deep Neural Network (NLDNN) is first proposed predict within or across species. NLDNN regards task as nucleotide-level regression task, which takes DNA sequences input and directly predicts experimental coverage values. Beyond...

10.1002/advs.202405685 article EN cc-by Advanced Science 2024-07-30

Computational prediction and characterization of cell-type-specific and shared binding sites

OPENALEX - Publications

Qinhu Zhang Pengrui Teng Siguo Wang Ying He Zhen Cui and 5 more

Abstract Motivation Cell-type-specific gene expression is maintained in large part by transcription factors (TFs) selectively binding to distinct sets of sites different cell types. Recent research works have provided evidence that such cell-type-specific determined TF’s intrinsic sequence preferences, cooperative interactions with co-factors, chromatin landscapes and 3D interactions. However, computational prediction characterization shared rarely studied. Results In this article, we...

10.1093/bioinformatics/btac798 article EN cc-by Bioinformatics 2022-12-09

scCorrector: a robust method for integrating multi-study single-cell data

OPENALEX - Publications

Zhen-Hao Guo Yanbin Wang Siguo Wang Qinhu Zhang De-Shuang Huang

Abstract The advent of single-cell sequencing technologies has revolutionized cell biology studies. However, integrative analyses diverse data face serious challenges, including technological noise, sample heterogeneity, and different modalities species. To address these problems, we propose scCorrector, a variational autoencoder-based model that can integrate from studies map them into common space. Specifically, designed Study Specific Adaptive Normalization for each study in decoder to...

10.1093/bib/bbad525 article EN cc-by-nc Briefings in Bioinformatics 2024-01-22

Identification of Essential Proteins Based on Improved HITS Algorithm

OPENALEX - Publications

Xiujuan Lei Siguo Wang Fang‐Xiang Wu

Essential proteins are critical to the development and survival of cells. Identifying analyzing essential is vital understand molecular mechanisms living cells design new drugs. With high-throughput technologies, many protein–protein interaction (PPI) data available, which facilitates studies at network level. Up now, although various computational methods have been proposed, prediction precision still needs be improved. In this paper, we propose a novel method by applying Hyperlink-Induced...

10.3390/genes10020177 article EN Genes 2019-02-25

DeepTPpred: A Deep Learning Approach With Matrix Factorization for Predicting Therapeutic Peptides by Integrating Length Information

OPENALEX - Publications

Zhen Cui Siguo Wang Ying He Zhan‐Heng Chen Qinhu Zhang

The abuse of traditional antibiotics has led to increased resistance bacteria and viruses. Efficient therapeutic peptide prediction is critical for drug discovery. However, most the existing methods only make effective predictions one class peptides. It worth noting that currently no predictive method considers sequence length information as a distinct feature In this article, novel deep learning approach with matrix factorization predicting peptides (DeepTPpred) by integrating are proposed....

10.1109/jbhi.2023.3290014 article EN IEEE Journal of Biomedical and Health Informatics 2023-06-27

Predicting the sequence specificities of DNA-binding proteins by DNA Fine-tuned Language Model with decaying learning rates

OPENALEX - Publications

Ying He Qinhu Zhang Siguo Wang Zhan‐Heng Chen Zhen Cui and 2 more

DNA-binding proteins (DBPs) play vital roles in the regulation of biological systems. Although there are already many deep learning methods for predicting sequence specificities DBPs, they face two challenges as follows. Classic DBPs prediction usually fail to capture dependencies between genomic sequences since their commonly used one-hot codes mutually orthogonal. Besides, these perform poorly when samples inadequate. To address challenges, we developed a novel language model mining using...

10.1109/tcbb.2022.3165592 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022-04-07

FCNGRU: Locating Transcription Factor Binding Sites by Combing Fully Convolutional Neural Network With Gated Recurrent Unit

OPENALEX - Publications

Siguo Wang Ying He Zhan‐Heng Chen Qinhu Zhang

Deciphering the relationship between transcription factors (TFs) and DNA sequences is very helpful for computational inference of gene regulation a comprehensive understanding mechanisms. Transcription factor binding sites (TFBSs) are specific short that play pivotal role in controlling expression through interaction with TF proteins. Although recently many deep learning methods have been proposed to predict TFBSs aiming sequence specificity TF-DNA binding, there still lack effective...

10.1109/jbhi.2021.3117616 article EN IEEE Journal of Biomedical and Health Informatics 2021-10-06

DLoopCaller: A deep learning approach for predicting genome-wide chromatin loops by integrating accessible chromatin landscapes

OPENALEX - Publications

Siguo Wang Qinhu Zhang Ying He Zhen Cui Zhenghao Guo and 2 more

In recent years, major advances have been made in various chromosome conformation capture technologies to further satisfy the needs of researchers for high-quality, high-resolution contact interactions. Discriminating loops from genome-wide interactions is crucial dissecting three-dimensional(3D) genome structure and function. Here, we present a deep learning method predict chromatin loops, called DLoopCaller, by combining accessible landscapes raw Hi-C maps. Some available orthogonal data...

10.1371/journal.pcbi.1010572 article EN cc-by PLoS Computational Biology 2022-10-07

In silico prediction methods of self-interacting proteins: an empirical and academic survey

OPENALEX - Publications

Zhan‐Heng Chen Zhu‐Hong You Qinhu Zhang Zhen-Hao Guo Siguo Wang and 1 more

10.1007/s11704-022-1563-1 article EN Frontiers of Computer Science 2022-10-06

Using Fully Convolutional Network to Locate Transcription Factor Binding Sites Based on DNA Sequence and Conservation Information

OPENALEX - Publications

Qinhu Zhang Youhong Xu Siguo Wang Yong Wu Yuannong Ye and 4 more

Transcription factors (TFs) play a part in gene expression. TFs can form complex expression regulation system by combining with DNA. Thereby, identifying the binding regions has become an indispensable step for understanding regulatory mechanism of Due to great achievements applying deep learning (DL) computer vision and language processing recent years, many scholars are inspired use these methods predict TF sites (TFBSs), achieving extraordinary results. However, mainly focus on whether...

10.1109/tcbb.2022.3219831 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022-11-14

Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network

OPENALEX - Publications

Zhen Shen Wei Liu Shujun Zhao Qinhu Zhang Siguo Wang and 1 more

Introduction: CircRNA-protein binding plays a critical role in complex biological activity and disease. Various deep learning-based algorithms have been proposed to identify sites. These methods predict whether the CircRNA sequence includes protein sites from level, primarily concentrate on analysing specificity of binding. For model performance, these are unsatisfactory accurately predicting motif that special functions gene expression. Methods: In this study, based learning models...

10.3389/fgene.2023.1283404 article EN cc-by Frontiers in Genetics 2023-10-06

Predicting in-vitro DNA protein binding with a spatially aligned fusion of sequence and shape

OPENALEX - Publications

Qinhu Zhang Yindong Zhang Siguo Wang Zhan‐Heng Chen Valeriya Gribova and 2 more

Discovery of transcription factor binding sites (TFBSs) is primary importance for understanding the underlying mechanic and gene regulation process. Growing evidence indicates that apart from DNA sequences, shape landscape has a significant influence on preference. To effectively model co-influence sequence features, we emphasize position information motif pattern. In this paper, propose novel deep learning-based architecture, named hybridShape eDeepCNN, TFBS prediction which integrates in...

10.1109/tcbb.2021.3133869 article EN publisher-specific-oa IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022-11-01

NPENN: A Noise Perturbation Ensemble Neural Network for Microbiome Disease Phenotype Prediction

OPENALEX - Publications

Zhen Cui Yan Wu Qinhu Zhang Siguo Wang Zhen-Hao Guo

With advances in microbiomics, the crucial role of microbes disease progression is increasingly recognized. However, predicting phenotypes using microbiome data remains challenging due to complexity, heterogeneity, and limited model generalization. Current methods often depend on specific datasets are vulnerable adversarial attacks. To address these issues, this paper introduces a novel Noise Perturbation Ensemble Neural Network (NPENN), which combines noise mechanisms with Gradient Boosting...

10.1109/jbhi.2024.3507789 article EN IEEE Journal of Biomedical and Health Informatics 2024-11-27

Graph pooling for graph-level representation learning: a survey

OPENALEX - Publications

Zhipeng Li Siguo Wang Qinhu Zhang Yijie Pan Naian Xiao and 4 more

In graph-level representation learning tasks, graph neural networks have received much attention for their powerful feature capabilities. However, with the increasing scales of data, how to efficiently process and extract key information has become focus research. The pooling technique, as a step in networks, simplifies structure by merging nodes or subgraphs, which significantly improves computational efficiency extraction ability networks. Although various methods been proposed numerous...

10.1007/s10462-024-10949-2 article EN cc-by-nc-nd Artificial Intelligence Review 2024-12-20

Identifying Essential Proteins in Dynamic PPI Network with Improved FOA

OPENALEX - Publications

Xiujuan Lei Siguo Wang Linqiang Pan

Identification of essential proteins plays an important role for understanding the cellular life activity and development in postgenomic era. from protein-protein interaction (PPI) networks has become a hot topic recent years. In this work, fruit fly optimization algorithm (FOA) is extended identifying proteins, called EPFOA, which merges FOA with topological properties biological information identification. The EPFOA advantage multiple simultaneously rather than completely relying on...

10.15837/ijccc.2018.3.3285 article EN cc-by-nc International Journal of Computers Communications & Control 2018-05-27

Base-resolution prediction of transcription factor binding signals by a deep learning framework

OPENALEX - Publications

Qinhu Zhang Ying He Siguo Wang Zhan‐Heng Chen Zhen-Hao Guo and 2 more

Abstract Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at base-resolution level, called FCNsignal. The proposed FCNsignal can simultaneously achieve following tasks: (i) modeling regions; (ii)...

10.1101/2021.11.01.466840 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2021-11-04

MV-CVIB: a microbiome-based multi-view convolutional variational information bottleneck for predicting metastatic colorectal cancer

OPENALEX - Publications

Zhen Cui Yan Wu Qinhu Zhang Siguo Wang Ying He and 1 more

Imbalances in gut microbes have been implied many human diseases, including colorectal cancer (CRC), inflammatory bowel disease, type 2 diabetes, obesity, autism, and Alzheimer's disease. Compared with other CRC is a gastrointestinal malignancy high mortality probability of metastasis. However, current studies mainly focus on the prediction while neglecting more serious metastatic (mCRC). In addition, dimensionality small samples lead to complexity microbial data, which increases difficulty...

10.3389/fmicb.2023.1238199 article EN cc-by Frontiers in Microbiology 2023-08-22

Coming Soon ...