Jiansheng Wu

ORCID: 0000-0002-7941-9722
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Computational Drug Discovery Methods
  • Protein Structure and Dynamics
  • Receptor Mechanisms and Signaling
  • Machine Learning in Bioinformatics
  • RNA and protein synthesis mechanisms
  • Text and Document Classification Technologies
  • Advanced Image and Video Retrieval Techniques
  • Web Data Mining and Analysis
  • Machine Learning in Materials Science
  • Bioinformatics and Genomic Networks
  • Advanced Text Analysis Techniques
  • Algorithms and Data Compression
  • Image Retrieval and Classification Techniques
  • Alzheimer's disease research and treatments
  • Viral Infectious Diseases and Gene Expression in Insects
  • Video Surveillance and Tracking Methods
  • Topic Modeling
  • Genomics and Phylogenetic Studies
  • Quantum-Dot Cellular Automata
  • Domain Adaptation and Few-Shot Learning
  • interferon and immune responses
  • Protein Degradation and Inhibitors
  • Vehicle License Plate Recognition
  • Spam and Phishing Detection
  • Click Chemistry and Applications

Nanjing University of Posts and Telecommunications
2015-2025

Alibaba Group (China)
2025

Nanjing Health and Health Commission
2023

University of Science and Technology Liaoning
2012-2023

University of Michigan
2015-2018

Arizona State University
2015

Nanjing University
2013

AutoDock Vina is one of the most popular molecular docking tools. In latest benchmark CASF-2016 for comparative assessment scoring functions, won best power among all Modern drug discovery facing a common scenario large virtual screening hits from huge compound databases. Due to seriality characteristic algorithm, there no successful report on its parallel acceleration with GPUs. Current typically relies stack computing as well allocation resource and tasks, such VirtualFlow platform. The...

10.3390/molecules27093041 article EN cc-by Molecules 2022-05-09

Modern drug discovery typically faces large virtual screens from huge compound databases where multiple docking tools are involved for meeting various real scenes or improving the precision of screens. Among these tools, AutoDock Vina and its numerous derivatives most popular have become standard pipeline molecular in modern discovery. Our recent Vina-GPU method realized 14-fold acceleration against on a piece NVIDIA RTX 3090 GPU one screening case. Further speedup with graphics processing...

10.1021/acs.jcim.2c01504 article EN Journal of Chemical Information and Modeling 2023-03-20

Automated annotation of protein function is challenging. As the number sequenced genomes rapidly grows, vast majority proteins can only be annotated computationally. Nature often brings several domains together to form multi-domain and multi-functional with a possibilities, each domain may fulfill its own independently or in concerted manner neighbors. Thus, it evident that prediction problem naturally inherently Multi-Instance Multi-Label (MIML) learning tasks. Based on state-of-the-art...

10.1109/tcbb.2014.2323058 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2014-05-16

Autism spectrum disorder (ASD) is a complex neurodevelopmental mainly showed atypical social interaction, communication, and restricted, repetitive patterns of behavior, interests activities. Now clinic diagnosis ASD mostly based on psychological evaluation, clinical observation medical history. All these behavioral indexes could not avoid defects such as subjectivity reporter-dependency. Therefore researchers devoted themselves to seek relatively stable biomarkers supplementary diagnostic...

10.1002/aur.1711 article EN Autism Research 2016-11-22

The identification of RNA-binding residues in proteins is important several areas such as protein function, posttranscriptional regulation and drug design. We have developed PRBR (Prediction RNA Binding Residues), a novel method for identifying from amino acid sequences. Our combines hybrid feature with the enriched random forest (ERF) algorithm. composed predicted secondary structure information three features: evolutionary combined conservation physicochemical properties acids about...

10.1002/prot.22958 article EN Proteins Structure Function and Bioinformatics 2010-12-06

Hashing-based image retrieval methods have become a cutting-edge topic in the information domain due to their high efficiency and low cost. In order perform efficient hash learning by simultaneously preserving semantic similarity data structures feature space, this paper presents semi-supervised metric learning-based anchor graph hashing method. Our proposed approach can be divided into three parts. First, we exploit transformation matrix construct anchor-based of training set. Second,...

10.1109/tip.2018.2860898 article EN IEEE Transactions on Image Processing 2018-08-03

Precise assessment of ligand bioactivities (including IC50, EC50, Ki, Kd, etc.) is essential for virtual screening and lead compound identification. However, not all ligands have experimentally determined activities. In particular, many G protein-coupled receptors (GPCRs), which are the largest integral membrane protein family represent targets nearly 40% drugs on market, lack published experimental data about interactions. Computational methods with ability to accurately predict bioactivity...

10.1093/bioinformatics/bty070 article EN Bioinformatics 2018-02-07

Simulated Annealing (SA) algorithm is not effective with large optimization problems for its slow convergence. Hence, several parallel (pSA) methods have been proposed, where the increase of searching threads can boost speed Although satisfactory solutions be obtained by these methods, there no rigorous mathematical analyses on their effectiveness. Thus, this article introduces a probabilistic model, which theorem about effectiveness multiple initial states SA (MISPSA) has proven. The also...

10.1109/tcbb.2023.3323552 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2023-10-13

This study presents SDL-IUAs, a novel deep learning framework that employs self-attention mechanism to predict pregnancy outcomes in patients with intrauterine adhesions (IUAs) following hysteroscopic adhesiolysis. By integrating random forest-based feature importance analysis self-attention, the model achieves enhanced predictive accuracy and interpretability. The experiments were conducted on clinical dataset of 121 163 features, where 16.05% missing values effectively handled using matrix...

10.1063/5.0248831 article EN cc-by-nc AIP Advances 2025-04-01

Virtual screening has been widely used to identify potential hit candidates that can bind the target protein in drug discovery. Contemporary methods typically rely on oversimplified scoring functions, frequently yielding one-digit rates (or even zero) among top-ranked candidates. The substantial cost of laboratory validation further constrains exploration candidate molecules. We find test-time prediction refinement is almost blank this area, which means bioactivity feedback wet-lab...

10.1021/acs.jctc.4c01618 article EN Journal of Chemical Theory and Computation 2025-04-16

The recognition of microRNA (miRNA)-binding residues in proteins is helpful to understand how miRNAs silence their target genes. It difficult use existing computational method predict miRNA-binding due the lack training examples. To address this issue, unlabeled data may be exploited help construct a model. Semisupervised learning deals with methods for exploiting addition labeled automatically improve performance, where no human intervention assumed. In addition, almost always contain much...

10.1109/tcbb.2013.75 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2013-05-01

AutoDock Vina and its derivatives have established themselves as a prevailing pipeline for virtual screening in contemporary drug discovery. Our Vina-GPU method leverages the parallel computing power of GPUs to accelerate Vina, 2.0 further enhances speed derivatives. Given prevalence large screens modern discovery, improvement accuracy has become longstanding challenge. In this study, we propose 2.1, aimed at enhancing docking precision through integration novel algorithms facilitate...

10.1109/tcbb.2024.3467127 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2024-01-01

DNA-binding proteins are fundamentally important in understanding cellular processes. Thus, the identification of has particularly practical application various fields, such as drug design. We have proposed a novel approach method for predicting using only sequence information. The prediction model developed this study is constructed by support vector machine-sequential minimal optimization (SVM-SMO) algorithm conjunction with hybrid feature. feature incorporating evolutionary information...

10.1155/2013/524502 article EN cc-by Computational and Mathematical Methods in Medicine 2013-01-01

Molecular docking (MD) is one of the core steps in expensive and time-consuming process drug design, which basically an optimization problem based on scoring functions. AutoDock series MD software widely accepted by academia industry, among Vina (Vina) latest most popular version due to its accuracy relatively high speed. However, contrast prior version, i.e., AutoDock4, hardware acceleration approaches are rarely reported. In this article, we propose Vina-field-programmable gate array...

10.1109/tvlsi.2022.3217275 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2022-11-04

Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening drug discovery. Unfortunately, many important targets lack experimental data about the bioactivities; this is particularly true G protein-coupled receptors (GPCRs), which account a third drugs currently on market. Computational approaches with potential precise assessment determination key substructural features determine needed to address issue.A new method, SED, was proposed predict recognize...

10.1093/bioinformatics/btz336 article EN cc-by-nc Bioinformatics 2019-06-11

Alzheimer’s disease (AD) is a neurodegenerative brain in the elderly. Identifying patients with mild cognitive impairment (MCI) who are more likely to progress AD key step prevention. Recent studies have shown that heterogeneous disease. In this study, we propose subtyping-based prediction strategy predict conversion from MCI three years according patient subtypes. Structural magnetic resonance imaging (sMRI) data and multi-omics data, including genotype gene expression profiling derived...

10.3390/brainsci11060674 article EN cc-by Brain Sciences 2021-05-21

Most of the news headline generation models that use sequence-to-sequence model or recurrent network have two shortcomings: lack parallel ability and easily repeated words. It is difficult to select important words in reproduce these expressions, resulting inaccurately summarizes news. In this work, we propose a TD-NHG model, which stands for based on an improved decoder from transformer. The uses masked multi-head self-attention learn feature information different representation subspaces...

10.1038/s41598-022-15817-z article EN cc-by Scientific Reports 2022-07-08

Mild Cognitive Impairment (MCI) is a preclinical stage of Alzheimer's Disease (AD) and clinical heterogeneity. The classification MCI crucial for the early diagnosis treatment AD. In this study, we investigated potential using both labeled unlabeled samples from Neuroimaging Initiative (ADNI) cohort to classify through multimodal co-training method. We utilized structural magnetic resonance imaging (sMRI) data genotype 364 including 228 136 ADNI-1 cohort. First, selected quantitative trait...

10.1109/tcbb.2021.3053061 article EN cc-by IEEE/ACM Transactions on Computational Biology and Bioinformatics 2021-01-20

Abstract Ligand molecules naturally constitute a graph structure. Recently, many excellent deep learning (DGL) methods have been proposed and used to model ligand bioactivities, which is critical for the virtual screening of drug hits from compound databases in interest. However, pharmacists can find that these well-trained DGL models usually are hard achieve satisfying performance real scenarios candidates. The main challenges involve datasets training were small-sized biased, inner active...

10.1093/bib/bbac077 article EN Briefings in Bioinformatics 2022-02-24

Accurate modeling of compound bioactivities is essential for the virtual screening drug leads. In real-world scenarios, pharmacists tend to choose from top-k hit compounds ranked by predicted a large database with interest continue wet experiments discovery. Significant improvement precision top hits in ligand-based leads more valuable than conventional schemes accurately predicting all database. Here, we proposed new method, RealVS, significantly improve hits' and learn interpretable key...

10.1021/acs.jcim.1c01021 article EN Journal of Chemical Information and Modeling 2021-10-07

Simulated Annealing (SA) algorithm is not effective with large optimization problems for its slow convergence. Hence, several parallel (pSA) methods have been proposed, where the increase of searching threads can boost speed Although these obtain satisfactory solutions to problems, there no rigorous mathematical analyses on their effectiveness. Thus, this paper introduces a probabilistic model, which theorem about effectiveness multiple initial states SA (MISPSA) has proven. The also...

10.2139/ssrn.4120348 article EN SSRN Electronic Journal 2022-01-01

G protein-coupled receptors (GPCRs) are one of the most important drug targets, accounting for ∼34% drugs on market. For discovery, accurate modeling and explanation bioactivities ligands is critical screening optimization hit compounds. Homologous GPCRs more likely to interact with chemically similar ligands, they tend share common binding modes ligand molecules. The inclusion homologous in learning potentially enhances accuracy interpretability models due utilizing increased training...

10.1021/acs.jcim.9b01000 article EN Journal of Chemical Information and Modeling 2020-02-10

In multi-instance multi-label learning (MIML) problems, predicting the labels of unseen bags becomes difficult when their instances are not provided directly. Therefore, it is necessary to exploit label correlations enhance accuracy MIML classification. This paper presents metric learning-based MIML-kNN (MI(ML)2kNN) method, which composed three parts. First, Laplacian matrix can be learned obtain by minimizing manifold regularizer. Then, based on correlations, a novel objective function for...

10.1109/access.2019.2928218 article EN cc-by IEEE Access 2019-01-01
Coming Soon ...