NFDI4DS | UHH-SEMS - Publication Details

Yaohang Li

ORCID: 0000-0003-0178-1876

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5079316675

Research Areas

Protein Structure and Dynamics
Computational Drug Discovery Methods
Machine Learning in Bioinformatics
Bioinformatics and Genomic Networks
Enzyme Structure and Function
Distributed and Parallel Computing Systems
Parallel Computing and Optimization Techniques
Particle physics theoretical and experimental studies
Sparse and Compressive Sensing Techniques
RNA and protein synthesis mechanisms
Advanced Data Storage Technologies
Scientific Computing and Data Management
Matrix Theory and Algorithms
Computational Physics and Python Applications
Stochastic Gradient Optimization Techniques
Gene expression and cancer classification
Algorithms and Data Compression
Face and Expression Recognition
Biomedical Text Mining and Ontologies
Genomics and Phylogenetic Studies
Cloud Computing and Resource Management
Text and Document Classification Technologies
Gene Regulatory Network Analysis
Machine Learning in Materials Science
High-Energy Particle Collisions Research

Old Dominion University
2016-2025

University of Virginia
2024

Dominion (United States)
2020

Central South University
2019

Institute of Computing Technology
2017

Chinese Academy of Sciences
2017

Information Technology Laboratory
2017

Tsinghua University
2017

Dominion University College
2016

North Carolina Agricultural and Technical State University
2003-2010

Prediction of lncRNA–disease associations based on inductive matrix completion

OPENALEX - Publications

Chengqian Lu Mengyun Yang Feng Luo Fang‐Xiang Wu Min Li and 3 more

Accumulating evidences indicate that long non-coding RNAs (lncRNAs) play pivotal roles in various biological processes. Mutations and dysregulations of lncRNAs are implicated miscellaneous human diseases. Predicting lncRNA–disease associations is beneficial to disease diagnosis as well treatment. Although many computational methods have been developed, precisely identifying associations, especially for novel lncRNAs, remains challenging. In this study, we propose a method (named SIMCLDA)...

10.1093/bioinformatics/bty327 article EN Bioinformatics 2018-04-25

Computational drug repositioning using low-rank matrix approximation and randomized algorithms

OPENALEX - Publications

Huimin Luo Min Li Shaokai Wang Quan Liu Yaohang Li and 1 more

Abstract Motivation Computational drug repositioning is an important and efficient approach towards identifying novel treatments for diseases in discovery. The emergence of large-scale, heterogeneous biological biomedical datasets has provided unprecedented opportunity developing computational methods. problem can be modeled as a recommendation system that recommends based on known drug–disease associations. formulation under this matrix completion, assuming the hidden factors contributing...

10.1093/bioinformatics/bty013 article EN Bioinformatics 2018-01-19

Protein–protein interaction site prediction through combining local and global features with deep neural networks

OPENALEX - Publications

Min Zeng Fuhao Zhang Fang‐Xiang Wu Yaohang Li Jianxin Wang and 1 more

Abstract Motivation Protein–protein interactions (PPIs) play important roles in many biological processes. Conventional experiments for identifying PPI sites are costly and time-consuming. Thus, computational approaches have been proposed to predict sites. Existing methods usually use local contextual features Actually, global of protein sequences critical site prediction. Results A new end-to-end deep learning framework, named DeepPPISP, through combining sequence features, is For we a...

10.1093/bioinformatics/btz699 article EN Bioinformatics 2019-09-04

Drug repositioning based on bounded nuclear norm regularization

OPENALEX - Publications

Mengyun Yang Huimin Luo Yaohang Li Jianxin Wang

Abstract Motivation Computational drug repositioning is a cost-effective strategy to identify novel indications for existing drugs. Drug often modeled as recommendation system problem. Taking advantage of the known drug–disease associations, objective new treatments by filling out unknown entries in association matrix, which matrix completion. Underpinned fact that common molecular pathways contribute many different diseases, assumes underlying latent factors determining associations are...

10.1093/bioinformatics/btz331 article EN cc-by-nc Bioinformatics 2019-05-10

DeepDTAF: a deep learning method to predict protein–ligand binding affinity

OPENALEX - Publications

Kaili Wang Renyi Zhou Yaohang Li Min Li

Abstract Biomolecular recognition between ligand and protein plays an essential role in drug discovery development. However, it is extremely time resource consuming to determine the protein–ligand binding affinity by experiments. At present, many computational methods have been proposed predict affinity, most of which usually require 3D structures that are not often available. Therefore, new can fully take advantage sequence-level features greatly needed accelerate process. We developed a...

10.1093/bib/bbab072 article EN Briefings in Bioinformatics 2021-02-23

A survey of matrix completion methods for recommendation systems

OPENALEX - Publications

Andy Ramlatchan Mengyun Yang Quan Liu Min Li Jianxin Wang and 1 more

In recent years, the recommendation systems have become increasingly popular and been used in a broad variety of applications. Here, we investigate matrix completion techniques for that are based on collaborative filtering. The filtering problem can be viewed as predicting favorability user with respect to new items commodities. When rating is constructed users rows, columns, entries ratings, then modeled by filling out unknown elements matrix. This article presents comprehensive survey...

10.26599/bdma.2018.9020008 article EN cc-by Big Data Mining and Analytics 2018-07-02

Automated ICD-9 Coding via A Deep Learning Approach

OPENALEX - Publications

Min Li Zhihui Fei Min Zeng Fang‐Xiang Wu Yaohang Li and 2 more

ICD-9 (the Ninth Revision of International Classification Diseases) is widely used to describe a patient's diagnosis. Accurate automated coding important because manual expensive, time-consuming, and inefficient. Inspired by the recent successes deep learning, in this study, we present learning framework called DeepLabeler automatically assign codes. combines convolutional neural network with `Document Vector' technique extract encode local global features. Our proposed demonstrates its...

10.1109/tcbb.2018.2817488 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2018-03-20

Clinical big data and deep learning: Applications, challenges, and future outlooks

OPENALEX - Publications

Ying Yu Min Li Liangliang Liu Yaohang Li Jianxin Wang

The explosion of digital healthcare data has led to a surge data-driven medical research based on machine learning. In recent years, as powerful technique for big data, deep learning gained central position in circles its great advantages feature representation and pattern recognition. This article presents comprehensive overview studies that employ methods deal with clinical data. Firstly, the analysis characteristics various types (e.g., images, notes, lab results, vital signs demographic...

10.26599/bdma.2019.9020007 article EN cc-by Big Data Mining and Analytics 2019-08-05

DeepDSC: A Deep Learning Method to Predict Drug Sensitivity of Cancer Cell Lines

OPENALEX - Publications

Min Li Yake Wang Ruiqing Zheng Xinghua Shi Yaohang Li and 2 more

High-throughput screening technologies have provided a large amount of drug sensitivity data for panel cancer cell lines and hundreds compounds. Computational approaches to analyzing these can benefit anticancer therapeutics by identifying molecular genomic determinants developing new drugs. In this study, we developed deep learning architecture improve the performance prediction based on data. We integrated both features chemical information compounds predict half maximal inhibitory...

10.1109/tcbb.2019.2919581 article EN publisher-specific-oa IEEE/ACM Transactions on Computational Biology and Bioinformatics 2019-05-28

Gaining competitive intelligence from social media data

OPENALEX - Publications

Wu He Jiancheng Shen Xin Tian Yaohang Li Vasudeva Akula and 2 more

Purpose – Social media analytics uses data mining platforms, tools and techniques to collect, monitor analyze massive amounts of social extract useful patterns, gain insight into market requirements enhance business intelligence. The purpose this paper is propose a framework for competitive intelligence value Design/methodology/approach authors conducted case study collect set with nearly half million tweets related two largest retail chains in the world: Walmart Costco past three months...

10.1108/imds-03-2015-0098 article EN Industrial Management & Data Systems 2015-10-19

Predicting drug–target interaction using positive-unlabeled learning

OPENALEX - Publications

Wei Lan Jianxin Wang Min Li Jin Liu Yaohang Li and 2 more

10.1016/j.neucom.2016.03.080 article EN Neurocomputing 2016-05-31

A Deep Learning Framework for Identifying Essential Proteins by Integrating Multiple Types of Biological Information

OPENALEX - Publications

Min Zeng Min Li Zhihui Fei Fang‐Xiang Wu Yaohang Li and 2 more

Computational methods including centrality and machine learning-based have been proposed to identify essential proteins for understanding the minimum requirements of survival evolution a cell. In methods, researchers are required design score function which is based on prior knowledge, yet usually not sufficient capture complexity biological information. some selected features cannot represent complete properties information as they lack computational framework automatically select features....

10.1109/tcbb.2019.2897679 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2019-02-05

DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions

OPENALEX - Publications

Fuhao Zhang Hong Song Min Zeng Yaohang Li Lukasz Kurgan and 1 more

Abstract Annotation of protein functions plays an important role in understanding life at the molecular level. High‐throughput sequencing produces massive numbers raw proteins sequences and only about 1% them have been manually annotated with functions. Experimental annotations are expensive, time‐consuming do not keep up rapid growth sequence numbers. This motivates development computational approaches that predict A novel deep learning framework, DeepFunc, is proposed which accurately...

10.1002/pmic.201900019 article EN PROTEOMICS 2019-04-03

Identifying At-Risk Students for Early Interventions—A Time-Series Clustering Approach

OPENALEX - Publications

Jui-Long Hung Morgan C. Wang Shuyan Wang Maha Abdelrasoul Yaohang Li and 1 more

The purpose of this paper is to identify at-risk online students earlier, more often, and with greater accuracy using time-series clustering. case study showed that the proposed approach could generate models higher feasibility than traditional frequency aggregation approaches. best performing model can start capture from week 10. In addition, four phases in student's learning process detected holiday effect illustrate students' behaviors before after a long break. findings also enable...

10.1109/tetc.2015.2504239 article EN IEEE Transactions on Emerging Topics in Computing 2015-11-26

AttentionDTA: prediction of drug–target binding affinity using attention model

OPENALEX - Publications

Qichang Zhao Fen Xiao Mengyun Yang Yaohang Li Jianxin Wang

In bioinformatics, machine learning-based prediction of drug-target interaction (DTI) plays an important role in virtual screening drug discovery. DTI prediction, which have been treated as a binary classification problem, depends on the concentration two molecules, between and other factors. The degree affinity molecule (such compound) target receptor or protein kinase) reflects how tightly binds to particular is quantified by measurement can reflect more detailed specific information than...

10.1109/bibm47256.2019.8983125 article EN 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019-11-01

United neighborhood closeness centrality and orthology for predicting essential proteins

OPENALEX - Publications

Gaoshi Li Min Li Jianxin Wang Yaohang Li Yi Pan

Identifying essential proteins plays an important role in disease study, drug design, and understanding the minimal requirement for cellular life. Computational methods discovery overcome disadvantages of biological experimental that are often time-consuming, expensive, inefficient. The topological features protein-protein interaction (PPI) networks used to design computational prediction methods, such as Degree Centrality (DC), Betweenness (BC), Closeness (CC), Subgraph (SC), Eigenvector...

10.1109/tcbb.2018.2889978 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2020-07-01

SDLDA: lncRNA-disease association prediction based on singular value decomposition and deep learning

OPENALEX - Publications

Min Zeng Chengqian Lu Fuhao Zhang Yiming Li Fang‐Xiang Wu and 2 more

10.1016/j.ymeth.2020.05.002 article EN Methods 2020-05-05

Computational drug repositioning based on multi-similarities bilinear matrix factorization

OPENALEX - Publications

Mengyun Yang Gaoyan Wu Qichang Zhao Yaohang Li Jianxin Wang

Abstract With the development of high-throughput technology and accumulation biomedical data, prior information biological entity can be calculated from different aspects. Specifically, drug–drug similarities measured target profiles, interaction side effects. Similarly, methods data sources to calculate disease ontology result in multiple measures pairwise similarities. Therefore, computational drug repositioning, developing a dynamic method optimize fusion process is crucial challenging...

10.1093/bib/bbaa267 article EN Briefings in Bioinformatics 2020-11-04

Simulation of Electron-Proton Scattering Events by a Feature-Augmented and Transformed Generative Adversarial Network (FAT-GAN)

OPENALEX - Publications

Yasir Alanazi N. Sato Tianbo Liu Wally Melnitchouk P. Ambrozewicz and 7 more

We apply generative adversarial network (GAN) technology to build an event generator that simulates particle production in electron-proton scattering is free of theoretical assumptions about underlying dynamics. The difficulty efficiently training a GAN simulator lies learning the complicated patterns distributions particles physical properties. develop selects set transformed features from momenta can be generated easily by generator, and uses these produce augmented improve sensitivity...

10.24963/ijcai.2021/293 preprint EN 2021-08-01

BACPI: a bi-directional attention neural network for compound–protein interaction and binding affinity prediction

OPENALEX - Publications

Min Li Zhangli Lu Yifan Wu Yaohang Li

Abstract Motivation The identification of compound–protein interactions (CPIs) is an essential step in the process drug discovery. experimental determination CPIs known for a large amount funds and time it consumes. Computational model has therefore become promising efficient alternative predicting novel between compounds proteins on scale. Most supervised machine learning prediction models are approached as binary classification problem, which aim to predict whether there interaction...

10.1093/bioinformatics/btac035 article EN Bioinformatics 2022-01-14

AttentionDTA: Drug–Target Binding Affinity Prediction by Sequence-Based Deep Learning With Attention Mechanism

OPENALEX - Publications

Qichang Zhao Guihua Duan Mengyun Yang Zhongjian Cheng Yaohang Li and 1 more

The identification of drug–target relations (DTRs) is substantial in drug development. A large number methods treat DTRs as drug-target interactions (DTIs), a binary classification problem. main drawback these are the lack reliable negative samples and absence many important aspects DTR, including their dose dependence quantitative affinities. With increasing publications drug–protein binding affinity data recently, prediction can be viewed regression problem affinities (DTAs) which reflects...

10.1109/tcbb.2022.3170365 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022-04-26

Efficient Randomized Algorithms for the Fixed-Precision Low-Rank Matrix Approximation

OPENALEX - Publications

Wenjian Yu Yu Gu Yaohang Li

Randomized algorithms for low-rank matrix approximation are investigated, with the emphasis on fixed-precision problem and computational efficiency handling large matrices. The based so-called QB factorization, where Q is an orthonormal matrix. First, a mechanism calculating error in Frobenius norm proposed, which enables efficient adaptive rank determination and/or sparse It can be combined any QB-form factorization algorithm B's rows incrementally generated. Based blocked randQB by...

10.1137/17m1141977 article EN SIAM Journal on Matrix Analysis and Applications 2018-01-01

RCD+: Fast loop modeling server

OPENALEX - Publications

José Ramón López‐Blanco Alejandro Jesús Canosa-Valls Yaohang Li Pablo Chacón

Modeling loops is a critical and challenging step in protein modeling prediction. We have developed quick online service (http://rcd.chaconlab.org) for ab initio loop combining coarse-grained conformational search with full-atom refinement. Our original Random Coordinate Descent (RCD) closure algorithm has been greatly improved to enrich the sampling distribution towards near-native conformations. These improvements include new workflow optimization, MPI-parallelization fast backbone angle...

10.1093/nar/gkw395 article EN cc-by-nc Nucleic Acids Research 2016-05-05

Coming Soon ...