NFDI4DS | UHH-SEMS - Publication Details

Tao Jiang

ORCID: 0000-0003-3833-4498

About

Contact & Profiles

A5101911266

Research Areas

Algorithms and Data Compression
Advanced Graph Theory Research
Limits and Structures in Graph Theory
Genomics and Phylogenetic Studies
Gene expression and cancer classification
RNA and protein synthesis mechanisms
Semigroups and automata theory
DNA and Biological Computing
Machine Learning and Algorithms
Graph theory and applications
Genome Rearrangement Algorithms
RNA modifications and cancer
Cellular Automata and Applications
Genetic Associations and Epidemiology
Genomics and Chromatin Dynamics
Graph Labeling and Dimension Problems
Computability, Logic, AI Algorithms
Genetic Mapping and Diversity in Plants and Animals
RNA Research and Splicing
Advanced Topology and Set Theory
Bioinformatics and Genomic Networks
Mercury impact and mitigation studies
Complexity and Algorithms in Graphs
Graph theory and CDMA systems
Computational Drug Discovery Methods

Southwest University
2016-2025

Chengdu University of Traditional Chinese Medicine
2024-2025

University of California, Riverside
2015-2024

Tsinghua University
2015-2024

Jianghan University
2022-2024

Northeast Institute of Geography and Agroecology
2014-2024

Rice Research Institute
2024

Chinese Academy of Sciences
2015-2024

Ames National Laboratory
2024

Czech Academy of Sciences, Institute of Biophysics
2024

On the Complexity of Multiple Sequence Alignment

OPENALEX - Publications

Lusheng Wang Tao Jiang

We study the computational complexity of two popular problems in multiple sequence alignment: alignment with SP-score and tree alignment. It is shown that the first problem is NP-complete and the second is MAX SNP-hard. Tree alignment with a given phylogeny is also considered.

10.1089/cmb.1994.1.337 article EN Journal of Computational Biology 1994-01-01
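
As an illustration of the SP-score named in the abstract above, the following minimal sketch computes a sum-of-pairs cost for a fixed alignment. The function name `sp_score`, the `-` gap symbol, and the cost values (0 for a match, 1 for a mismatch, 2 for a gap) are assumptions chosen for this example and are not taken from the paper. The sketch only evaluates an alignment that is already given; finding an optimal alignment is the problem the paper shows to be NP-complete.

```python
from itertools import combinations


def sp_score(alignment, match=0, mismatch=1, gap=2):
    """Sum-of-pairs cost of a multiple alignment given as equal-length rows.

    The cost values are illustrative assumptions, not the paper's scheme.
    """
    if len({len(row) for row in alignment}) > 1:
        raise ValueError("all rows must have the same length")
    total = 0
    # Sum the pairwise cost over every unordered pair of rows.
    for a, b in combinations(alignment, 2):
        for x, y in zip(a, b):
            if x == "-" and y == "-":
                continue  # a gap aligned with a gap costs nothing here
            if x == "-" or y == "-":
                total += gap
            elif x == y:
                total += match
            else:
                total += mismatch
    return total


rows = ["AC-T", "A-GT", "ACGT"]
print(sp_score(rows))
```

The pairwise loop makes the cost of evaluating one alignment quadratic in the number of sequences and linear in the alignment length.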

Efficient and Robust Feature Extraction by Maximum Margin Criterion

Haizhou Li Tao Jiang Kai Zhang

In pattern recognition, feature extraction techniques are widely employed to reduce the dimensionality of data and enhance discriminatory information. Principal component analysis (PCA) and linear discriminant analysis (LDA) are the two most popular dimensionality reduction methods. However, PCA is not very effective for extracting the most discriminant features, and LDA is not stable due to the small sample size problem. In this paper, we propose some new (linear and nonlinear) feature extractors based on the maximum margin criterion (MMC). Geometrically, feature extractors based on MMC maximize the (average) margin between classes...

10.1109/tnn.2005.860852 article EN IEEE Transactions on Neural Networks 2006-01-01

The Mg-Chelatase H Subunit of Arabidopsis Antagonizes a Group of WRKY Transcription Repressors to Relieve ABA-Responsive Genes of Inhibition

Yi Shang Yan Lu Zhiqiang Liu Zheng Cao Chao Mei and 11 more

The phytohormone abscisic acid (ABA) plays a vital role in plant development and response to environmental challenges, but the complex networks of ABA signaling pathways are poorly understood. We previously reported that a chloroplast protein, magnesium-protoporphyrin IX chelatase H subunit (CHLH/ABAR), functions as a receptor for ABA in Arabidopsis thaliana. Here, we report that ABAR spans the chloroplast envelope and that its cytosolic C terminus interacts with a group of WRKY transcription factors (WRKY40, WRKY18, and WRKY60)...

10.1105/tpc.110.073874 article EN cc-by The Plant Cell 2010-06-01

Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters

Chuhu Yang Eugene Bolotin Tao Jiang Frances M. Sladek Ernest Martinez

10.1016/j.gene.2006.09.029 article EN Gene 2006-10-11

ChemmineR: a compound mining framework for R

Yiqun Cao Anna Charisi Li-Chang Cheng Tao Jiang Thomas Girke

Software applications for structural similarity searching and clustering of small molecules play an important role in drug discovery and chemical genomics. Here, we present the first open-source compound mining framework for the popular statistical programming environment R. The integration with a powerful statistical environment maximizes the flexibility, expandability and programmability of the provided analysis functions. We discuss the algorithms and utilities provided by the R package ChemmineR. It contains functions for structural similarity searching, clustering of compound libraries with a wide spectrum...

10.1093/bioinformatics/btn307 article EN cc-by-nc Bioinformatics 2008-07-02

NeoDTI: neural integration of neighbor information from a heterogeneous network for discovering new drug–target interactions

Fangping Wan Lixiang Hong An Xiao Tao Jiang Jianyang Zeng

Motivation: Accurately predicting drug–target interactions (DTIs) in silico can guide the drug discovery process and thus facilitate drug development. Computational approaches for DTI prediction that adopt the systems biology perspective generally exploit the rationale that the properties of drugs and targets can be characterized by their functional roles in biological networks. Results: Inspired by the recent advance of information passing and aggregation techniques that generalize convolution neural networks to mine large-scale graph...

10.1093/bioinformatics/bty543 article EN Bioinformatics 2018-06-29

SCALE method for single-cell ATAC-seq analysis via latent feature extraction

Lei Xiong Kui Xu Tian Kang Yanqiu Shao Lei Tang and 4 more

Single-cell ATAC-seq (scATAC-seq) profiles the chromatin accessibility landscape at the single-cell level, thus revealing cell-to-cell variability in gene regulation. However, the high dimensionality and sparsity of scATAC-seq data often complicate the analysis. Here, we introduce a method for analyzing scATAC-seq data, called Single-Cell ATAC-seq analysis via Latent feature Extraction (SCALE). SCALE combines a deep generative framework and a probabilistic Gaussian Mixture Model to learn latent features that accurately...

10.1038/s41467-019-12630-7 article EN cc-by Nature Communications 2019-10-08

MONN: A Multi-objective Neural Network for Predicting Compound-Protein Interactions and Affinities

Shuya Li Fangping Wan Hantao Shu Tao Jiang Dan Zhao and 1 more

Computational approaches for understanding compound-protein interactions (CPIs) can greatly facilitate drug development. Recently, a number of deep-learning-based methods have been proposed to predict binding affinities and attempt to capture local interaction sites in compounds and proteins through neural attentions (i.e., neural network architectures that enable the interpretation of feature importance). Here, we compiled a benchmark dataset containing the inter-molecular non-covalent interactions for more than 10,000 compound-protein pairs...

10.1016/j.cels.2020.03.002 article EN cc-by-nc-nd Cell Systems 2020-04-01

Drug target prediction through deep learning functional representation of gene signatures

Hao Chen Frederick J. King Bin Zhou Yu Wang Carter J. Canedy and 11 more

Many machine learning applications in bioinformatics currently rely on matching gene identities when analyzing input gene signatures and fail to take advantage of preexisting knowledge about gene functions. To further enable comparative analysis of OMICS datasets, including target deconvolution and mechanism of action studies, we develop an approach that represents gene signatures projected onto their biological functions, instead of their identities, similar to how the word2vec technique works in natural language processing. We...

10.1038/s41467-024-46089-y article EN cc-by Nature Communications 2024-02-29

A multi-scale information fusion-based multiple correlations for unsupervised attribute selection

Pengfei Zhang Dexian Wang Zheng Yu Yujie Zhang Tao Jiang and 1 more

10.1016/j.inffus.2024.102276 article EN Information Fusion 2024-02-01

Minimal NFA Problems are Hard

Tao Jiang Bala Ravikumar

Finite automata (FA's) are of fundamental importance in theory and applications. The following basic minimization problem is studied: Given a DFA (deterministic FA), find a minimum equivalent nondeterministic FA (NFA). This paper shows that the natural decision problem associated with it is PSPACE-complete. More generally, let ${\text{A}} \to {\text{B}}$ denote the problem of converting a given FA of type A to a minimum FA of type B. This paper also shows that most of these problems are computationally hard. Motivated by the question of how much nondeterminism suffices to make...

10.1137/0222067 article EN SIAM Journal on Computing 1993-12-01

Alignment of trees — an alternative to tree edit

Tao Jiang

10.1016/0304-3975(95)80015-8 article EN Theoretical Computer Science 1995-05-29

On the complexity of comparing evolutionary trees

Jotun Hein Tao Jiang Lusheng Wang Kaizhong Zhang

10.1016/s0166-218x(96)00062-5 article EN Discrete Applied Mathematics 1996-12-01

On the Approximation of Shortest Common Supersequences and Longest Common Subsequences

Tao Jiang Ming Li

The problems of finding shortest common supersequences (SCS) and longest common subsequences (LCS) are two well-known $\textbf{NP}$-hard problems that have applications in many areas, including computational molecular biology, data compression, robot motion planning, scheduling, text editing, etc. A lot of fruitless effort has been spent in searching for good approximation algorithms for these problems. In this paper, we show that these problems are inherently hard to approximate in the worst case. In particular, we prove that (i) SCS does not have a...

10.1137/s009753979223842x article EN SIAM Journal on Computing 1995-10-01
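
For readers unfamiliar with the two problems named in the abstract above, the following minimal sketch handles the easy special case of exactly two sequences, where standard dynamic programming applies. The function names `lcs_length` and `scs_length` are assumptions made for this example. The hardness results in the paper concern the general problem over many sequences, which this sketch does not address.

```python
def lcs_length(a, b):
    """Length of a longest common subsequence of two sequences (classic DP)."""
    prev = [0] * (len(b) + 1)  # DP row for the prefix of `a` processed so far
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)  # extend a common subsequence
            else:
                curr.append(max(prev[j], curr[j - 1]))  # drop x or drop y
        prev = curr
    return prev[-1]


def scs_length(a, b):
    """Length of a shortest common supersequence of two sequences.

    For two sequences, |SCS| = |a| + |b| - |LCS|.
    """
    return len(a) + len(b) - lcs_length(a, b)


print(lcs_length("AGGTAB", "GXTXAYB"), scs_length("AGGTAB", "GXTXAYB"))
```

Keeping only two rows of the table bounds memory by the length of the second sequence, while running time stays proportional to the product of the two lengths.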

A General Edit Distance between RNA Structures

Tao Jiang Guohui Lin Bin Ma Kaizhong Zhang

Arc-annotated sequences are useful in representing the structural information of RNA sequences. In general, RNA secondary and tertiary structures can be represented as a set of nested arcs and a set of crossing arcs, respectively. Since RNA functions are largely determined by molecular conformation and therefore by these structures, the comparison between RNA structures has received much attention recently. In this paper, we propose a notion of edit distance to measure the similarity between two RNA structures, incorporating various edit operations performed on both bases (i.e.,...

10.1089/10665270252935511 article EN Journal of Computational Biology 2002-04-01

A maximum common substructure-based algorithm for searching and predicting drug-like compounds

Yiqun Cao Tao Jiang Thomas Girke

The prediction of biologically active compounds is of great importance for high-throughput screening (HTS) approaches in drug discovery and chemical genomics. Many computational methods in this area focus on measuring the structural similarities between chemical structures. However, traditional similarity measures are often too rigid or consider only global similarities between structures. The maximum common substructure (MCS) approach provides a more promising and flexible alternative for predicting bioactive compounds. In this article, a new backtracking...

10.1093/bioinformatics/btn186 article EN cc-by-nc Bioinformatics 2008-06-27

Integrated approach for the identification of human hepatocyte nuclear factor 4α target genes using protein binding microarrays

Eugene Bolotin Hailing Liao Tuong Chi Ta Chuhu Yang Wendy W. Hwang‐Verslues and 3 more

Hepatocyte nuclear factor 4 alpha (HNF4α), a member of the nuclear receptor superfamily, is essential for liver function and is linked to several diseases including diabetes, hemophilia, atherosclerosis, and hepatitis. Although many DNA response elements and target genes have been identified for HNF4α, the complete repertoire of binding sites in the human genome is unknown. Here, we adapt protein binding microarrays (PBMs) to examine the DNA-binding characteristics of two HNF4α species (rat and human) and isoforms (HNF4α2 and HNF4α8) in a high-throughput...

10.1002/hep.23357 article EN Hepatology 2009-10-05

Association Between Variants of PRDM1 and NDP52 and Crohn's Disease, Based on Exome Sequencing and Functional Studies

David Ellinghaus Hu Zhang Sebastian Zeißig Simone Lipinski Andreas Till and 56 more

10.1053/j.gastro.2013.04.040 article EN Gastroenterology 2013-04-26

IsoLasso: A LASSO Regression Approach to RNA-Seq Based Transcriptome Assembly

Wei Li Jianxing Feng Tao Jiang

The new second generation sequencing technology revolutionizes many biology-related research fields and poses various computational biology challenges. One of them is transcriptome assembly based on RNA-Seq data, which aims at reconstructing all full-length mRNA transcripts simultaneously from millions of short reads. In this article, we consider three objectives in transcriptome assembly: the maximization of prediction accuracy, the minimization of interpretation, and the maximization of completeness. The first objective requires that...

10.1089/cmb.2011.0171 article EN Journal of Computational Biology 2011-09-28

TITER: predicting translation initiation sites by deep learning

Sai Zhang Hailin Hu Tao Jiang Lei Zhang Jianyang Zeng

Motivation: Translation initiation is a key step in the regulation of gene expression. In addition to the annotated translation initiation sites (TISs), the translation process may also start at multiple alternative TISs (including both AUG and non-AUG codons), which makes it challenging to predict TISs and study the underlying regulatory mechanisms. Meanwhile, the advent of several high-throughput sequencing techniques for profiling initiating ribosomes at single-nucleotide resolution, e.g. GTI-seq and QTI-seq, provides abundant data...

10.1093/bioinformatics/btx247 article EN cc-by-nc Bioinformatics 2017-04-24

Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques

Jun Li Qingguang Chen Xiaojuan Hu Pei Yuan Longtao Cui and 9 more

10.1016/j.ijmedinf.2021.104429 article EN International Journal of Medical Informatics 2021-02-23

Algal Organic Matter Drives Methanogen-Mediated Methylmercury Production in Water from Eutrophic Shallow Lakes

Pei Lei Jin Zhang Jinjie Zhu Qiao‐Guo Tan Raymond W. M. Kwong and 4 more

Algal blooms bring massive amounts of algal organic matter (AOM) into eutrophic lakes, which influences microbial methylmercury (MeHg) production. However, because of the complexity of AOM and its dynamic changes during decomposition, the relationship between AOM and Hg methylators remains poorly understood, which hinders predicting MeHg production and bioaccumulation in shallow lakes. To address that, we explored the impacts of AOM on MeHg production by characterizing dissolved organic matter with Fourier transform ion cyclotron resonance mass spectrometry...

10.1021/acs.est.0c08395 article EN Environmental Science & Technology 2021-07-08

DOM influences Hg methylation in paddy soils across a Hg contamination gradient

Mahmoud A. Abdelhafiz Jiang Liu Tao Jiang Qiang Pu Muhammad Wajahat Aslam and 3 more

10.1016/j.envpol.2023.121237 article EN Environmental Pollution 2023-02-07

Heating-Induced Redox Property Dynamics of Peat Soil Dissolved Organic Matter in a Simulated Peat Fire: Electron Exchange Capacity and Molecular Characteristics

Peijie Yang Ying Wang Xiangwei Tian Yifan Cui Tao Jiang and 9 more

Peatlands store one-third of the world's soil organic carbon. Globally increased fires have altered peat organic matter chemistry, yet the redox property and molecular dynamics of peat-dissolved organic matter (PDOM) during fires remain poorly characterized, limiting our understanding of postfire biogeochemical processes. Clarifying these dynamic changes is essential for effective peatland fire management. This study demonstrates temperature-dependent changes in the electron exchange capacity (EEC) of PDOM by simulating peat burning, significantly...

10.1021/acs.est.4c09174 article EN Environmental Science & Technology 2025-01-02

Assignment of Orthologous Genes via Genome Rearrangement

Xin Chen Jie Zheng Zheng Qing Fu Nan Peng Yang Zhong and 2 more

The assignment of orthologous genes between a pair of genomes is a fundamental and challenging problem in comparative genomics. Existing methods that assign orthologs based on the similarity between DNA or protein sequences may make erroneous assignments when sequence similarity does not clearly delineate the evolutionary relationship among genes of the same families. In this paper, we present a new approach to ortholog assignment that takes into account both sequence similarity and evolutionary events at the genome level, where orthologous genes are assumed to correspond to each other in the most parsimonious evolving...

10.1109/tcbb.2005.48 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2005-10-01
