NFDI4DS | UHH-SEMS - Publication Details

Ramzan Umarov

ORCID: 0000-0003-3477-7101

A5033647731

Research Areas

RNA and protein synthesis mechanisms
RNA modifications and cancer
Machine Learning in Bioinformatics
Genomics and Phylogenetic Studies
Genomics and Chromatin Dynamics
RNA Research and Splicing
Molecular Biology Techniques and Applications
Protein Structure and Dynamics
Computational Drug Discovery Methods
Gene Regulatory Network Analysis
Antibiotic Resistance in Bacteria
Microbial Metabolic Engineering and Bioproduction
Tuberculosis Research and Epidemiology
Pharmaceutical and Antibiotic Environmental Impacts
Cancer-related molecular mechanisms research
Cell Image Analysis Techniques
Colorectal Cancer Treatments and Studies
Cryptographic Implementations and Security
Chaos-based Image/Signal Encryption
Neural Networks and Applications
Genetic factors in colorectal cancer
Cancer Genomics and Diagnostics
Viral Infectious Diseases and Gene Expression in Insects
Cancer-related gene regulation
Bacteriophages and microbial interactions

RIKEN Center for Integrative Medical Sciences
2023-2024

King Abdullah University of Science and Technology
2015-2021

Bioscience Research
2021

Hiroshima University
2020-2021

University of Utah
2017

Tsinghua University
2017

DEEPre: sequence-based enzyme EC number prediction by deep learning

OPENALEX - Publications

Yu Li, Sheng Wang, Ramzan Umarov, Bingqing Xie, Ming Fan, and 2 more

Annotation of enzyme function has a broad range of applications, such as metagenomics, industrial biotechnology, and diagnosis of enzyme deficiency-caused diseases. However, the time and resources required make it prohibitively expensive to experimentally determine the function of every enzyme. Therefore, computational enzyme function prediction has become increasingly important. In this paper, we develop such an approach, determining enzyme function by predicting the Enzyme Commission number. We propose an end-to-end feature selection and classification model training approach, as well...

10.1093/bioinformatics/btx680 article EN cc-by-nc Bioinformatics 2017-10-20

Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks

Ramzan Umarov, Victor Solovyev

Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene-specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained a similar CNN architecture on promoters of five distant organisms: human, mouse, plant (Arabidopsis), and two bacteria (Escherichia coli...

10.1371/journal.pone.0171410 article EN cc-by PLoS ONE 2017-02-03

TSSPlant: a new tool for prediction of plant Pol II promoters

Ilham A. Shahmuradov, Ramzan Umarov, Victor Solovyev

Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide...

10.1093/nar/gkw1353 article EN cc-by-nc Nucleic Acids Research 2017-01-12

Promoter analysis and prediction in the human genome using sequence-based deep learning models

Ramzan Umarov, Hiroyuki Kuwahara, Yu Li, Xin Gao, Victor Solovyev

Motivation: Computational identification of promoters is notoriously difficult as human genes often have unique promoter sequences that provide regulation of transcription and interaction with the transcription initiation complex. While there are many attempts to develop computational promoter identification methods, we have no reliable tool to analyze long genomic sequences. Results: In this work, we further develop our deep learning approach that was relatively successful at discriminating short promoter and non-promoter sequences. Instead of focusing on the classification...

10.1093/bioinformatics/bty1068 article EN Bioinformatics 2018-12-27

A deep learning framework to predict binding preference of RNA constituents on protein surface

Jordy Homing Lam, Yu Li, Lizhe Zhu, Ramzan Umarov, Hanlun Jiang, and 10 more

Protein-RNA interaction plays important roles in post-transcriptional regulation. However, the task of predicting these interactions given a protein structure is difficult. Here we show that, by leveraging a deep learning model, NucleicNet, attributes such as the binding preference of RNA backbone constituents and different bases can be predicted from local physicochemical characteristics of the protein surface. On a diverse set of challenging RNA-binding proteins, including Fem-3-binding-factor 2, Argonaute 2...

10.1038/s41467-019-12920-0 article EN cc-by Nature Communications 2019-10-30

Analysis of transcript-deleterious variants in Mendelian disorders: implications for RNA-based diagnostics

Sateesh Maddirevula, Hiroyuki Kuwahara, Nour Ewida, Hanan E. Shamseldin, Nisha Patel, and 23 more

Background: At least 50% of patients with suspected Mendelian disorders remain undiagnosed after whole-exome sequencing (WES), and the extent to which non-coding variants that are not captured by WES contribute to this fraction is unclear. Whole transcriptome sequencing is a promising supplement to WES, although empirical data on the contribution of RNA analysis to the diagnosis of Mendelian diseases on a large scale are scarce. Results: Here, we describe our experience with transcript-deleterious variants (TDVs) based on a cohort of 5647 families with suspected Mendelian diseases. We...

10.1186/s13059-020-02053-9 article EN cc-by Genome biology 2020-06-17

HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes

Yu Li, Zeling Xu, Wenkai Han, Huiluo Cao, Ramzan Umarov, and 7 more

Background: The spread of antibiotic resistance has become one of the most urgent threats to global health, and is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human, and mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, previous computational methods for identifying ARGs are mostly based...

10.1186/s40168-021-01002-3 article EN cc-by Microbiome 2021-02-08

Sequence2Vec: a novel embedding approach for modeling transcription factor binding affinity landscape

Hanjun Dai, Ramzan Umarov, Hiroyuki Kuwahara, Yu Li, Le Song, and 1 more

Motivation: An accurate characterization of the transcription factor (TF)-DNA affinity landscape is crucial to a quantitative understanding of the molecular mechanisms underpinning endogenous gene regulation. While recent advances in biotechnology have brought the opportunity for building binding affinity prediction methods, characterizing the TF-DNA affinity landscape still remains a challenging problem. Results: Here we propose a novel sequence embedding approach for modeling the affinity landscape. Our method represents DNA sequences as a hidden Markov model...

10.1093/bioinformatics/btx480 article EN cc-by-nc Bioinformatics 2017-07-26

RNA Secondary Structure Prediction By Learning Unrolled Algorithms

Xinshi Chen, Yu Li, Ramzan Umarov, Xin Gao, Le Song

In this paper, we propose an end-to-end deep learning model, called E2Efold, for RNA secondary structure prediction which can effectively take into account the inherent constraints in the problem. The key idea of E2Efold is to directly predict the RNA base-pairing matrix, and use an unrolled algorithm for constrained programming as the template for deep architectures to enforce constraints. With comprehensive experiments on benchmark datasets, we demonstrate the superior performance of E2Efold: it predicts significantly better...

10.48550/arxiv.2002.05810 preprint EN other-oa arXiv (Cornell University) 2020-01-01

DeepCellState: An autoencoder-based framework for predicting cell type specific transcriptional states induced by drug treatment

Ramzan Umarov, Yu Li, Erik Arner

Drug treatment induces cell type specific transcriptional programs, and as the number of combinations of drugs and cell types grows, the cost for exhaustive screens measuring the transcriptional drug response becomes intractable. We developed DeepCellState, a deep learning autoencoder-based framework, for predicting the induced transcriptional state in a cell type after drug treatment, based on the drug response in another cell type. Training the method on a large collection of transcriptional drug perturbation profiles, prediction accuracy improves significantly over baseline and alternative approaches when applying the method to two...

10.1371/journal.pcbi.1009465 article EN cc-by PLoS Computational Biology 2021-10-05

ReFeaFi: Genome-wide prediction of regulatory elements driving transcription initiation

Ramzan Umarov, Yu Li, Takahiro Arakawa, Satoshi Takizawa, Xin Gao, and 1 more

Regulatory elements control gene expression through transcription initiation (promoters) and by enhancing transcription at distant regions (enhancers). Accurate identification of regulatory elements is fundamental for annotating genomes and understanding gene expression patterns. While there are many attempts to develop computational promoter and enhancer identification methods, reliable tools to analyze long genomic sequences are still lacking. Prediction methods often perform poorly on the genome-wide scale because the number of negatives is much higher than that in...

10.1371/journal.pcbi.1009376 article EN cc-by PLoS Computational Biology 2021-09-07

SBOLme: a Repository of SBOL Parts for Metabolic Engineering

Hiroyuki Kuwahara, Xuefeng Cui, Ramzan Umarov, Raik Grünberg, Chris J. Myers, and 1 more

The Synthetic Biology Open Language (SBOL) is a community-driven open language to promote standardization in synthetic biology. To support the use of SBOL in metabolic engineering, we developed SBOLme, the first open-access repository of SBOL 2-compliant biochemical parts for a wide range of metabolic engineering applications. The URL of our repository is http://www.cbrc.kaust.edu.sa/sbolme.

10.1021/acssynbio.6b00278 article EN ACS Synthetic Biology 2017-01-12

PromID: human promoter prediction by deep learning

Ramzan Umarov, Hiroyuki Kuwahara, Yu Li, Xin Gao, Victor Solovyev

Computational identification of promoters is notoriously difficult as human genes often have unique promoter sequences that provide regulation of transcription and interaction with the transcription initiation complex. While there are many attempts to develop computational promoter identification methods, we have no reliable tool to analyze long genomic sequences. In this work we further develop our deep learning approach that was relatively successful at discriminating short promoter and non-promoter sequences. Instead of focusing on the classification accuracy, in this work we predict the exact positions...

10.48550/arxiv.1810.01414 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Prediction of Prokaryotic and Eukaryotic Promoters Using Convolutional Deep Learning Neural Networks

Victor Solovyev, Ramzan Umarov

Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene-specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained the same CNN architecture on promoters of four very distant organisms: human, plant (Arabidopsis), and two bacteria (Escherichia coli...

10.48550/arxiv.1610.00121 preprint EN other-oa arXiv (Cornell University) 2016-01-01

CFC-seq: identification of full-length capped RNAs unveil enhancer-derived transcription

Chi Wai Yip, Callum Parr, Hazuki Takahashi, Kayoko Yasuzawa, Matthew Valentine, and 39 more

Long-read sequencing has emerged as a powerful tool for uncovering novel transcripts and genes. However, existing protocols often lack confidence in identifying the transcription start site (TSS) and fail to capture non-poly(A) RNA, thereby limiting the discovery of novel genes, particularly long non-coding RNAs (lncRNAs). In this study, we introduce Cap-trap full-length cDNA sequencing (CFC-seq), a comprehensive protocol that combines Cap-trapping and poly(A)-tailing with Oxford Nanopore sequencing. This enables...

10.1101/2024.10.31.620483 preprint EN cc-by-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-10-31

ACRE: Absolute concentration robustness exploration in module-based combinatorial networks

Hiroyuki Kuwahara, Ramzan Umarov, Islam Almasri, Xin Gao

To engineer cells for industrial-scale application, a deep understanding of how to design molecular control mechanisms to tightly maintain functional stability under various fluctuations is crucial. Absolute concentration robustness (ACR) is a category of robustness in reaction network models in which the steady-state concentration of a species is guaranteed to be invariant even with perturbations in the other species in the network. Here, we introduce a software tool, absolute concentration robustness explorer (ACRE), which efficiently explores combinatorial biochemical networks for the ACR property....

10.1093/synbio/ysx001 article EN cc-by-nc-nd Synthetic Biology 2017-01-01

DeepCellState: an autoencoder-based framework for predicting cell type-specific transcriptional states induced by drug treatment

Ramzan Umarov, Yu Li, Erik Arner

Drug treatment induces cell type-specific transcriptional programs, and as the number of combinations of drugs and cell types grows, the cost for exhaustive screens measuring the transcriptional drug response becomes intractable. We developed DeepCellState, a deep learning autoencoder-based framework, for predicting the induced transcriptional state in a cell type after drug treatment, based on the drug response in another cell type. Training the method on a large collection of transcriptional drug perturbation profiles, prediction accuracy improves significantly over baseline and alternative approaches when...

10.1101/2020.12.14.422792 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-12-15

ReFeaFi: Genome-wide prediction of regulatory elements driving transcription initiation

Ramzan Umarov, Yu Li, Takahiro Arakawa, Satoshi Takizawa, Xin Gao, and 1 more

Regulatory elements control gene expression through transcription initiation (promoters) and by enhancing transcription at distant regions (enhancers). Accurate identification of regulatory elements is fundamental for annotating genomes and understanding gene expression patterns. While there are many attempts to develop computational promoter and enhancer identification methods, reliable tools to analyze long genomic sequences are still lacking. Prediction methods often perform poorly on the genome-wide scale because the number of negatives is much higher than...

10.1101/2021.03.31.437992 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2021-04-02

FendOff encryption software to secure personal information on computers and mobile devices

Victor V. Solovyev, Ramzan Umarov

The paper describes several original cryptographic cipher modules (VSEM) that are based on using a one-time pseudorandom pad and transpositions. VSEM includes 4 modules of encryption that can be applied in combinations. We studied the ability of these modules to secure private data against attacks and their speed of encryption. VSEM was implemented in FendOff applications for mobile devices on iOS and Android platforms as well as in a computer application running Windows or Mac OS. We describe these applications, designed to encrypt/decrypt various personal data such as passwords,...

10.48550/arxiv.1511.00050 preprint EN other-oa arXiv (Cornell University) 2015-01-01
