NFDI4DS | UHH-SEMS - Publication Details

Nilah M. Ioannidis

ORCID: 0000-0001-9628-8229

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5063118302

Research Areas

Genomics and Chromatin Dynamics
Genomics and Rare Diseases
Bioinformatics and Genomic Networks
Gene expression and cancer classification
Genetic Associations and Epidemiology
Machine Learning in Bioinformatics
Genomics and Phylogenetic Studies
Epigenetics and DNA Methylation
Nonmelanoma Skin Cancer Studies
Nutrition, Genetics, and Disease
Genomic variations and chromosomal abnormalities
RNA Research and Splicing
Birth, Development, and Health
RNA and protein synthesis mechanisms
Renal and related cancers
Cancer Genomics and Diagnostics
Advanced Proteomics Techniques and Applications
Muscle Physiology and Disorders
Cutaneous lymphoproliferative disorders research
RNA modifications and cancer
Immunotherapy and Immune Responses
Genetic and Kidney Cyst Diseases
Genetics, Aging, and Longevity in Model Organisms
Adipose Tissue and Metabolism
Vector-Borne Animal Diseases

Chan Zuckerberg Initiative (United States)
2022-2025

University of California, Berkeley
2020-2024

Berkeley College
2024

Stanford University
2016-2020

Jain Foundation
2018-2019

REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants

OPENALEX - Publications

Nilah M. Ioannidis Joseph H. Rothstein Vikas Pejaver Sumit Middha Shannon K. McDonnell and 36 more

10.1016/j.ajhg.2016.08.016 article EN publisher-specific-oa The American Journal of Human Genetics 2016-09-22

Critical assessment of missense variant effect predictors on disease-relevant variant data

OPENALEX - Publications

Ruchir Rastogi Ryan Chung Sindy Li Chang Li Kyoungyeul Lee and 31 more

Abstract Regular, systematic, and independent assessments of computational tools that are used to predict the pathogenicity missense variants necessary evaluate their clinical research utility guide future improvements. The Critical Assessment Genome Interpretation (CAGI) conducts ongoing Annotate-All-Missense (Missense Marathon) challenge, in which variant effect predictors (also called impact predictors) evaluated on added disease-relevant databases following prediction submission...

10.1007/s00439-025-02732-2 article EN cc-by Human Genetics 2025-03-21

Identification of Susceptibility Loci for Cutaneous Squamous Cell Carcinoma

OPENALEX - Publications

Maryam M. Asgari Wei Wang Nilah M. Ioannidis Jacqueline Itnyre Thomas J. Hoffmann and 2 more

10.1016/j.jid.2016.01.013 article EN publisher-specific-oa Journal of Investigative Dermatology 2016-01-29

Tissue-specific impacts of aging and genetics on gene expression patterns in humans

OPENALEX - Publications

Ryō Yamamoto Ryan Chung Juan Manuel Vázquez Huanjie Sheng Philippa Steinberg and 2 more

Age is the primary risk factor for many common human diseases. Here, we quantify relative contributions of genetics and aging to gene expression patterns across 27 tissues from 948 humans. We show that predictive power quantitative trait loci impacted by age in tissues. Jointly modelling transcript level variation find heritability (h2) consistent among while contribution varies >20-fold with [Formula: see text] 5 force purifying selection stronger on genes expressed early versus late life...

10.1038/s41467-022-33509-0 article EN cc-by Nature Communications 2022-10-03

Personal transcriptome variation is poorly explained by current genomic deep learning models

OPENALEX - Publications

Connie Huang Richard W. Shuai Parth Baokar Ryan Chung Ruchir Rastogi and 2 more

Genomic deep learning models can predict genome-wide epigenetic features and gene expression levels directly from DNA sequence. While current perform well at predicting across genes in different cell types the reference genome, their ability to explain variation between individuals due cis-regulatory genetic variants remains largely unexplored. Here, we evaluate four state-of-the-art on paired personal genome transcriptome data find limited performance when explaining individuals. In...

10.1038/s41588-023-01574-w article EN cc-by Nature Genetics 2023-11-30

Cross-protein transfer learning substantially improves disease variant prediction

OPENALEX - Publications

Milind Jagota Chengzhong Ye Carlos Albors Ruchir Rastogi Antoine Koehl and 2 more

Genetic variation in the human genome is a major determinant of individual disease risk, but vast majority missense variants have unknown etiological effects. Here, we present robust learning framework for leveraging saturation mutagenesis experiments to construct accurate computational predictors proteome-wide variant pathogenicity.We train cross-protein transfer (CPT) models using deep mutational scanning (DMS) data from only five proteins and achieve state-of-the-art performance on...

10.1186/s13059-023-03024-6 article EN cc-by Genome biology 2023-08-07

Estimating prevalence for limb-girdle muscular dystrophy based on public sequencing databases

OPENALEX - Publications

Wei Liu Sander Pajusalu Nicole J. Lake Geyu Zhou Nilah M. Ioannidis and 7 more

10.1038/s41436-019-0544-8 article EN publisher-specific-oa Genetics in Medicine 2019-05-19

Personal transcriptome variation is poorly explained by current genomic deep learning models

OPENALEX - Publications

Connie Huang Richard W. Shuai Parth Baokar Ryan Chung Ruchir Rastogi and 2 more

Abstract Genomic deep learning models can predict genome-wide epigenetic features and gene expression levels directly from DNA sequence. While current perform well at predicting across genes in different cell types the reference genome, their ability to explain variation between individuals due cis-regulatory genetic variants remains largely unexplored. Here we evaluate four state-of-the-art on paired personal genome transcriptome data find limited performance when explaining individuals.

10.1101/2023.06.30.547100 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-06-30

Critical assessment of missense variant effect predictors on disease-relevant variant data

OPENALEX - Publications

Ruchir Rastogi Ryan Chung Sindy Li Chang Li Kyoungyeul Lee and 31 more

Abstract Regular, systematic, and independent assessment of computational tools used to predict the pathogenicity missense variants is necessary evaluate their clinical research utility suggest directions for future improvement. Here, as part sixth edition Critical Assessment Genome Interpretation (CAGI) challenge, we assess variant effect predictors (or impact predictors) on an evaluation dataset rare from disease-relevant databases. Our evaluates submitted CAGI6 Annotate-All-Missense...

10.1101/2024.06.06.597828 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2024-06-08

Variants in tubule epithelial regulatory elements mediate most heritable differences in human kidney function

OPENALEX - Publications

Gabriel B. Loeb Pooja Kathail Richard W. Shuai Ryan Chung Reinier J. Grona and 14 more

10.1038/s41588-024-01904-6 article EN Nature Genetics 2024-09-10

Current genomic deep learning models display decreased performance in cell type-specific accessible regions

OPENALEX - Publications

Pooja Kathail Richard W. Shuai Ryan Chung Chun Ye Gabriel B. Loeb and 1 more

Abstract Background A number of deep learning models have been developed to predict epigenetic features such as chromatin accessibility from DNA sequence. Model evaluations commonly report performance genome-wide; however, cis regulatory elements (CREs), which play critical roles in gene regulation, make up only a small fraction the genome. Furthermore, cell type-specific CREs contain large proportion complex disease heritability. Results We evaluate genomic regions with varying degrees type...

10.1186/s13059-024-03335-2 article EN cc-by Genome biology 2024-08-01

Two-stage Study of Familial Prostate Cancer by Whole-exome Sequencing and Custom Capture Identifies 10 Novel Genes Associated with the Risk of Prostate Cancer

OPENALEX - Publications

Daniel J. Schaid Shannon K. McDonnell Liesel M. FitzGerald Lissa DeRycke Zachary C. Fogarty and 34 more

10.1016/j.eururo.2020.07.038 article EN European Urology 2020-08-14

FIRE: functional inference of genetic variants that regulate gene expression

OPENALEX - Publications

Nilah M. Ioannidis Joe R. Davis Marianne K. DeGorter Nicholas B. Larson Shannon K. McDonnell and 8 more

Interpreting genetic variation in noncoding regions of the genome is an important challenge for personal analysis. One mechanism by which single nucleotide variants (SNVs) influence downstream phenotypes through regulation gene expression. Methods to predict whether or not individual SNVs are likely regulate expression would aid interpretation unknown significance identified whole-genome sequencing studies.We developed FIRE (Functional Inference Regulators Expression), a tool score both and...

10.1093/bioinformatics/btx534 article EN Bioinformatics 2017-08-23

Improving interpretability of transcription factor binding models with DNA shape features

OPENALEX - Publications

Ryan L. Keivanfar Forest Yang Katherine S. Pollard Nilah M. Ioannidis

Deep learning models in genomics that predict molecular phenotypes from DNA sequence traditionally focus on one-hot encoded representations. Here, we develop a novel model extends this approach by incorporating structural attributes indicative of local shape alongside canonical inputs. This augmentation provides an additional axis for interpretability and aids identifying regulatory patterns not apparent alone. Applying to prediction transcription factor binding (ChIP-seq) demonstrates...

10.1101/2025.04.01.646034 preprint EN cc-by-nc bioRxiv (Cold Spring Harbor Laboratory) 2025-04-03

Fine-tuning sequence-to-expression models on personal genome and transcriptome data

OPENALEX - Publications

Ruchir Rastogi Aniketh Janardhan Reddy Ryan Chung Nilah M. Ioannidis

Genomic sequence-to-expression deep learning models, which are trained to predict gene expression and other molecular phenotypes across the reference genome, have recently been shown poor out-of-the-box performance in predicting variation individuals based on their personal genome sequences. Here we explore whether additional training (fine-tuning) paired transcriptome data improves of such models. Using Enformer as a representative pre-trained model, various fine-tuning strategies. Our...

10.1101/2024.09.23.614632 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-09-25

Gene expression imputation identifies candidate genes and susceptibility loci associated with cutaneous squamous cell carcinoma

OPENALEX - Publications

Nilah M. Ioannidis Wei Wang Nicholas A. Furlotte David A. Hinds Michelle Agee and 27 more

Cutaneous squamous cell carcinoma (cSCC) is a common skin cancer with genetic susceptibility loci identified in recent genome-wide association studies (GWAS). Transcriptome-wide (TWAS) using imputed gene expression levels can identify additional gene-level associations. Here we impute 6891 cSCC cases and 54,566 controls the Kaiser Permanente Genetic Epidemiology Research Adult Health Aging (GERA) cohort 25,558 self-reported 673,788 from 23andMe. In discovery-validation study, 19 containing...

10.1038/s41467-018-06149-6 article EN cc-by Nature Communications 2018-10-09

Variants in tubule epithelial regulatory elements mediate most heritable differences in human kidney function

OPENALEX - Publications

Gabriel B. Loeb Pooja Kathail Richard W. Shuai Ryan Chung Reinier J. Grona and 14 more

Kidney disease is highly heritable; however, the causal genetic variants, cell types in which these variants function, and molecular mechanisms underlying kidney remain largely unknown. To identify loci affecting we performed a GWAS using multiple function biomarkers identified 462 loci. begin to investigate how affect generated single-cell chromatin accessibility (scATAC-seq) maps of human candidate

10.1101/2024.06.18.599625 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-06-22

Designing Cell-Type-Specific Promoter Sequences Using Conservative Model-Based Optimization

OPENALEX - Publications

Aniketh Janardhan Reddy Xinyang Geng Michael H. Herschl Sathvik Kolli Aviral Kumar and 3 more

Gene therapies have the potential to treat disease by delivering therapeutic genetic cargo disease-associated cells. One limitation their widespread use is lack of short regulatory sequences, or promoters, that differentially induce expression delivered in target cells, minimizing side effects other cell types. Such cell-type-specific promoters are difficult discover using existing methods, requiring either manual curation access large datasets promoter-driven from both targeted and...

10.1101/2024.06.23.600232 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-06-23

A Prediction Tool to Facilitate Risk-Stratified Screening for Squamous Cell Skin Cancer

OPENALEX - Publications

Wei Wang Eric Jorgenson Nilah M. Ioannidis Maryam M. Asgari Alice S. Whittemore

10.1016/j.jid.2018.03.1528 article EN publisher-specific-oa Journal of Investigative Dermatology 2018-07-02

Pretraining strategies for effective promoter-driven gene expression prediction

OPENALEX - Publications

Aniketh Janardhan Reddy Michael H. Herschl Xinyang Geng Sathvik Kolli Amy X. Lu and 4 more

The ability to deliver genetic cargo human cells is enabling rapid progress in molecular medicine, but designing this for precise expression specific cell types a major challenge. Expression driven by regulatory DNA sequences within short synthetic promoters, relatively few of these promoters are cell-type-specific. design cell-type-specific using model-based optimization would be impactful research and therapeutic applications. However, models from (promoter-driven expression) lacking most...

10.1101/2023.02.24.529941 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-02-27

GUANinE v1.0: Benchmark Datasets for Genomic AI Sequence-to-Function Models

OPENALEX - Publications

Eyes S Robson Nilah M. Ioannidis

Computational genomics increasingly relies on machine learning methods for genome interpretation, and the recent adoption of neural sequence-to-function models highlights need rigorous model specification controlled evaluation, problems familiar to other fields AI. Research strategies that have greatly benefited -- including benchmarking, auditing, algorithmic fairness --- are also needed advance field genomic AI facilitate development. Here we propose a benchmark, GUANinE, evaluating...

10.1101/2023.10.12.562113 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-10-17

Genetic variants in the HLA class II region associated with risk of cutaneous squamous cell carcinoma

OPENALEX - Publications

Wei Wang Hanna M. Ollila Alice S. Whittemore Shadmehr Demehri Nilah M. Ioannidis and 3 more

10.1007/s00262-018-2168-2 article EN Cancer Immunology Immunotherapy 2018-05-12

Cross-protein transfer learning substantially improves disease variant prediction

OPENALEX - Publications

Milind Jagota Chengzhong Ye Carlos Albors Ruchir Rastogi Antoine Koehl and 2 more

Abstract Genetic variation in the human genome is a major determinant of individual disease risk, but vast majority missense variants have unknown etiological effects. Here, we present robust learning framework for leveraging saturation mutagenesis experiments to construct accurate computational predictors proteome-wide variant pathogenicity. We train cross-protein transfer (CPT) models using deep mutational scanning data from only five proteins and achieve state-of-the-art performance on...

10.1101/2022.11.15.516532 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2022-11-17

Predicting target genes of non-coding regulatory variants with IRT

OPENALEX - Publications

Zhenqin Wu Nilah M. Ioannidis James Zou

Interpreting genetic variants of unknown significance (VUS) is essential in clinical applications genome sequencing for diagnosis and personalized care. Non-coding remain particularly difficult to interpret, despite making up a large majority trait associations identified genome-wide association studies (GWAS) analyses. Predicting the regulatory effects non-coding on candidate genes key step evaluating their significance. Here, we develop machine-learning algorithm, Inference Connected...

10.1093/bioinformatics/btaa254 article EN Bioinformatics 2020-04-17

Coming Soon ...