NFDI4DS | UHH-SEMS - Publication Details

Predicting responses to platin chemotherapy agents with biochemically-inspired machine learning

OPENALEX - Publications

Eliseos J. Mucaki, Jonathan Z.L. Zhao, Daniel J. Lizotte, Peter K. Rogan

The selection of effective genes that accurately predict chemotherapy responses might improve cancer outcomes. We compare optimized gene signatures for cisplatin, carboplatin, and oxaliplatin in the same cell lines and validate each signature using data from patients with cancer. Supervised support vector machine learning is used to derive gene sets whose expression is related to cell line GI50 values by backwards feature selection with cross-validation. Specific functional pathways distinguishing sensitive...

10.1038/s41392-018-0034-5 article EN cc-by Signal Transduction and Targeted Therapy 2019-01-11

Interpretation of mRNA splicing mutations in genetic disease: review of the literature and guidelines for information-theoretical analysis

Natasha Caminsky, Eliseos J. Mucaki, Peter K. Rogan

The interpretation of genomic variants has become one of the paramount challenges in the post-genome sequencing era. In this review we summarize nearly 20 years of research on applications of information theory (IT) to interpret coding and non-coding mutations that alter mRNA splicing in rare and common diseases. We compile the spectrum of published variants analyzed by IT, to provide a broad perspective of the distribution of deleterious natural and cryptic splice site variants detected, as well as those affecting splicing regulatory sequences. Results for...

10.12688/f1000research.5654.1 preprint EN cc-by F1000Research 2014-11-18

FANCM c.5791C>T nonsense mutation (rs144567652) induces exon skipping, affects DNA repair activity and is a familial breast cancer risk factor

Paolo Peterlongo, Irene Catucci, Mara Colombo, Laura Caleca, Eliseos J. Mucaki and 91 more

Numerous genetic factors that influence breast cancer risk are known. However, approximately two-thirds of the overall familial risk remains unexplained. To determine whether some of the missing heritability is due to rare variants conferring high to moderate risk, we tested for an association between the c.5791C>T nonsense mutation (p.Arg1931*; rs144567652) in exon 22 of the FANCM gene and breast cancer. An analysis of genotyping data from 8635 cases and 6625 controls from different countries yielded an association [odds ratio (OR) = 3.93 (95%...

10.1093/hmg/ddv251 article EN Human Molecular Genetics 2015-06-30

Generative and integrative modeling for transcriptomics with formalin fixed paraffin embedded material

Eliseos J. Mucaki, W. J. Zhang, A. Saha, Sabina Trebinjac, Sharon Nofech‐Mozes and 3 more

Formalin-fixed paraffin embedded (FFPE) samples are challenging to profile using existing high-throughput sequencing technologies, including RNA-seq. This difficulty primarily arises from the degradation of nucleic acids, a problem that becomes particularly acute with samples stored for extended periods. FFPE-derived RNA-seq (fRNA-seq) data have a high rate of transcript dropout, a property shared with single-cell RNA-seq. Transcript counts also have high variance and are prone to extreme values. We introduce PaRaffin Embedded...

10.1101/2025.02.21.639356 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2025-02-27

Mutational landscape of pure ductal carcinoma in situ and associations with disease prognosis and response to radiotherapy

Naiyer A. Rizvi, Eliseos J. Mucaki, Emily L Salmini, Monica Zhang, Sabina Trebinjac and 5 more

Ductal Carcinoma in Situ (DCIS) management is challenged by the absence of reliable markers predictive of radiotherapy (RT) response, leading to both overtreatment of indolent disease and inadequate treatment for aggressive cases. Through whole-exome sequencing of 147 DCIS cases, we characterized the genomic landscape and identified associations with prognosis, specifically the risk of local recurrence (in situ or invasive) within 10 years after diagnosis. Our analysis revealed that pure DCIS has frequent mutations in genes governing tissue...

10.1101/2025.03.01.25323122 preprint EN cc-by-nc medRxiv (Cold Spring Harbor Laboratory) 2025-03-03

Prediction of Mutant mRNA Splice Isoforms by Information Theory-Based Exon Definition

Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan

Mutations that affect mRNA splicing often produce multiple mRNA isoforms, resulting in complex molecular phenotypes. Definition of an exon and its inclusion in mature mRNA relies on joint recognition of both acceptor and donor splice sites. This study predicts cryptic and exon-skipping isoforms produced by splicing mutations from the combined information contents (Ri, which measures binding-site strength, in bits) and distribution of the splice sites defining these exons. The total information content of an exon (Ri,total) is the sum of the Ri values of its acceptor and donor splice sites, adjusted for...

10.1002/humu.22277 article EN Human Mutation 2013-01-24

Prioritizing Variants in Complete Hereditary Breast and Ovarian Cancer Genes in Patients Lacking Known BRCA Mutations

Natasha Caminsky, Eliseos J. Mucaki, Ami M. Perri, Ruipeng Lu, Joan H.M. Knoll and 1 more

BRCA1 and BRCA2 testing for hereditary breast and ovarian cancer (HBOC) does not identify all pathogenic variants. Sequencing of 20 complete genes in HBOC patients with uninformative test results (N = 287), including noncoding and flanking sequences of ATM, BARD1, BRCA1, BRCA2, CDH1, CHEK2, EPCAM, MLH1, MRE11A, MSH2, MSH6, MUTYH, NBN, PALB2, PMS2, PTEN, RAD51B, STK11, TP53, and XRCC2, identified 38,372 unique variants. We apply information theory (IT) to predict and prioritize variants of uncertain significance in regulatory,...

10.1002/humu.22972 article EN Human Mutation 2016-02-22

Predicting Outcomes of Hormone and Chemotherapy in the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) Study by Biochemically-inspired Machine Learning

Eliseos J. Mucaki, Katherina Baranova, Huy Quang Pham, Iman Rezaeian, Dimo Angelov and 3 more

Genomic aberrations and gene expression-defined subtypes in the large METABRIC patient cohort have been used to stratify and predict survival. The present study used normalized gene expression signatures of paclitaxel drug response to predict outcome for different survival times in patients receiving hormone (HT) and, in some cases, chemotherapy (CT) agents. This machine learning method, which distinguishes sensitivity vs. resistance in breast cancer cell lines and validates predictions in patients, was also used to derive signatures of other HT...

10.12688/f1000research.9417.3 preprint EN cc-by F1000Research 2017-05-12

Prevalence and spectrum of germline rare variants in BRCA1/2 and PALB2 among breast cancer cases in Sarawak, Malaysia

Xiaohong R. Yang, Beena C.R. Devi, Hyuna Sung, Jennifer Guida, Eliseos J. Mucaki and 14 more

10.1007/s10549-017-4356-8 article EN Breast Cancer Research and Treatment 2017-06-29

A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer

Eliseos J. Mucaki, Natasha Caminsky, Ami M. Perri, Ruipeng Lu, Alain Laederach and 3 more

Sequencing of both healthy and disease singletons yields many novel and low frequency variants of uncertain significance (VUS). Complete gene and genome sequencing by next generation sequencing (NGS) significantly increases the number of VUS detected. While prior studies have emphasized protein coding variants, non-coding sequence variants have also been proven to contribute to high penetrance disorders, such as hereditary breast and ovarian cancer (HBOC). We present a strategy for analyzing different functional classes based on...

10.1186/s12920-016-0178-5 article EN cc-by BMC Medical Genomics 2016-04-11

Discovery and validation of information theory-based transcription factor and cofactor binding site motifs

Ruipeng Lu, Eliseos J. Mucaki, Peter K. Rogan

Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity, and detects adjacent cofactor binding sites that...

10.1093/nar/gkw1036 article EN cc-by Nucleic Acids Research 2016-10-21

Expression Changes Confirm Genomic Variants Predicted to Result in Allele-Specific, Alternative mRNA Splicing

Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan

Splice isoform structure and abundance can be affected by either non-coding or masquerading coding variants that alter the structure or abundance of transcripts. When these variants are common in the population, these non-constitutive transcripts are sufficiently frequent so as to resemble naturally occurring, alternative mRNA splicing. Prediction of the effects of such variants has been shown to be accurate using information theory-based methods. Single nucleotide polymorphisms (SNPs) predicted to significantly alter natural and/or cryptic splice site strength were...

10.3389/fgene.2020.00109 article EN cc-by Frontiers in Genetics 2020-03-05

The dual-specificity phosphatase hYVH1 interacts with Hsp70 and prevents heat-shock-induced cell death

Priya Sharda, Christopher A. Bonham, Eliseos J. Mucaki, Zareen Butt, Panayiotis O. Vacratsis

hYVH1 [human orthologue of YVH1 (yeast VH1-related phosphatase)] is an atypical dual-specificity phosphatase that is widely conserved throughout evolution. Deletion studies in yeast have suggested a role for this phosphatase in regulating cell growth. However, the role of the human orthologue is unknown. The present study used MS to identify Hsp70 (heat-shock protein 70) as a novel hYVH1-binding partner. The interaction was confirmed using endogenous co-immunoprecipitation experiments and direct binding of purified proteins. Endogenous...

10.1042/bj20081484 article EN Biochemical Journal 2008-10-31

Comprehensive prediction of mRNA splicing effects of BRCA1 and BRCA2 variants

Eliseos J. Mucaki, Peter Ainsworth, Peter K. Rogan

Variants of uncertain significance (VUS) in the BRCA1 and BRCA2 genes potentially affecting coding sequence as well as normal splicing activity have confounded predisposition testing in breast cancer. Here, we apply information theory to analyze BRCA1/2 mRNA splicing mutations categorized as VUS. The method was validated for 31 of 36 mutations known to cause missplicing and all 26 that do not alter splicing. All single-nucleotide variants in the Breast Cancer Information Resource (BIC; Core Database; http://research.nhgri.nih.gov/bic;...

10.1002/humu.21513 article EN Human Mutation 2011-04-26

Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences

Ben C. Shirley, Eliseos J. Mucaki, Tyson Whitehead, Paul Igor Costea, Pelin Akan and 1 more

Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants affecting mRNA splicing. Individual information contents (in bits) of reference and variant splice sites are compared, and significant differences are annotated and prioritized. The software has been implemented for the CLC-Bio Genomics platform. Annotation...

10.1016/j.gpb.2013.01.008 article EN cc-by-nc-sa Genomics Proteomics & Bioinformatics 2013-03-14

Assessment of the functional impact of germline BRCA1/2 variants located in non-coding regions in families with breast and/or ovarian cancer predisposition

E. Santana dos Santos, Sandrine M. Caputo, Laurent Castéra, Mathilde Gendrot, Adrien Briaux and 12 more

10.1007/s10549-017-4602-0 article EN Breast Cancer Research and Treatment 2017-12-13

Predicting ionizing radiation exposure using biochemically-inspired genomic machine learning

Jonathan Z.L. Zhao, Eliseos J. Mucaki, Peter K. Rogan

Background: Gene signatures derived from transcriptomic data using machine learning methods have shown promise for biodosimetry testing. These signatures may not be sufficiently robust for large scale testing, as their performance has not been adequately validated on external, independent datasets. The present study develops human and murine signatures with biochemically-inspired machine learning that are strictly validated using k-fold and traditional approaches. Methods: Gene Expression Omnibus...

10.12688/f1000research.14048.2 preprint EN cc-by F1000Research 2018-06-15

Interpretation of mRNA splicing mutations in genetic disease: review of the literature and guidelines for information-theoretical analysis

Natasha Caminsky, Eliseos J. Mucaki, Peter K. Rogan

The interpretation of genomic variants has become one of the paramount challenges in the post-genome sequencing era. In this review we summarize nearly 20 years of research on applications of information theory (IT) to interpret coding and non-coding mutations that alter mRNA splicing in rare and common diseases. We compile the spectrum of published variants analyzed by IT, to provide a broad perspective of the distribution of deleterious natural and cryptic splice site variants detected, as well as those affecting splicing regulatory sequences. Results for...

10.12688/f1000research.5654.2 preprint EN cc-by F1000Research 2015-03-17

BRCA1 and BRCA2 5′ noncoding region variants identified in breast cancer patients alter promoter activity and protein binding

Leslie Burke, Jan Ševčík, Gaetana Gambino, Emma Tudini, Eliseos J. Mucaki and 32 more

The widespread use of next generation sequencing for clinical testing is detecting an escalating number of variants in noncoding regions of the genome. The significance of the majority of these variants is currently unknown, which presents a significant challenge. We have screened over 6,000 early-onset and/or familial breast cancer (BC) cases collected by the ENIGMA consortium for sequence variants in the 5′ noncoding regions of the BC susceptibility genes BRCA1 and BRCA2, and identified 141 rare variants with global minor allele frequency < 0.01, 76 of which have not been reported previously....

10.1002/humu.23652 article EN cc-by Human Mutation 2018-09-11

Predicting Outcomes of Hormone and Chemotherapy in the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) Study by Biochemically-inspired Machine Learning

Iman Rezaeian, Eliseos J. Mucaki, Katherina Baranova, Huy Quang Pham, Dimo Angelov and 3 more

Genomic aberrations and gene expression-defined subtypes in the large METABRIC patient cohort have been used to stratify and predict survival. The present study used normalized gene expression signatures of paclitaxel drug response to predict outcome for different survival times in patients receiving hormone (HT) and, in some cases, chemotherapy (CT) agents. This machine learning method, which distinguishes sensitivity vs. resistance in breast cancer cell lines and validates predictions in patients, was also used to derive signatures of other HT...

10.12688/f1000research.9417.1 preprint EN cc-by F1000Research 2016-08-31

A proposed molecular mechanism for pathogenesis of severe RNA-viral pulmonary infections

Peter K. Rogan, Eliseos J. Mucaki, Ben C. Shirley

Background: Certain riboviruses can cause severe pulmonary complications leading to death in some infected patients. We propose that DNA damage induced-apoptosis accelerates viral release, triggered by depletion of host RNA binding proteins (RBPs) from nuclear RNA bound to replicating viral sequences. Methods: Information theory-based analysis of interactions between RBPs and individual sequences in the Severe Acute Respiratory Syndrome CoronaVirus 2...

10.12688/f1000research.25390.2 preprint EN cc-by F1000Research 2021-01-06

Meeting radiation dosimetry capacity requirements of population-scale exposures by geostatistical sampling

Peter K. Rogan, Eliseos J. Mucaki, Ruipeng Lu, Ben C. Shirley, Edward Waller and 1 more

Background: Accurate radiation dose estimates are critical for determining eligibility for therapies by timely triaging of exposed individuals after large-scale events. However, the universal assessment of a large population subjected to a nuclear spill incident or detonation is not feasible. Even with high-throughput dosimetry analysis, test volumes far exceed the capacities of first responders to measure exposures directly, or to acquire and process samples for follow-on biodosimetry testing. Aim: To significantly...

10.1371/journal.pone.0232008 article EN cc-by PLoS ONE 2020-04-24

Predicting ionizing radiation exposure using biochemically-inspired genomic machine learning

Jonathan Z.L. Zhao, Eliseos J. Mucaki, Peter K. Rogan

Background: Gene signatures derived from transcriptomic data using machine learning methods have shown promise for biodosimetry testing. These signatures may not be sufficiently robust for large scale testing, as their performance has not been adequately validated on external, independent datasets. The present study develops human and murine signatures with biochemically-inspired machine learning that are strictly validated using k-fold and traditional approaches. Methods: Gene Expression Omnibus...

10.12688/f1000research.14048.1 preprint EN cc-by F1000Research 2018-02-27

A proposed molecular mechanism for pathogenesis of severe RNA-viral pulmonary infections

Peter K. Rogan, Eliseos J. Mucaki, Ben C. Shirley

Background: Certain riboviruses can cause severe pulmonary complications leading to death in some infected patients. We propose that DNA damage induced-apoptosis accelerates viral release, triggered by depletion of host RNA binding proteins (RBPs) from nuclear RNA bound to replicating viral sequences. Methods: Information theory-based analysis of interactions between RBPs and individual sequences in the Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2), Influenza A (H3N1), HIV-1, and Dengue genomes...

10.12688/f1000research.25390.1 preprint EN cc-by F1000Research 2020-08-07

Pan-cancer repository of validated natural and cryptic mRNA splicing mutations

Ben C. Shirley, Eliseos J. Mucaki, Peter K. Rogan

We present a major public resource of mRNA splicing mutations validated according to multiple lines of evidence of abnormal gene expression. Likely mutations in all tumor types reported in the Cancer Genome Atlas (TCGA) were identified based on the comparative strengths of splice sites in tumor versus normal genomes, and then validated by respectively comparing counts of splice junction spanning and abundance of transcript reads in RNA-Seq data from matched tissues and tumors lacking these mutations. The comprehensive resource features 351,423 mutations,...

10.12688/f1000research.17204.1 preprint EN cc-by F1000Research 2018-12-07