NFDI4DS | UHH-SEMS - Publication Details

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

OPENALEX - Publications

Daniel Taliun, Daniel Harris, Michael D. Kessler, Jedidiah Carlson, Zachary A. Szpiech and 95 more

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype...

10.1038/s41586-021-03205-y article EN cc-by Nature 2021-02-10

Detection of a Recurrent DNAJB1-PRKACA Chimeric Transcript in Fibrolamellar Hepatocellular Carcinoma

Joshua N. Honeyman, Elana P. Simon, Nicolas Robine, Rachel Chiaroni-Clarke, David G. Darcy and 14 more

Oncogenic Suspect Exposed. It can be difficult logistically to study the genomics of rare variants of common cancers. Nevertheless, Honeyman et al. (p. 1010) studied fibrolamellar hepatocellular carcinoma (FL-HCC), a rare and poorly understood liver tumor that affects adolescents and young adults and for which there is no effective treatment. FL-HCCs from 15 patients all expressed a chimeric RNA transcript and protein containing sequences from a molecular chaperone fused in frame with sequences from the catalytic domain of protein kinase A. The...

10.1126/science.1249484 article EN Science 2014-02-27

Comparative sequencing analysis reveals high genomic concordance between matched primary and metastatic colorectal cancer lesions

A. Rose Brannon, Efsevia Vakiani, Brooke E. Sylvester, Sasinya N. Scott, Gregory McDermott and 15 more

Colorectal cancer is the second leading cause of cancer death in the United States, with over 50,000 deaths estimated in 2014. Molecular profiling for somatic mutations that predict absence of response to anti-EGFR therapy has become standard practice in the treatment of metastatic colorectal cancer; however, the quantity and type of tissue available for testing is frequently limited. Further, the degree to which the primary tumor is a faithful representation of metastatic disease has been questioned. As next-generation sequencing technology becomes more widely...

10.1186/s13059-014-0454-7 article EN cc-by Genome Biology 2014-08-01

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Daniel Taliun, Daniel Harris, Michael D. Kessler, Jedidiah Carlson, Zachary A. Szpiech and 95 more

The Trans-Omics for Precision Medicine (TOPMed) program seeks to elucidate the genetic architecture and disease biology of heart, lung, blood, and sleep disorders, with the ultimate goal of improving diagnosis, treatment, and prevention. The initial phases of the program focus on whole genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here, we describe TOPMed goals and design as well as resources and early insights from the sequence data. The resources include a variant browser, a genotype imputation panel, sharing...

10.1101/563866 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2019-03-06

RazerS—fast read mapping with sensitivity control

David Weese, Anne‐Katrin Emde, Tobias Rausch, Andreas Gogol‐Döring, Knut Reinert

Second-generation sequencing technologies deliver DNA sequence data at unprecedented high throughput. Common to most biological applications is a mapping of the reads to an almost identical or highly similar reference genome. Due to the large amounts of data, efficient algorithms and implementations are crucial for this task. We present a read mapping tool called RazerS. It allows the user to align reads of arbitrary length using either the Hamming distance or the edit distance. Our tool can work either lossless or with a user-defined loss rate at higher...

10.1101/gr.088823.108 article EN cc-by-nc Genome Research 2009-07-10

Genome-wide somatic variant calling using localized colored de Bruijn graphs

Giuseppe Narzisi, André Corvelo, Kanika Arora, Ewa A. Bergmann, Minita Shah and 5 more

Reliable detection of somatic variations is of critical importance in cancer research. Here we present Lancet, an accurate and sensitive variant caller, which detects SNVs and indels by jointly analyzing reads from tumor and matched normal samples using colored de Bruijn graphs. We demonstrate, through extensive experimental comparison on synthetic and real whole-genome sequencing datasets, that Lancet has better accuracy, especially for indel detection, than widely used callers, such as MuTect, MuTect2,...

10.1038/s42003-018-0023-9 article EN cc-by Communications Biology 2018-03-14

Genetic mechanisms of primary chemotherapy resistance in pediatric acute myeloid leukemia

Nicole McNeer, John Philip, Heather Geiger, Rhonda E. Ries, Vincent‐Philippe Lavallée and 13 more

10.1038/s41375-019-0402-3 article EN Leukemia 2019-02-13

The gout epidemic in French Polynesia: a modelling study of data from the Ma’i u’u epidemiological survey

Tristan Pascart, Kaja A. Wasik, Cristian Preda, Valérie Chune, Jérémie Torterat and 25 more

Gout is the most common cause of inflammatory arthritis worldwide, particularly in Pacific regions. We aimed to establish the prevalence of gout and hyperuricaemia in French Polynesia, their associations with dietary habits, comorbidities, the HLA-B*58:01 allele, and current management of the disease.

10.1016/s2214-109x(24)00012-3 article EN cc-by-nc-nd The Lancet Global Health 2024-03-12

A novel and well-defined benchmarking method for second generation read mapping

Manuel Holtgrewe, Anne‐Katrin Emde, David Weese, Knut Reinert

Second generation sequencing technologies yield DNA sequence data at ultra high-throughput. Common to most biological applications is a mapping of the reads to an almost identical or highly similar reference genome. The assessment of the quality of read mapping results is not straightforward and has not been formalized so far. Hence, it has not been easy to compare different read mapping approaches in a unified way and to determine which program is the best for what task. We present a new benchmark method, called Rabema (Read Alignment BEnchMArk), for read mappers. It...

10.1186/1471-2105-12-210 article EN cc-by BMC Bioinformatics 2011-05-26

Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using SplazerS

Anne‐Katrin Emde, Marcel H. Schulz, David Weese, Ruping Sun, Martin Vingron and 3 more

Motivation: The reliable detection of genomic variation in resequencing data is still a major challenge, especially for variants larger than a few base pairs. Sequencing reads crossing the boundaries of structural variation carry the potential for their identification, but are difficult to map. Results: Here we present a method for ‘split’ read mapping, where the prefix and suffix match of a read may be interrupted by a longer gap in the read-to-reference alignment. We use this method to accurately detect medium-sized insertions and long deletions...

10.1093/bioinformatics/bts019 article EN Bioinformatics 2012-01-11

Disease variants in genomes of 44 centenarians

Yun Freudenberg‐Hua, Jan Freudenberg, Vladimir Vacic, Avinash Abhyankar, Anne‐Katrin Emde and 10 more

To identify previously reported disease mutations that are compatible with extraordinary longevity, we screened the coding regions of the genomes of 44 Ashkenazi Jewish centenarians. Individual genome sequences were generated with 30× coverage on the Illumina HiSeq 2000 and single-nucleotide variants were called with the genome analysis toolkit (GATK). We identified 130 variants that were annotated as "pathogenic" or "likely pathogenic" based on the ClinVar database and that are infrequent in the general population. These variants were reported to cause a wide range of degenerative, neoplastic,...

10.1002/mgg3.86 article EN cc-by Molecular Genetics & Genomic Medicine 2014-06-15

Gustaf: Detecting and correctly classifying SVs in the NGS twilight zone

Kathrin Trappe, Anne‐Katrin Emde, Hans‐Christian Ehrlich, Knut Reinert

Motivation: The landscape of structural variation (SV) including complex duplication and translocation patterns is far from resolved. SV detection tools usually exhibit low agreement, are often geared toward certain types or size ranges of variation and struggle to correctly classify the type and exact size of SVs. Results: We present Gustaf (Generic mUlti-SpliT Alignment Finder), a sound generic multi-split detection tool that detects and classifies deletions, inversions, dispersed duplications and translocations of ≥30 bp. Our...

10.1093/bioinformatics/btu431 article EN Bioinformatics 2014-07-14

Diverse tumorigenic consequences of human papillomavirus integration in primary oropharyngeal cancers

David E. Symer, Keiko Akagi, Heather Geiger, Song Yang, Gaiyun Li and 16 more

Human papillomavirus (HPV) causes 5% of all cancers and frequently integrates into host chromosomes. The HPV oncoproteins E6 and E7 are necessary but insufficient for cancer formation, indicating that additional secondary genetic events are required. Here, we investigate potential oncogenic impacts of virus integration. Analysis of 105 HPV-positive oropharyngeal cancers by whole-genome sequencing detects virus integration in 77%, revealing five statistically significant sites of recurrent integration near genes that regulate epithelial...

10.1101/gr.275911.121 article EN cc-by-nc Genome Research 2021-12-13

Somatic whole genome dynamics of precancer in Barrett’s esophagus reveals features associated with disease progression

Thomas G. Paulson, Patricia C. Galipeau, Kenji Oman, Carissa A. Sanchez, Mary K. Kuhner and 17 more

While the genomes of normal tissues undergo dynamic changes over time, little is understood about the temporal-spatial dynamics of genomes in premalignant tissues that progress to cancer compared to those that remain cancer-free. Here we use whole genome sequencing to contrast genomic alterations in 427 longitudinal samples from 40 patients with stable Barrett’s esophagus compared to 40 Barrett’s patients who progressed to esophageal adenocarcinoma (ESAD). We show the same somatic mutational processes are active in Barrett’s tissue regardless of outcome, with high levels...

10.1038/s41467-022-29767-7 article EN cc-by Nature Communications 2022-04-28

Segment-based multiple sequence alignment

Tobias Rausch, Anne‐Katrin Emde, David Weese, Andreas Gogol‐Döring, Cédric Notredame and 1 more

Many multiple sequence alignment tools have been developed in the past, progressing either in speed or alignment accuracy. Given the importance and wide-spread use of alignment tools, progress in both categories is a contribution to the community and has driven research in the field so far. We introduce a graph-based extension to the consistency-based, progressive alignment strategy. We apply the consistency notion to segments instead of single characters. The main problem we solve in this context is to define segments of the sequences in such a way that a graph-based alignment is possible. We implemented the algorithm using...

10.1093/bioinformatics/btn281 article EN Bioinformatics 2008-08-09

Comparing sequencing assays and human-machine analyses in actionable genomics for glioblastoma

Kazimierz O. Wrzeszczyński, Mayu O. Frank, Takahiko Koyama, Kahn Rhrissorrakrai, Nicolas Robine and 21 more

To analyze a glioblastoma tumor specimen with 3 different platforms and compare potentially actionable calls from each. Tumor DNA was analyzed by a commercial targeted panel. In addition, tumor-normal DNA was analyzed by whole-genome sequencing (WGS) and tumor RNA was analyzed by RNA sequencing (RNA-seq). The WGS and RNA-seq data were analyzed by a team of bioinformaticians and cancer oncologists, and separately by IBM Watson Genomic Analytics (WGA), an automated system for prioritizing somatic variants and identifying drugs. More variants were identified by WGS/RNA analysis than by targeted panels. WGA completed...

10.1212/nxg.0000000000000164 article EN cc-by-nc-nd Neurology Genetics 2017-07-12

MicroRazerS: rapid alignment of small RNA reads

Anne‐Katrin Emde, Marcel Grunert, David Weese, Knut Reinert, Silke Sperling

Motivation: Deep sequencing has become the method of choice for determining the small RNA content of a cell. Mapping the sequenced reads onto their reference genome serves as the basis for all further analyses, namely for identification and quantification. A frequently used method is Mega BLAST followed by several filtering steps, even though it is slow and inefficient for this task. Also, none of the currently available short read aligners has established itself for the particular task of small RNA mapping. Results: We present MicroRazerS, a tool...

10.1093/bioinformatics/btp601 article EN Bioinformatics 2009-10-29

A consistency-based consensus algorithm for de novo and reference-guided sequence assembly of short reads

Tobias Rausch, Sergey Koren, Gennady Denisov, David Weese, Anne‐Katrin Emde and 2 more

Motivation: Novel high-throughput sequencing technologies pose new algorithmic challenges in handling massive amounts of short-read, high-coverage data. A robust and versatile consensus tool is of particular interest for such data since a sound multi-read alignment is a prerequisite for variation analyses, accurate genome assemblies and insert sequencing. Results: A consensus algorithm for de novo or reference-guided assembly is presented. The program identifies segments shared by multiple reads and then aligns these...

10.1093/bioinformatics/btp131 article EN Bioinformatics 2009-03-05

Analytical Validation of Clinical Whole-Genome and Transcriptome Sequencing of Patient-Derived Tumors for Reporting Targetable Variants in Cancer

Kazimierz O. Wrzeszczyński, Vanessa Felice, Avinash Abhyankar, Lukasz Kozon, Heather Geiger and 18 more

We developed and validated a clinical whole-genome and transcriptome sequencing (WGTS) assay that provides a comprehensive genomic profile of a patient's tumor. The ability to fully capture the mappable genome with sufficient sequencing coverage to precisely call DNA somatic single nucleotide variants, insertions/deletions, copy number variants, structural variants, and RNA gene fusions was analyzed. New York State's Department of Health next-generation sequencing guidelines were expanded for establishing performance validation applicable to whole-genome and transcriptome sequencing....

10.1016/j.jmoldx.2018.06.007 article EN cc-by-nc-nd Journal of Molecular Diagnostics 2018-08-21

Comparative sequencing analysis reveals high genomic concordance between matched primary and metastatic colorectal cancer lesions

A. Rose Brannon, Efsevia Vakiani, Brooke E. Sylvester, Sasinya N. Scott, Gregory C. McDermott and 15 more

Colorectal cancer is the second leading cause of cancer death in the United States, with over 50,000 deaths estimated in 2014. Molecular profiling for somatic mutations that predict absence of response to anti-EGFR therapy has become standard practice in the treatment of metastatic colorectal cancer; however, the quantity and type of tissue available for testing is frequently limited. Further, the degree to which the primary tumor is a faithful representation of metastatic disease has been questioned. As next-generation sequencing technology becomes more widely...

10.1186/preaccept-1207406452128377 article EN cc-by Genome Biology 2014-01-01

Analytical model of peptide mass cluster centres with applications.

Witold Wolski, Malcolm Farrow, Anne‐Katrin Emde, Hans Lehrach, Maciej Łałowski and 1 more

The elemental composition of peptides results in formation of distinct, equidistantly spaced clusters across the mass range. The property of peptide mass clustering is used to calibrate peptide mass lists, to identify and remove non-peptide peaks and for data reduction. We developed an analytical model of the peptide mass cluster centres. Inputs to the model included the amino acid frequencies in the sequence database, the average length of the proteins in the database, the cleavage specificity of the proteolytic enzyme and the cleavage probability. We examined the accuracy of our model by comparing it with a model based on an in silico database...

10.1186/1477-5956-4-18 article EN cc-by Proteome Science 2006-01-01

Novel patterns of complex structural variation revealed across thousands of cancer genome graphs

Kevin Hadi, Xiaotong Yao, Julie M. Behr, Aditya Deshpande, Charalampos Xanthopoulakis and 43 more

Cancer genomes often harbor hundreds of somatic DNA rearrangement junctions, many of which cannot be easily classified into simple (e.g. deletion, translocation) or complex (e.g. chromothripsis, chromoplexy) structural variant classes. Applying a novel genome graph computational paradigm to analyze the topology of junction copy number (JCN) across 2,833 tumor whole genome sequences (WGS), we introduce three phenomena: pyrgo, rigma, and tyfonas. Pyrgo are “towers” of low-JCN duplications associated with...

10.1101/836296 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2019-11-09

Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations

Anne‐Katrin Emde, Amanda Phipps‐Green, Murray Cadzow, Clair Gallagher, Tanya J. Major and 12 more

Background: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. Results: Here, we developed and benchmarked methods...

10.1186/s12864-021-07949-9 article EN cc-by BMC Genomics 2021-11-01

Whole Genome Sequencing-Based Discovery of Structural Variants in Glioblastoma

Kazimierz O. Wrzeszczyński, Vanessa Felice, Minita Shah, Sadia Rahman, Anne‐Katrin Emde and 3 more

10.1007/978-1-4939-7659-1_1 article EN Methods in Molecular Biology 2018-01-01