Ryan L. Collins
- Genomics and Rare Diseases
- Genomic variations and chromosomal abnormalities
- Genomics and Phylogenetic Studies
- CRISPR and Genetic Engineering
- Autism Spectrum Disorder Research
- Genetics and Neurodevelopmental Disorders
- Cancer Genomics and Diagnostics
- Congenital heart defects research
- Chromosomal and Genetic Variations
- RNA and protein synthesis mechanisms
- Genetic Associations and Epidemiology
- Prenatal Screening and Diagnostics
- Genomics and Chromatin Dynamics
- RNA modifications and cancer
- Adipose Tissue and Metabolism
- Genetic Neurodegenerative Diseases
- Genetics, Bioinformatics, and Biomedical Research
- Congenital Ear and Nasal Anomalies
- RNA Research and Splicing
- RNA regulation and disease
- Neurological diseases and metabolism
- Craniofacial Disorders and Treatments
- Parkinson's Disease Mechanisms and Treatments
- Single-cell and spatial transcriptomics
- Connective tissue disorders research
Massachusetts General Hospital
2015-2025
Harvard University
2016-2025
Broad Institute
2016-2025
Dana-Farber Cancer Institute
2023-2025
Massachusetts Institute of Technology
2019-2023
QorTek (United States)
2023
Williams (United States)
2023
SRM Institute of Science and Technology
2023
Center for Genomic Science
2018-2019
Harvard University Press
2019
Abstract Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences gene disruption: crucial for function an organism will be depleted such in natural populations, whereas non-essential tolerate their accumulation. However, predicted loss-of-function enriched annotation errors, and tend to found at extremely low frequencies, so analysis requires careful variant very large sample sizes 1 . Here we describe aggregation 125,748...
Summary Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences gene disruption: critical for an organism’s function will be depleted such in natural populations, while non-essential tolerate their accumulation. However, predicted loss-of-function (pLoF) enriched annotation errors, and tend to found at extremely low frequencies, so analysis requires careful variant very large sample sizes 1 . Here, we describe aggregation...
The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies human genetic diversity and disease association. Here, we apply a suite long-read, short-read, strand-specific technologies, optical mapping, variant discovery algorithms to comprehensively analyze three trios define the full spectrum variation in haplotype-resolved manner. We identify 818,054 indel (<50 bp) 27,622 SVs (≥50 per genome. also discover 156 inversions genome 58 intersect...
Structural variants (SVs) rearrange large segments of DNA
INTRODUCTION The DNA of protein-coding genes is transcribed into mRNA, which translated proteins. “coding genome” describes the that contains information to make these proteins and represents ~1.5% human genome. Newly arising de novo mutations (variants observed in a child but not either parent) coding genome contribute numerous childhood developmental disorders, including autism spectrum disorder (ASD). Discovery effects aided by triplet code enables functional impact many be readily...
Abstract The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders, but attempts assess constraint for non-protein-coding regions have proven more difficult. Here we aggregate, process, and release a dataset 76,156 genomes from the Genome Aggregation Database (gnomAD), largest public open-access genome reference dataset, use this build mutational map whole genome. We present...
Background: The relative prevalence and clinical importance of monogenic mutations related to familial hypercholesterolemia high polygenic score (cumulative impact many common variants) pathways for early-onset myocardial infarction remain uncertain. Whole-genome sequencing enables simultaneous ascertainment both each individual. Methods: We performed deep-coverage whole-genome 2081 patients from 4 racial subgroups hospitalized in the United States with (age ≤55 years) recruited a 2:1...
Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. We sequenced 689 participants with autism disorder (ASD) other developmental abnormalities construct a genome-wide map large SV. Using long-insert jumping libraries at 105X mean physical coverage linked-read whole-genome sequencing from 10X Genomics, we document seven major classes ~5 kb...
Abstract The acceleration of DNA sequencing in samples from patients and population studies has resulted extensive catalogues human genetic variation, but the interpretation rare variants remains problematic. A notable example this challenge is existence disruptive dosage-sensitive disease genes, even apparently healthy individuals. Here, by manual curation putative loss-of-function (pLoF) haploinsufficient genes Genome Aggregation Database (gnomAD) 1 , we show that one explanation for...
Abstract Naturally occurring human genetic variants that are predicted to inactivate protein-coding genes provide an in vivo model of gene inactivation complements knockout studies cells and organisms. Here we report three key findings regarding the assessment candidate drug targets using loss-of-function variants. First, even essential genes, which not tolerated, can be highly successful as inhibitory drugs. Second, most sufficiently rare genotype-based ascertainment homozygous or compound...
Upstream open reading frames (uORFs) are tissue-specific cis-regulators of protein translation. Isolated reports have shown that variants create or disrupt uORFs can cause disease. Here, in a systematic genome-wide study using 15,708 whole genome sequences, we show new upstream start codons, and disrupting stop sites existing uORFs, under strong negative selection. This selection signal is significantly stronger for arising genes intolerant to loss-of-function variants. Furthermore, creating...