Kathryn Roeder
- Genetic Associations and Epidemiology
- Gene expression and cancer classification
- Autism Spectrum Disorder Research
- Single-cell and spatial transcriptomics
- Genomic variations and chromosomal abnormalities
- Statistical Methods and Inference
- Genomics and Rare Diseases
- Bioinformatics and Genomic Networks
- Genetics and Neurodevelopmental Disorders
- Genetic and phenotypic traits in livestock
- Genetic Mapping and Diversity in Plants and Animals
- Statistical Methods and Bayesian Inference
- Bayesian Methods and Mixture Models
- Statistical Methods in Clinical Trials
- Gene Regulatory Network Analysis
- Forensic and Genetic Research
- CRISPR and Genetic Engineering
- Congenital heart defects research
- RNA Research and Splicing
- Genomics and Chromatin Dynamics
- Virology and Viral Diseases
- Bayesian Modeling and Causal Inference
- Face and Expression Recognition
- Functional Brain Connectivity Studies
- Advanced Statistical Methods and Models
Carnegie Mellon University
2016-2025
University of Pittsburgh
2008-2023
Genomics (United Kingdom)
2021
Child Health and Development Institute
2021
Johns Hopkins University
2020
Massachusetts General Hospital
2019
Vanderbilt University
2011-2015
Case Western Reserve University
2014
University of Illinois Chicago
2013
West Chester University
2013
Summary. A dense set of single nucleotide polymorphisms (SNPs) covering the genome and an efficient method to assess SNP genotypes are expected to be available in the near future. An outstanding question is how to use these technologies efficiently to identify genes affecting liability to complex disorders. To achieve this goal, we propose a statistical method that has several optimal properties: it can be used with case‐control data and yet, like family‐based designs, controls for population heterogeneity; it is insensitive...
This article introduces a new SAS procedure written by the authors that analyzes longitudinal data (developmental trajectories) by fitting a mixture model. The TRAJ procedure fits semiparametric (discrete) mixtures of censored normal, Poisson, zero-inflated Poisson, and Bernoulli distributions to data. Applications to psychometric scale data, offense counts, and a dichotomous prevalence measure in violence research are illustrated. In addition, the use of the Bayesian information criterion to address the problem of model selection, including...
To examine the prevalence, correlates, persistence, and treatment-seeking related to symptoms of eating disorders (EDs) in a random sample of college students. A random sample of students at a large university were recruited for an Internet survey in Fall 2005 and a follow-up survey in Fall 2007. ED symptoms were measured using the SCOFF screen and adjusted for nonresponse using administrative data and a nonresponse survey. 2,822 (56%) students completed the baseline survey. Among undergraduates, the prevalence of positive screens was 13.5% for women and 3.6% for men. Among students with positive screens, 20% had received past-year mental...
Technological advances make it possible to use high-throughput sequencing as a primary discovery tool of medical genetics, specifically for assaying rare variation. Still, this approach faces the analytic challenge that the influence of very rare variants can only be evaluated effectively as a group. A further complication is that any given rare variant could have no effect, could increase risk, or could be protective. We propose here the C-alpha test statistic as a novel approach for testing for the presence of this mixture of effects across a set of rare variants. Unlike existing...
This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis testing to eliminate some variables. We refer to the first two stages as "screening" and the last stage as "cleaning." We consider three screening methods: the lasso, marginal regression, and forward...
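The three stages described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it uses marginal regression for screening, and it assumes a random three-way split of the data (one part per stage), a single held-out split in place of full cross-validation, a normal approximation for the p-values, and a Bonferroni correction in the cleaning stage. The function name `screen_and_clean` and its parameters are hypothetical.

```python
import math
import numpy as np


def _ols(X, y):
    """Least-squares fit with an intercept; returns design matrix and coefficients."""
    A = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return A, beta


def screen_and_clean(X, y, sizes=(1, 2, 5, 10), alpha=0.05, seed=0):
    """Hypothetical sketch of a screen-then-clean variable selection procedure."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    # Assumption: one third of the data is reserved for each stage.
    d1, d2, d3 = np.array_split(rng.permutation(n), 3)

    # Stage 1 (screening): rank variables by absolute marginal association
    # on the first split and form nested candidate models of several sizes.
    Xc = X[d1] - X[d1].mean(axis=0)
    yc = y[d1] - y[d1].mean()
    score = np.abs(Xc.T @ yc) / (np.linalg.norm(Xc, axis=0) + 1e-12)
    order = np.argsort(-score)
    candidates = [order[:k] for k in sizes if k <= p]

    # Stage 2 (screening): keep the candidate with the smallest
    # prediction error on the second, held-out split.
    def holdout_mse(S):
        _, beta = _ols(X[d1][:, S], y[d1])
        pred = beta[0] + X[d2][:, S] @ beta[1:]
        return np.mean((y[d2] - pred) ** 2)

    S = min(candidates, key=holdout_mse)

    # Stage 3 (cleaning): refit on the third split and drop variables whose
    # coefficient is not significant after a Bonferroni correction.
    A, beta = _ols(X[d3][:, S], y[d3])
    resid = y[d3] - A @ beta
    sigma2 = resid @ resid / (len(d3) - A.shape[1])
    se = np.sqrt(sigma2 * np.diag(np.linalg.inv(A.T @ A)))
    t = beta[1:] / se[1:]
    # Two-sided p-values under a normal approximation to the t distribution.
    pvals = np.array([math.erfc(abs(v) / math.sqrt(2)) for v in t])
    keep = pvals < alpha / len(S)
    return sorted(int(j) for j in S[keep])


# Simulated data: only the first three of 50 variables carry signal.
rng = np.random.default_rng(1)
X = rng.normal(size=(600, 50))
y = 3 * X[:, 0] + 3 * X[:, 1] + 3 * X[:, 2] + rng.normal(size=600)
selected = screen_and_clean(X, y)
print(selected)
```

Using separate data for each stage keeps the hypothesis tests in the cleaning stage from being biased by the selection made during screening, which is the motivation for the split assumed here.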
Abstract. Mixtures of normals provide a flexible model for estimating densities in a Bayesian framework. There are some difficulties with this model, however. First, standard reference priors yield improper posteriors. Second, the posterior for the number of components in the mixture is not well defined (if the reference prior is used). Third, posterior simulation does not provide a direct estimate of the posterior for the number of components. We present some practical methods for coping with these problems. Finally, we give some results on the consistency of the method when the maximum number of components is allowed to grow with the sample size.
We describe an R package named huge which provides easy-to-use functions for estimating high dimensional undirected graphs from data. This package implements recent results in the literature, including Friedman et al. (2007), Liu et al. (2009, 2012) and Liu et al. (2010). Compared with the existing graph estimation package glasso, the huge package provides extra features: (1) instead of using Fortran, it is written in C, which makes the code more portable and easier to modify; (2) besides fitting Gaussian graphical models, it also fits semiparametric copula models; (3) like...
Abstract. Background: Autism spectrum disorders (ASD) are early onset neurodevelopmental syndromes typified by impairments in reciprocal social interaction and communication, accompanied by restricted and repetitive behaviors. While rare and especially de novo genetic variation are known to affect liability, whether common genetic polymorphism plays a substantial role is an open question and the relative contribution of genes and environment is contentious. It is probable that the relative contributions of rare and common variation, as well as environment, differs...
While it is apparent that rare variation can play an important role in the genetic architecture of autism spectrum disorders (ASDs), the contribution of common variation to the risk of developing ASD is less clear. To produce a more comprehensive picture, we report Stage 2 of the Autism Genome Project genome-wide association study, adding 1301 ASD families and bringing the total to 2705 families analysed (Stages 1 and 2). In addition to evaluating the association of individual single nucleotide polymorphisms (SNPs), we also sought evidence that common variants, en masse, might affect...