Qunhua Li

ORCID: 0000-0003-0675-7648
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Genomics and Chromatin Dynamics
  • Gene expression and cancer classification
  • Epigenetics and DNA Methylation
  • Statistical Methods and Inference
  • Single-cell and spatial transcriptomics
  • Diet and metabolism studies
  • Bioinformatics and Genomic Networks
  • Molecular Biology Techniques and Applications
  • Genetic Mapping and Diversity in Plants and Animals
  • Genomics and Phylogenetic Studies
  • Chromosomal and Genetic Variations
  • Statistical Methods in Clinical Trials
  • Advanced Proteomics Techniques and Applications
  • RNA modifications and cancer
  • Nutrition and Health in Aging
  • Dietary Effects on Health
  • Mass Spectrometry Techniques and Applications
  • Cell Image Analysis Techniques
  • Vitamin C and Antioxidants Research
  • Bayesian Methods and Mixture Models
  • Gene Regulatory Network Analysis
  • Metabolism, Diabetes, and Cancer
  • Advanced Statistical Methods and Models
  • Retinoids in leukemia and cellular processes
  • Antioxidant Activity and Oxidative Stress

Pennsylvania State University
2015-2024

Chengdu University
2022

Agilent Technologies (United States)
2017

Affiliated Hospital of North Sichuan Medical College
2016

University of Washington
2006-2012

University of Chicago
2010-2012

University of California, Berkeley
2009-2011

Guangdong General Hospital
2009

Guangdong Academy of Medical Sciences
2009

Fred Hutch Cancer Center
2007

Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding histone modifications in living cells. Despite its widespread use, there are considerable differences how these experiments conducted, results scored evaluated quality, data metadata archived public use. These practices affect quality utility any global ChIP experiment. Through our experience...

10.1101/gr.136184.111 article EN cc-by-nc Genome Research 2012-09-01

Reproducibility is essential to reliable scientific discovery in high-throughput experiments. In this work we propose a unified approach measure the reproducibility of findings identified from replicate experiments and identify putative discoveries using reproducibility. Unlike usual scalar measures reproducibility, our creates curve, which quantitatively assesses when are no longer consistent across replicates. Our curve fitted by copula mixture model, derive quantitative score, call...

10.1214/11-aoas466 article EN other-oa The Annals of Applied Statistics 2011-09-01

Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods assessing data reproducibility can produce misleading results because they ignore spatial features in data, such as domain structure and distance dependence. We present HiCRep, framework the of that systematically accounts these features. In particular, we introduce novel similarity measure, stratum adjusted correlation coefficient (SCC), quantifying between interaction matrices. Not only...

10.1101/gr.220640.117 article EN cc-by-nc Genome Research 2017-08-30

Mapping the chromosomal locations of transcription factors, nucleosomes, histone modifications, chromatin remodeling enzymes, chaperones, and polymerases is one key tasks modern biology, as evidenced by Encyclopedia DNA Elements (ENCODE) Project. To this end, immunoprecipitation followed high-throughput sequencing (ChIP-seq) standard methodology. such protein-DNA interactions in vivo using ChIP-seq presents multiple challenges not only sample preparation but also for computational analysis....

10.1371/journal.pcbi.1003326 article EN cc-by PLoS Computational Biology 2013-11-14

Hi-C is currently the most widely used assay to investigate 3D organization of genome and study its role in gene regulation, DNA replication, disease. However, experiments are costly perform involve multiple complex experimental steps; thus, accurate methods for measuring quality reproducibility data essential determine whether output should be further a study. Using real simulated data, we profile performance several recently proposed assessing population including HiCRep, GenomeDISCO,...

10.1186/s13059-019-1658-7 article EN cc-by Genome biology 2019-03-19

The spatial organization of chromatin in the nucleus has been implicated regulating gene expression. Maps high-frequency interactions between different segments have revealed topologically associating domains (TADs), within which most regulatory are thought to occur. TADs not homogeneous structural units but appear be organized into a hierarchy. We present OnTAD, an optimized nested TAD caller from Hi-C data, identify hierarchical TADs. OnTAD reveals new biological insights role levels,...

10.1186/s13059-019-1893-y article EN cc-by Genome biology 2019-12-01

Thousands of epigenomic data sets have been generated in the past decade, but it is difficult for researchers to effectively use all relevant their projects. Systematic integrative analysis can help meet this need, and VISION project was established

10.1101/gr.255760.119 article EN cc-by-nc Genome Research 2020-03-01

Quantitative comparison of epigenomic data across multiple cell types or experimental conditions is a promising way to understand the biological functions epigenetic modifications. However, differences in sequencing depth and signal-to-noise ratios from different experiments can hinder our ability identify real variation raw data. Proper normalization required prior analysis gain meaningful insights. Most existing methods for standardize signals by rescaling either background regions peak...

10.1093/nar/gkaa105 article EN cc-by Nucleic Acids Research 2020-02-10

10.1016/j.jspi.2017.01.001 article EN Journal of Statistical Planning and Inference 2017-01-21

Mass spectrometry provides a high-throughput way to identify proteins in biological samples. In typical experiment, sample are first broken into their constituent peptides. The resulting mixture of peptides is then subjected mass spectrometry, which generates thousands spectra, each characteristic its generating peptide. Here we consider the problem inferring, from these and present sample. We develop statistical approach problem, based on nested model. contrast commonly used two-stage...

10.1214/09-aoas316 article EN The Annals of Applied Statistics 2010-06-01

Intra-species genetic variation can be used to investigate population structure, selection, and gene flow in non-model vertebrates; due the plummeting costs for genome sequencing, it is now possible small labs obtain full-genome data from their species of interest. However, those may not have easy access to, familiarity with, computational tools analyze data. We created a suite Galaxy web server aimed at handling nucleotide amino-acid polymorphisms discovered by sequencing several...

10.1186/2047-217x-2-17 article EN cc-by GigaScience 2013-12-01

10.1016/j.csda.2017.07.005 article EN Computational Statistics & Data Analysis 2017-07-29

Determining differentially expressed genes (DEGs) between biological samples is the key to understand how genotype gives rise phenotype. RNA-seq and microarray are two main technologies for profiling gene expression levels. However, considerable discrepancy has been found DEGs detected using technologies. Integration data across these platforms potential improve power reliability of DEG detection. We propose a rank-based semi-parametric model determine information different sources apply it...

10.1186/s12859-015-0847-y article EN cc-by BMC Bioinformatics 2016-01-11

Sequencing of the T cell receptor (TCR) repertoire is a powerful tool for deeper study immune response, but unique structure this type data makes its meaningful quantification challenging. We introduce new method, Gamma-GPD spliced threshold model, to address difficulty. This biologically interpretable model captures distribution TCR repertoire, demonstrates stability across varying sequencing depths, and permits comparative analysis any number sampled individuals. apply our method several...

10.1371/journal.pcbi.1006571 article EN cc-by PLoS Computational Biology 2018-11-28

Co-expression network analysis provides useful information for studying gene regulation in biological processes. Examining condition-specific patterns of co-expression can provide insights into the underlying cellular processes activated a particular condition. One challenge this type is that sample sizes each condition are usually small, making statistical inference highly underpowered. A joint construction borrows from related structures across conditions has potential to improve power...

10.1371/journal.pcbi.1006436 article EN cc-by PLoS Computational Biology 2018-09-21

Abstract Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods assessing data reproducibility can produce misleading results because they ignore spatial features in data, such as domain structure and distance dependence. We present HiCRep, framework the of that systematically accounts these features. In particular, we introduce novel similarity measure, stratum adjusted correlation coefficient (SCC), quantifying between interaction matrices....

10.1101/101386 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2017-01-18

Abstract Hi-C is currently the most widely used assay to investigate 3D organization of genome and study its role in gene regulation, DNA replication, disease. However, experiments are costly perform involve multiple complex experimental steps; thus, accurate methods for measuring quality reproducibility data essential determine whether output should be further a study. Using real simulated data, we profile performance several recently proposed assessing population including HiCRep,...

10.1101/188755 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2017-09-14

Abstract Basal colonic crypt stem cells are long lived and play a role in colon homeostasis. Previous evidence has shown that high-calorie diet (HCD) enhances cell numbers expansion of the proliferative zone, an important biomarker for cancer. However, it is not clear how HCD drives dysregulation cell/colonocyte kinetics. We used human-relevant pig model developed immunofluorescence technique to detect quantify cells. Pigs (n = 8/group) were provided either standard (SD; 5% fat) or (23% 13...

10.1158/1940-6207.capr-17-0010 article EN Cancer Prevention Research 2017-06-03
Coming Soon ...