NFDI4DS | UHH-SEMS - Publication Details

Qunhua Li

ORCID: 0000-0003-0675-7648

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5029682068

Research Areas

Genomics and Chromatin Dynamics
Gene expression and cancer classification
Epigenetics and DNA Methylation
Statistical Methods and Inference
Single-cell and spatial transcriptomics
Diet and metabolism studies
Bioinformatics and Genomic Networks
Molecular Biology Techniques and Applications
Genetic Mapping and Diversity in Plants and Animals
Genomics and Phylogenetic Studies
Chromosomal and Genetic Variations
Statistical Methods in Clinical Trials
Advanced Proteomics Techniques and Applications
RNA modifications and cancer
Nutrition and Health in Aging
Dietary Effects on Health
Mass Spectrometry Techniques and Applications
Cell Image Analysis Techniques
Vitamin C and Antioxidants Research
Bayesian Methods and Mixture Models
Gene Regulatory Network Analysis
Metabolism, Diabetes, and Cancer
Advanced Statistical Methods and Models
Retinoids in leukemia and cellular processes
Antioxidant Activity and Oxidative Stress

Pennsylvania State University
2015-2024

Chengdu University
2022

Agilent Technologies (United States)
2017

Affiliated Hospital of North Sichuan Medical College
2016

University of Washington
2006-2012

University of Chicago
2010-2012

University of California, Berkeley
2009-2011

Guangdong General Hospital
2009

Guangdong Academy of Medical Sciences
2009

Fred Hutch Cancer Center
2007

ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia

OPENALEX - Publications

Stephen G. Landt Georgi K. Marinov Anshul Kundaje Pouya Kheradpour Florencia Pauli and 42 more

Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcription-factor binding histone modifications in living cells. Despite its widespread use, there are considerable differences how these experiments conducted, results scored evaluated quality, data metadata archived public use. These practices affect quality utility any global ChIP experiment. Through our experience...

10.1101/gr.136184.111 article EN cc-by-nc Genome Research 2012-09-01

Measuring reproducibility of high-throughput experiments

OPENALEX - Publications

Qunhua Li James B. Brown Haiyan Huang Peter J. Bickel

Reproducibility is essential to reliable scientific discovery in high-throughput experiments. In this work we propose a unified approach measure the reproducibility of findings identified from replicate experiments and identify putative discoveries using reproducibility. Unlike usual scalar measures reproducibility, our creates curve, which quantitatively assesses when are no longer consistent across replicates. Our curve fitted by copula mixture model, derive quantitative score, call...

10.1214/11-aoas466 article EN other-oa The Annals of Applied Statistics 2011-09-01

HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient

OPENALEX - Publications

Tao Yang Feipeng Zhang Galip Gürkan Yardımcı Fan Song Ross C. Hardison and 3 more

Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods assessing data reproducibility can produce misleading results because they ignore spatial features in data, such as domain structure and distance dependence. We present HiCRep, framework the of that systematically accounts these features. In particular, we introduce novel similarity measure, stratum adjusted correlation coefficient (SCC), quantifying between interaction matrices. Not only...

10.1101/gr.220640.117 article EN cc-by-nc Genome Research 2017-08-30

Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data

OPENALEX - Publications

Timothy L. Bailey Paweł Krajewski István Ladunga Céline Lefèbvre Qunhua Li and 4 more

Mapping the chromosomal locations of transcription factors, nucleosomes, histone modifications, chromatin remodeling enzymes, chaperones, and polymerases is one key tasks modern biology, as evidenced by Encyclopedia DNA Elements (ENCODE) Project. To this end, immunoprecipitation followed high-throughput sequencing (ChIP-seq) standard methodology. such protein-DNA interactions in vivo using ChIP-seq presents multiple challenges not only sample preparation but also for computational analysis....

10.1371/journal.pcbi.1003326 article EN cc-by PLoS Computational Biology 2013-11-14

Measuring the reproducibility and quality of Hi-C data

OPENALEX - Publications

Galip Gürkan Yardımcı Hakan Özadam Michael Sauria Oana Ursu Koon‐Kiu Yan and 14 more

Hi-C is currently the most widely used assay to investigate 3D organization of genome and study its role in gene regulation, DNA replication, disease. However, experiments are costly perform involve multiple complex experimental steps; thus, accurate methods for measuring quality reproducibility data essential determine whether output should be further a study. Using real simulated data, we profile performance several recently proposed assessing population including HiCRep, GenomeDISCO,...

10.1186/s13059-019-1658-7 article EN cc-by Genome biology 2019-03-19

Systematic evaluation of factors influencing ChIP-seq fidelity

OPENALEX - Publications

Yiwen Chen Nicolas Nègre Qunhua Li Joanna O. Mieczkowska Matthew Slattery and 12 more

10.1038/nmeth.1985 article EN Nature Methods 2012-04-22

OnTAD: hierarchical domain structure reveals the divergence of activity among TADs and boundaries

OPENALEX - Publications

Lin An Tao Yang Jiahao Yang Johannes Nuebler Guanjue Xiang and 3 more

The spatial organization of chromatin in the nucleus has been implicated regulating gene expression. Maps high-frequency interactions between different segments have revealed topologically associating domains (TADs), within which most regulatory are thought to occur. TADs not homogeneous structural units but appear be organized into a hierarchy. We present OnTAD, an optimized nested TAD caller from Hi-C data, identify hierarchical TADs. OnTAD reveals new biological insights role levels,...

10.1186/s13059-019-1893-y article EN cc-by Genome biology 2019-12-01

An integrative view of the regulatory and transcriptional landscapes in mouse hematopoiesis

OPENALEX - Publications

Guanjue Xiang Cheryl A. Keller Elisabeth F. Heuston Belinda Giardine Lin An and 19 more

Thousands of epigenomic data sets have been generated in the past decade, but it is difficult for researchers to effectively use all relevant their projects. Systematic integrative analysis can help meet this need, and VISION project was established

10.1101/gr.255760.119 article EN cc-by-nc Genome Research 2020-03-01

S3norm: simultaneous normalization of sequencing depth and signal-to-noise ratio in epigenomic data

OPENALEX - Publications

Guanjue Xiang Cheryl A. Keller Belinda Giardine Lin An Qunhua Li and 2 more

Quantitative comparison of epigenomic data across multiple cell types or experimental conditions is a promising way to understand the biological functions epigenetic modifications. However, differences in sequencing depth and signal-to-noise ratios from different experiments can hinder our ability identify real variation raw data. Proper normalization required prior analysis gain meaningful insights. Most existing methods for standardize signals by rescaling either background regions peak...

10.1093/nar/gkaa105 article EN cc-by Nucleic Acids Research 2020-02-10

Interspecies regulatory landscapes and elements revealed by novel joint systematic integration of human and mouse blood cell epigenomes

OPENALEX - Publications

Guanjue Xiang Xi He Belinda Giardine Kathryn J. Isaac Dylan J. Taylor and 29 more

Knowledge of locations and activities

10.1101/gr.277950.123 article EN Genome Research 2024-07-01

A food-based approach that targets interleukin-6, a key regulator of chronic intestinal inflammation and colon carcinogenesis

OPENALEX - Publications

Abigail Sido Sridhar Radhakrishnan Sung Woo Kim Elisabeth Eriksson Frank Shen and 4 more

10.1016/j.jnutbio.2017.01.012 article EN publisher-specific-oa The Journal of Nutritional Biochemistry 2017-01-28

Robust bent line regression

OPENALEX - Publications

Feipeng Zhang Qunhua Li

10.1016/j.jspi.2017.01.001 article EN Journal of Statistical Planning and Inference 2017-01-21

A nested mixture model for protein identification using mass spectrometry

OPENALEX - Publications

Qunhua Li Michael J. MacCoss Matthew Stephens

Mass spectrometry provides a high-throughput way to identify proteins in biological samples. In typical experiment, sample are first broken into their constituent peptides. The resulting mixture of peptides is then subjected mass spectrometry, which generates thousands spectra, each characteristic its generating peptide. Here we consider the problem inferring, from these and present sample. We develop statistical approach problem, based on nested model. contrast commonly used two-stage...

10.1214/09-aoas316 article EN The Annals of Applied Statistics 2010-06-01

Galaxy tools to study genome diversity

OPENALEX - Publications

Oscar C. Bedoya-Reina Aakrosh Ratan Richard Burhans Hie Lim Kim Belinda Giardine and 8 more

Intra-species genetic variation can be used to investigate population structure, selection, and gene flow in non-model vertebrates; due the plummeting costs for genome sequencing, it is now possible small labs obtain full-genome data from their species of interest. However, those may not have easy access to, familiarity with, computational tools analyze data. We created a suite Galaxy web server aimed at handling nucleotide amino-acid polymorphisms discovered by sequencing several...

10.1186/2047-217x-2-17 article EN cc-by GigaScience 2013-12-01

A continuous threshold expectile model

OPENALEX - Publications

Feipeng Zhang Qunhua Li

10.1016/j.csda.2017.07.005 article EN Computational Statistics & Data Analysis 2017-07-29

A semi-parametric statistical model for integrating gene expression profiles across different platforms

OPENALEX - Publications

Yafei Lyu Qunhua Li

Determining differentially expressed genes (DEGs) between biological samples is the key to understand how genotype gives rise phenotype. RNA-seq and microarray are two main technologies for profiling gene expression levels. However, considerable discrepancy has been found DEGs detected using technologies. Integration data across these platforms potential improve power reliability of DEG detection. We propose a rank-based semi-parametric model determine information different sources apply it...

10.1186/s12859-015-0847-y article EN cc-by BMC Bioinformatics 2016-01-11

powerTCR: A model-based approach to comparative analysis of the clone size distribution of the T cell receptor repertoire

OPENALEX - Publications

Hillary Koch Dmytro Starenki Sara J. Cooper R Myers Qunhua Li

Sequencing of the T cell receptor (TCR) repertoire is a powerful tool for deeper study immune response, but unique structure this type data makes its meaningful quantification challenging. We introduce new method, Gamma-GPD spliced threshold model, to address difficulty. This biologically interpretable model captures distribution TCR repertoire, demonstrates stability across varying sequencing depths, and permits comparative analysis any number sampled individuals. apply our method several...

10.1371/journal.pcbi.1006571 article EN cc-by PLoS Computational Biology 2018-11-28

Condition-adaptive fused graphical lasso (CFGL): An adaptive procedure for inferring condition-specific gene co-expression network

OPENALEX - Publications

Yafei Lyu Lingzhou Xue Feipeng Zhang Hillary Koch Laura Saba and 2 more

Co-expression network analysis provides useful information for studying gene regulation in biological processes. Examining condition-specific patterns of co-expression can provide insights into the underlying cellular processes activated a particular condition. One challenge this type is that sample sizes each condition are usually small, making statistical inference highly underpowered. A joint construction borrows from related structures across conditions has potential to improve power...

10.1371/journal.pcbi.1006436 article EN cc-by PLoS Computational Biology 2018-09-21

HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient

OPENALEX - Publications

Tao Yang Feipeng Zhang Galip Gürkan Yardımcı Fan Song Ross C. Hardison and 3 more

Abstract Hi-C is a powerful technology for studying genome-wide chromatin interactions. However, current methods assessing data reproducibility can produce misleading results because they ignore spatial features in data, such as domain structure and distance dependence. We present HiCRep, framework the of that systematically accounts these features. In particular, we introduce novel similarity measure, stratum adjusted correlation coefficient (SCC), quantifying between interaction matrices....

10.1101/101386 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2017-01-18

Individualized Modeling to Distinguish Between High and Low Arousal States Using Physiological Data

OPENALEX - Publications

Ame Osotsi Zita Oravecz Qunhua Li Joshua M. Smyth Timothy R. Brick

10.1007/s41666-019-00064-1 article EN Journal of Healthcare Informatics Research 2020-01-22

Measuring the reproducibility and quality of Hi-C data

OPENALEX - Publications

Galip Gürkan Yardımcı Hakan Özadam Michael Sauria Oana Ursu Koon‐Kiu Yan and 14 more

Abstract Hi-C is currently the most widely used assay to investigate 3D organization of genome and study its role in gene regulation, DNA replication, disease. However, experiments are costly perform involve multiple complex experimental steps; thus, accurate methods for measuring quality reproducibility data essential determine whether output should be further a study. Using real simulated data, we profile performance several recently proposed assessing population including HiCRep,...

10.1101/188755 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2017-09-14

Pigs, Unlike Mice, Have Two Distinct Colonic Stem Cell Populations Similar to Humans That Respond to High-Calorie Diet prior to Insulin Resistance

OPENALEX - Publications

Venkata Charepalli Lavanya Reddivari Sridhar Radhakrishnan Elisabeth Eriksson Xia Xiao and 7 more

Abstract Basal colonic crypt stem cells are long lived and play a role in colon homeostasis. Previous evidence has shown that high-calorie diet (HCD) enhances cell numbers expansion of the proliferative zone, an important biomarker for cancer. However, it is not clear how HCD drives dysregulation cell/colonocyte kinetics. We used human-relevant pig model developed immunofluorescence technique to detect quantify cells. Pigs (n = 8/group) were provided either standard (SD; 5% fat) or (23% 13...

10.1158/1940-6207.capr-17-0010 article EN Cancer Prevention Research 2017-06-03

Coming Soon ...