A semi-parametric statistical model for integrating gene expression profiles across different platforms
0301 basic medicine
Models, Statistical
Sequence Analysis, RNA
Gene Expression Profiling
High-Throughput Nucleotide Sequencing
Reproducibility of Results
Biochemistry
Computer Science Applications
03 medical and health sciences
Proceedings
Humans
RNA
Transcriptome
Molecular Biology
Oligonucleotide Array Sequence Analysis
DOI:
10.1186/s12859-015-0847-y
Publication Date:
2016-01-11T10:00:45Z
AUTHORS (2)
ABSTRACT
Determining differentially expressed genes (DEGs) between biological samples is the key to understand how genotype gives rise to phenotype. RNA-seq and microarray are two main technologies for profiling gene expression levels. However, considerable discrepancy has been found between DEGs detected using the two technologies. Integration data across these two platforms has the potential to improve the power and reliability of DEG detection.We propose a rank-based semi-parametric model to determine DEGs using information across different sources and apply it to the integration of RNA-seq and microarray data. By incorporating both the significance of differential expression and the consistency across platforms, our method effectively detects DEGs with moderate but consistent signals. We demonstrate the effectiveness of our method using simulation studies, MAQC/SEQC data and a synthetic microRNA dataset.Our integration method is not only robust to noise and heterogeneity in the data, but also adaptive to the structure of data. In our simulations and real data studies, our approach shows a higher discriminate power and identifies more biologically relevant DEGs than eBayes, DEseq and some commonly used meta-analysis methods.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (36)
CITATIONS (16)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....