HCMMCNVs: hierarchical clustering mixture model of copy number variants detection using whole exome sequencing technology
0301 basic medicine
03 medical and health sciences
DNA Copy Number Variations
Exome Sequencing
Carcinoma, Squamous Cell
Humans
Cluster Analysis
Mouth Neoplasms
Software
Algorithms
3. Good health
DOI:
10.1093/bioinformatics/btab183
Publication Date:
2021-03-12T20:11:24Z
AUTHORS (6)
ABSTRACT
Abstract
Summary
In this article, we introduce a hierarchical clustering and Gaussian mixture model with expectation-maximization (EM) algorithm for detecting copy number variants (CNVs) using whole exome sequencing (WES) data. The R shiny package ‘HCMMCNVs’ is also developed for processing user-provided bam files, running CNVs detection algorithm and conducting visualization. Through applying our approach to 325 cancer cell lines in 22 tumor types from Cancer Cell Line Encyclopedia (CCLE), we show that our algorithm is competitive with other existing methods and feasible in using multiple cancer cell lines for CNVs estimation. In addition, by applying our approach to WES data of 120 oral squamous cell carcinoma (OSCC) samples, our algorithm, using the tumor sample only, exhibits more power in detecting CNVs as compared with the methods using both tumors and matched normal counterparts.
Availability and implementation
HCMMCNVs R shiny software is freely available at github repository https://github.com/lunching/HCMM_CNVs.and Zenodo https://doi.org/10.5281/zenodo.4593371.
Supplementary information
Supplementary data are available at Bioinformatics online.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (22)
CITATIONS (1)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....