Identifying novel oncogenes: A machine learning approach
SNP
DOI:
10.1007/s12539-013-0151-3
Publication Date:
2014-01-09T01:44:30Z
AUTHORS (4)
ABSTRACT
Genome sequencing has overflowed the databases with huge amount of SNP data. Although the amount of detected single nucleotide polymorphisms (SNPs) is rising exponentially every day, we still lag behind in characterization techniques. Implementing computational platforms to determine the pathogenecity associated with the SNPs can provide a probable solution to this problem. To improve the prediction quality for SNP characterization methods, we implemented machine learning support vector classification method. Total 557 non-synonymous amino acid variants were collected from CENP family proteins, excluding CENPE. Multivariate simulation of associated changes in biological phenomena's for each SNPs was computed through available SNP analysis platforms. Support vector model was designed using training dataset and the raw classification data was subjected to the classification hyperplane. We observed multiple evidences of cancer associated genetic mutations in CENPI, CENPJ, CENPK, CENPL and CENPX protein. The former four proteins have showed positive hits in cosmic database for mutations in tumour samples, but CENPX has never been reported before for the cancer associated outcomes. Since CENPX has been recently classified and not much functional and pathological insight has been, the results obtained in this study will serve as a starting point for future investigation on cancer research in association to CENPX protein.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (32)
CITATIONS (15)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....