Extracting insights from the shape of complex data using topology
QA75
330
Survival
Beräkningsmatematik
QA75 Electronic computers. Computer science
Breast Neoplasms
Basketball
Carcinomas
PREDICT
Article
Pattern Recognition, Automated
03 medical and health sciences
SDG 3 - Good Health and Well-being
BREAST-CANCER
Cluster Analysis
Humans
Predict
Breast-cancer
Principal Component Analysis
0303 health sciences
Politics
United States
004
Gene Expression Regulation, Neoplastic
Computational Mathematics
Athletes
CARCINOMAS
SURVIVAL
Female
Mathematics
DOI:
10.1038/srep01236
Publication Date:
2013-02-07T10:38:28Z
AUTHORS (8)
ABSTRACT
This paper applies topological methods to study complex high dimensional data sets by extracting shapes (patterns) and obtaining insights about them. Our method combines the best features of existing standard methodologies such as principal component and cluster analyses to provide a geometric representation of complex data sets. Through this hybrid method, we often find subgroups in data sets that traditional methodologies fail to find. Our method also permits the analysis of individual data sets as well as the analysis of relationships between related data sets. We illustrate the use of our method by applying it to three very different kinds of data, namely gene expression from breast tumors, voting data from the United States House of Representatives and player performance data from the NBA, in each case finding stratifications of the data which are more refined than those produced by standard methods.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (20)
CITATIONS (350)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....