Restructured GEO: restructuring Gene Expression Omnibus metadata for genome dynamics analysis
Restructuring
DOI:
10.1093/database/bay145
Publication Date:
2018-12-29T15:08:23Z
AUTHORS (7)
ABSTRACT
Gene Expression Omnibus (GEO) and other publicly available data store their metadata in the format of unstructured English text, which is very difficult for automated reuse.We employed text mining techniques to analyze GEO developed Restructured database (ReGEO). ReGEO reorganizes categorizes series makes them searchable by two new attributes extracted automatically from each series' metadata. These are number time points tested experiment disease being investigated. also GEO, such as platform organism, type, associated PubMed ID well general keywords study's description. Our approach greatly expands usability data, demonstrating a credible improve utility vast amount era Big Data research.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (29)
CITATIONS (22)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....