High-performance integrated virtual environment (HIVE): a robust infrastructure for next-generation sequence data analysis
Sequence (biology)
DOI:
10.1093/database/baw022
Publication Date:
2016-03-17T21:38:10Z
AUTHORS (26)
ABSTRACT
The High-performance Integrated Virtual Environment (HIVE) is a distributed storage and compute environment designed primarily to handle next-generation sequencing (NGS) data. This multicomponent cloud infrastructure provides secure web access for authorized users deposit, retrieve, annotate on NGS data, analyse the outcomes using interface visual environments appropriately built in collaboration with research regulatory scientists other end users. Unlike many massively parallel computing environments, HIVE uses control server which virtualizes services, not processes. It both very robust flexible due abstraction layer introduced between computational requests operating system novel paradigm of moving computations instead data nodes, has proven be significantly less taxing hardware network infrastructure. honeycomb model developed integrates metadata into an object-oriented model. Its distinction from databases additional implementation unified application program search, view manipulate all types. simplifies introduction new types, thereby minimizing need database restructuring streamlining development integrated information systems. employs highly hierarchical permission system, allowing determination privileges finely granular manner without flooding security subsystem multiplicity rules. will allow engineers perform analysis that efficient secure. actively supported public private domains, project collaborations are welcomed. Database URL: https://hive.biochemistry.gwu.edu
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (23)
CITATIONS (54)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....