NFDI4DS | UHH-SEMS - Publication Details

Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations

OPENALEX - Publications

Karthik Gangavarapu Alaa Abdel Latif Julia L. Mullen Manar Alkuzweny Emory Hufbauer and 19 more

10.1038/s41592-023-01769-3 article EN Nature Methods 2023-02-23

Outbreak.info Research Library: a standardized, searchable platform to discover and explore COVID-19 resources

OPENALEX - Publications

Ginger Tsueng Julia L. Mullen Manar Alkuzweny Marco Alvarado Cano B. Rush and 13 more

10.1038/s41592-023-01770-w article EN Nature Methods 2023-02-23

Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations

OPENALEX - Publications

Karthik Gangavarapu Alaa Abdel Latif Julia L. Mullen Manar Alkuzweny Emory Hufbauer and 19 more

Abstract The emergence of SARS-CoV-2 variants concern has prompted the need for near real-time genomic surveillance to inform public health interventions. In response this need, global scientific community, through unprecedented effort, sequenced and shared over 10 million genomes GISAID, as May 2022. This extraordinarily high sampling rate provides a unique opportunity track evolution virus in real-time. Here, we present outbreak.info , platform that currently tracks 40 combinations PANGO...

10.1101/2022.01.27.22269965 preprint EN cc-by medRxiv (Cold Spring Harbor Laboratory) 2022-01-29

Outbreak.info Research Library: A standardized, searchable platform to discover and explore COVID-19 resources

OPENALEX - Publications

Ginger Tsueng Julia L. Mullen Manar Alkuzweny Marco Alvarado Cano B. Rush and 13 more

To combat the ongoing COVID-19 pandemic, scientists have been conducting research at breakneck speeds, producing over 52,000 peer-reviewed articles within first year. address challenge in tracking vast amount of new located separate repositories, we developed outbreak.info Research Library, a standardized, searchable interface and SARS-CoV-2 resources. Unifying metadata from sixteen assembled collection 350,000 publications, clinical trials, datasets, protocols, other resources as October...

10.1101/2022.01.20.477133 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2022-01-21

BioThings SDK: a toolkit for building high-performance data APIs in biomedical research

OPENALEX - Publications

Sebastien Lelong Xinghua Zhou Cyrus Afrasiabi Zhongchao Qian Marco Alvarado Cano and 8 more

To meet the increased need of making biomedical resources more accessible and reusable, Web Application Programming Interfaces (APIs) or web services have become a common way to disseminate knowledge sources. The BioThings APIs are collection high-performance, scalable, annotation as service that automate integration biological annotations from disparate data This currently includes MyGene.info, MyVariant.info MyChem.info for integrating on genes, variants chemical compounds, respectively....

10.1093/bioinformatics/btac017 article EN cc-by Bioinformatics 2022-01-08

Developing a standardized but extendable framework to increase the findability of infectious disease datasets

OPENALEX - Publications

Ginger Tsueng Marco Alvarado Cano José Bento Candice Czech Mengjia Kang and 14 more

Abstract Biomedical datasets are increasing in size, stored many repositories, and face challenges FAIRness (findability, accessibility, interoperability, reusability). As a Consortium of infectious disease researchers from 15 Centers, we aim to adopt open science practices promote transparency, encourage reproducibility, accelerate research advances through data reuse. To improve our computational tools, evaluated metadata standards across established biomedical repositories. The vast...

10.1038/s41597-023-01968-9 article EN cc-by Scientific Data 2023-02-23

Outbreak.info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations

OPENALEX - Publications

Laura D. Hughes Karthik Gangavarapu Alaa Abdel Latif Julia L. Mullen Manar Alkuzweny and 19 more

Abstract The emergence of SARS-CoV-2 variants concern has prompted the need for near real-time genomic surveillance to inform public health interventions. In response this need, global scientific community, through unprecedented effort, sequenced and shared over 11 million genomes GISAID, as May 2022. This extraordinarily high sampling rate provides a unique opportunity track evolution virus in real-time. Here, we present outbreak.info, platform that currently tracks 40 combinations PANGO...

10.21203/rs.3.rs-1723829/v1 preprint EN cc-by Research Square (Research Square) 2022-06-28

Schema Playground: a tool for authoring, extending, and using metadata schemas to improve FAIRness of biomedical data

OPENALEX - Publications

Marco Alvarado Cano Ginger Tsueng Xinghua Zhou Jiwen Xin Laura D. Hughes and 3 more

Abstract Background Biomedical researchers are strongly encouraged to make their research outputs more Findable, Accessible, Interoperable, and Reusable (FAIR). While many biomedical readily accessible through open data efforts, finding relevant remains a significant challenge. Schema.org is metadata vocabulary standardization project that enables web content creators FAIR. Leveraging could benefit resource providers, but it can be challenging apply standards outputs. We created an online...

10.1186/s12859-023-05258-4 article EN cc-by BMC Bioinformatics 2023-04-20

BioThings Explorer: a query engine for a federated knowledge graph of biomedical APIs

OPENALEX - Publications

Jackson Callaghan Colleen H. Xu Jiwen Xin Marco Alvarado Cano Anders Riutta and 9 more

Knowledge graphs are an increasingly common data structure for representing biomedical information. These knowledge can easily represent heterogeneous types of information, and many algorithms tools exist querying analyzing graphs. Biomedical have been used in a variety applications, including drug repurposing, identification targets, prediction side effects, clinical decision support. Typically, constructed by centralization integration from multiple disparate sources. Here, we describe...

10.1093/bioinformatics/btad570 article EN cc-by Bioinformatics 2023-09-01

Enabling Large-Scale Bioinformatics Data Analysis with Cloud Computing

OPENALEX - Publications

Johan Karlsson Óscar Torreño Daniel Ramet Günter Klambauer Marco Alvarado Cano and 1 more

The petabyte scale of the Big Data generation in bioinformatics requires introduction advanced computational techniques to enable efficient knowledge discovery from data. Many data analysis tools have been developed but few adapted take advantage high performance computing (HPC) resources. For some these tools, an attractive option is employ a map/reduce strategy. On other hand, Cloud Computing could be important platform run such parallel because it provides on-demand, elastic This paper...

10.1109/ispa.2012.95 article EN 2012-07-01

ChlamBase: a curated model organism database for the Chlamydia research community

OPENALEX - Publications

Tim Putman Kevin Hybiske Derek Jow Cyrus Afrasiabi Sebastien Lelong and 6 more

The accelerating growth of genomic and proteomic information for Chlamydia species, coupled with unique biological aspects these pathogens, necessitates bioinformatic tools features that are not provided by major public databases. To meet growing needs, we developed ChlamBase, a model organism database is built upon the WikiGenomes application framework, Wikidata, community-curated database. ChlamBase was designed to serve as central access point research community. integrates from numerous...

10.1093/database/baz041 article EN cc-by Database 2019-01-01

Schema Playground: A tool for authoring, extending, and using metadata schemas to improve FAIRness of biomedical data

OPENALEX - Publications

Marco Alvarado Cano Ginger Tsueng Xinghua Zhou Laura D. Hughes Julia L. Mullen and 3 more

Abstract Background Biomedical researchers are strongly encouraged to make their research outputs more Findable, Accessible, Interoperable, and Reusable (FAIR). While many biomedical readily accessible through open data efforts, finding relevant remains a significant challenge. Schema.org is metadata vocabulary standardization project that enables web content creators FAIR. Leveraging schema.org could benefit resource providers, but it can be challenging apply standards outputs. We created...

10.1101/2021.09.02.458726 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2021-09-03

Outbreak.info Research Library: A standardized, searchable platform to discover and explore COVID-19 resources

OPENALEX - Publications

Laura D. Hughes Ginger Tsueng Julia L. Mullen Manar Alkuzweny Marco Alvarado Cano and 11 more

Abstract To combat the ongoing COVID-19 pandemic, scientists have been conducting research at breakneck speeds, producing over 52,000 peer-reviewed articles within first year. address challenge in tracking vast amount of new located separate repositories, we developed outbreak.info Research Library, a standardized, searchable interface and SARS-CoV-2 resources. Unifying metadata from fourteen assembled collection 270,000 publications, clinical trials, datasets, protocols, other resources as...

10.21203/rs.3.rs-1723808/v1 preprint EN cc-by Research Square (Research Square) 2022-06-28

MyGeneset.info: an interactive and programmatic platform for community-curated and user-created collections of genes

OPENALEX - Publications

Ricardo E. Ávila Vincent Rubinetti Xinghua Zhou Dongbo Hu Zhongchao Qian and 5 more

Abstract Gene definitions and identifiers can be painful to manage–more so when trying include gene function annotations as this highly context-dependent. Creating groups of genes or sets help provide such context, but it compounds the issue each within set map multiple have derived from sources. We developed MyGeneset.info an API for integrated suitable use in analytical pipelines web servers. Leveraging our previous work with MyGene.info (a server that provides gene-centric identifiers),...

10.1093/nar/gkad289 article EN cc-by Nucleic Acids Research 2023-04-18

BioThings SDK: a toolkit for building high-performance data APIs in biomedical research

OPENALEX - Publications

Sebastien Lelong Xinghua Zhou Cyrus Afrasiabi Zhongchao Qian Marco Alvarado Cano and 8 more

Abstract Summary To meet the increased need of making biomedical resources more accessible and reusable, Web APIs or web services have become a common way to disseminate knowledge sources. The BioThings are collection high-performance, scalable, annotation as service that automate integration biological annotations from disparate data This currently includes MyGene.info, MyVariant.info, MyChem.info for integrating on genes, variants, chemical compounds, respectively. These used by both...

10.1101/2021.10.18.464256 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2021-10-19

107. MyVariant.info: a gateway to integrated resource of variant annotations

OPENALEX - Publications

Yao Yao Everaldo Rodolpho Sebastien Lelong Xinghua Zhou Cyrus Afrasiabi and 5 more

10.1016/j.cancergen.2023.08.115 article EN Cancer Genetics 2023-10-31

BioThings Explorer: a query engine for a federated knowledge graph of biomedical APIs

OPENALEX - Publications

Jackson Callaghan Colleen H. Xu Jiwen Xin Marco Alvarado Cano Anders Riutta and 9 more

Knowledge graphs are an increasingly common data structure for representing biomedical information. These knowledge can easily represent heterogeneous types of information, and many algorithms tools exist querying analyzing graphs. Biomedical have been used in a variety applications, including drug repurposing, identification targets, prediction side effects, clinical decision support. Typically, constructed by centralization integration from multiple disparate sources. Here, we describe...

10.48550/arxiv.2304.09344 preprint EN cc-by arXiv (Cornell University) 2023-01-01

BioThings Studio: an API gateway for biomedical knowledge

OPENALEX - Publications

Sebastien Lelong Cyrus Afrasiabi Jiwen Xin Marco Alvarado Cano Ginger Tsueng and 2 more

10.7490/f1000research.1115797.1 article EN F1000Research 2018-07-10

Developing a standardized but extendable framework to increase the findability of infectious disease datasets

OPENALEX - Publications

Ginger Tsueng Marco Alvarado Cano José Bento Candice Czech Mengjia Kang and 13 more

Abstract Biomedical datasets are increasing in size, stored many repositories, and face challenges FAIRness (findability, accessibility, interoperability, reusability). As a Consortium of infectious disease researchers from 15 Centers, we aim to adopt open science practices promote transparency, encourage reproducibility, accelerate research advances through data reuse. To improve our computational tools, evaluated metadata standards across established biomedical repositories. The vast...

10.1101/2022.10.10.511492 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2022-10-13