NFDI4DS | UHH-SEMS - Publication Details

Tiago Lubiana

ORCID: 0000-0003-2473-2313

About

Contact & Profiles

OpenAlex ID: A5055351466

Research Areas

Biomedical Text Mining and Ontologies
Wikis in Education and Collaboration
Semantic Web and Ontologies
Topic Modeling
Advanced Graph Neural Networks
Natural Language Processing Techniques
SARS-CoV-2 and COVID-19 Research
Genetics, Bioinformatics, and Biomedical Research
Bioinformatics and Genomic Networks
Genomics and Phylogenetic Studies
Mitochondrial Function and Pathology
Anesthesia and Neurotoxicity Research
Artificial Intelligence in Healthcare and Education
Research Data Management Practices
Cell Image Analysis Techniques
Microbial Metabolic Engineering and Bioproduction
Hybrid Renewable Energy Systems
Hereditary Neurological Disorders
Scientific Computing and Data Management
Microbial Community Ecology and Physiology
COVID-19 Clinical Research Studies
Cancer-related gene regulation
Academic Publishing and Open Access
Bone and Dental Protein Studies
Vaccine Coverage and Hesitancy

Universidade de São Paulo
2016-2024

Ronin Institute
2021-2023

Institute of Mathematics and Informatics
2022

Czech Academy of Sciences, Institute of Mathematics
2022

Universidade Federal do Rio de Janeiro
2021

Instituto Biológico
2018

Universidade Federal de São Paulo
2016

University of California, San Diego
2016

Ten quick tips for harnessing the power of ChatGPT in computational biology

OPENALEX - Publications

Tiago Lubiana, Rafael Lopes Paixão da Silva, Pedro Medeiros, Juan Carlo Santos e Silva, André Nicolau Aquime Gonçalves, and 2 more

The rise of advanced chatbots, such as ChatGPT, has stirred excitement and curiosity in the scientific community. Powered by large language models (LLMs) based on generative pretrained transformers (GPTs), specifically GPT-3.5 and GPT-4, ChatGPT is considered a general-purpose technology with the potential to impact the job market, research endeavors, and numerous fields [1]. Although similar models have been fine-tuned for biology-specific projects, including text-based analysis and biological sequence decoding [2,3],...

10.1371/journal.pcbi.1011319 article EN cc-by PLoS Computational Biology 2023-08-10

The opportunistic pathogen Stenotrophomonas maltophilia utilizes a type IV secretion system for interbacterial killing

Ethel Bayer‐Santos, William Cenens, Bruno Y. Matsuyama, Gabriel Umaji Oka, Giancarlo Di Sessa, and 3 more

Bacterial type IV secretion systems (T4SS) are a highly diversified but evolutionarily related family of macromolecule transporters that can secrete proteins and DNA into the extracellular medium or into target cells. It was recently shown that a subtype of T4SS harboured by the plant pathogen Xanthomonas citri transfers toxins into target cells. Here, we show that a similar T4SS from the multi-drug-resistant opportunistic pathogen Stenotrophomonas maltophilia is proficient in killing competitor bacterial species. T4SS-dependent duelling between S. maltophilia and X....

10.1371/journal.ppat.1007651 article EN cc-by PLoS Pathogens 2019-09-12

Complex Portal 2022: new curation frontiers

Birgit H M Meldal, Livia Perfetto, Colin Combe, Tiago Lubiana, João Vitor Ferreira Cavalcante, and 14 more

The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include...

10.1093/nar/gkab991 article EN cc-by Nucleic Acids Research 2021-10-10

Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)

Sabrina Toro, Anna V. Anagnostopoulos, Susan M. Bello, Kai Blumberg, Rhiannon Cameron, and 25 more

Ontologies are fundamental components of informatics infrastructure in domains such as biomedical, environmental, and food sciences, representing consensus knowledge in an accurate and computable form. However, their construction and maintenance demand substantial resources and necessitate collaboration between domain experts, curators, and ontology experts. We present Dynamic Retrieval Augmented Generation of Ontologies using AI (DRAGON-AI), an ontology generation method employing Large Language Models (LLMs) and Retrieval Augmented Generation (RAG). DRAGON-AI can...

10.1186/s13326-024-00320-3 article EN cc-by Journal of Biomedical Semantics 2024-10-16

Pathogenesis, Symptomatology, and Transmission of SARS-CoV-2 through Analysis of Viral Genomics and Structure

Halie M. Rando, Adam L. MacLean, Alexandra Lee, Ronan Lordan, Sandipan Ray, and 28 more

The novel coronavirus SARS-CoV-2, which emerged in late 2019, has since spread around the world and infected hundreds of millions of people with coronavirus disease 2019 (COVID-19). While this viral species was unknown prior to January 2020, its similarity to other coronaviruses that infect humans has allowed for rapid insight into the mechanisms that it uses to infect human hosts, as well as the ways in which the human immune system can respond. Here, we contextualize SARS-CoV-2 among other coronaviruses and identify what is known and what can be inferred about its behavior once inside a human host....

10.1128/msystems.00095-21 article EN cc-by mSystems 2021-10-26

Representing COVID-19 information in collaborative knowledge graphs: The case of Wikidata

Houcemeddine Turki, Mohamed Ali Hadj Taieb, Thomas Shafee, Tiago Lubiana, Dariusz Jemielniak, and 7 more

Information related to the COVID-19 pandemic ranges from biological to bibliographic, from geographical to genetic and beyond. The structure of the raw data is highly complex, so converting it to meaningful insight requires data curation, integration, extraction and visualization, the global crowdsourcing of which provides both additional challenges and opportunities. Wikidata is an interdisciplinary, multilingual, open collaborative knowledge base of more than 90 million entities connected by well over a billion relationships. It...

10.3233/sw-210444 article EN other-oa Semantic Web 2021-09-28

Unifying the identification of biomedical entities with the Bioregistry

Charles Tapley Hoyt, Meghan A. Balk, Tiffany J Callahan, Daniel Domingo‐Fernández, Melissa Haendel, and 14 more

The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven...

10.1038/s41597-022-01807-3 article EN cc-by Scientific Data 2022-11-19

Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational Biology

Tiago Lubiana, Rafael de Figueiredo Lopes, Pedro Henrique Quintela Soares de Medeiros, Juan Carlo Silva, Andre Nicolau Aquime Goncalves, and 2 more

The rise of advanced chatbots, such as ChatGPT, has sparked curiosity in the scientific community. ChatGPT is a general-purpose chatbot powered by the large language models (LLMs) GPT-3.5 and GPT-4, with the potential to impact numerous fields, including computational biology. In this article, we offer ten tips based on our experience with ChatGPT to assist computational biologists in optimizing their workflows. We have collected relevant prompts and reviewed the nascent literature in the field, compiling tips we project to remain pertinent for future LLM...

10.48550/arxiv.2303.16429 preprint EN cc-by-sa arXiv (Cornell University) 2023-01-01

Zebrafish sp7 mutants show tooth cycling independent of attachment, eruption and poor differentiation of teeth

Érika Kague, P. Eckhard Witten, Mieke Soenens, CL. Campos, Tiago Lubiana, and 5 more

10.1016/j.ydbio.2018.01.021 article EN publisher-specific-oa Developmental Biology 2018-02-02

Characterization of Comments About bioRxiv and medRxiv Preprints

Clarissa F. D. Carneiro, Gabriel Gonçalves da Costa, Kleber Neves, Mariana Abreu, Pedro Batista Tan, and 6 more

Preprints have been increasingly used in biomedical science, and a key feature of many platforms is public commenting. The content of these comments, however, has not been well studied, and it is unclear whether they resemble those found in journal peer review. To describe the content of comments on the bioRxiv and medRxiv preprint platforms. In this cross-sectional study, preprints posted on the bioRxiv and medRxiv platforms in 2020 were accessed through each platform's application programming interface on March 29, 2021, and a random sample of preprints containing between 1 and 20 comments was...

10.1001/jamanetworkopen.2023.31410 article EN cc-by-nc-nd JAMA Network Open 2023-08-30

Ten quick tips for editing Wikidata

Thomas Shafee, Daniel Mietchen, Tiago Lubiana, Dariusz Jemielniak, Andra Waagmeester

This article acts as a successor to the 10 simple rules for editing Wikipedia from a decade ago [1]. It addresses Wikipedia's machine-readable cousin: Wikidata, a project potentially even more relevant from the point of view of Computational Biology. Wikidata is a free collaborative knowledgebase [2] providing structured data to every Wikipedia page and beyond. It relies on the same peer production principle as Wikipedia: anyone can contribute. Open, collaborative models often surprise in how productively they work in practice, given how unlikely they might...

10.1371/journal.pcbi.1011235 article EN cc-by PLoS Computational Biology 2023-07-20

Mapping the content of comments on bioRxiv and medRxiv preprints

Clarissa F. D. Carneiro, Gabriel Gonçalves da Costa, Kleber Neves, Mariana Abreu, Pedro Batista Tan, and 6 more

Introduction: Preprints have been increasingly used in biomedical sciences, providing the opportunity for research to be publicly assessed before journal publication. With the increase in attention over preprints during the COVID-19 pandemic, we decided to assess the content of comments left on preprint platforms. Methods: Preprints posted on bioRxiv and medRxiv in 2020 were accessed through each platform’s API, and a random sample of preprints that had received between 1 and 20 comments was analyzed. Comments were evaluated in triplicate by independent...

10.1101/2022.11.23.517621 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2022-11-24

Using logical constraints to validate statistical information about disease outbreaks in collaborative knowledge graphs: the case of COVID-19 epidemiology in Wikidata

Houcemeddine Turki, Dariusz Jemielniak, Mohamed Ali Hadj Taieb, José Emilio Labra Gayo, Mohamed Ben Aouicha, and 6 more

Urgent global research demands real-time dissemination of precise data. Wikidata, a collaborative and openly licensed knowledge graph available in RDF format, provides an ideal forum for exchanging structured data that can be verified and consolidated using validation schemas and bot edits. In this article, we catalog an automatable task set necessary to assess and validate the portion of Wikidata relating to COVID-19 epidemiology. These tasks assess statistical data and are implemented in SPARQL, a query language for semantic...

10.7717/peerj-cs.1085 article EN cc-by PeerJ Computer Science 2022-09-29

Chimeric spider silk production in microalgae: a modular bionanomaterial

João Vitor Dutra Molino, Tiago Lubiana, Livia Seno Ferreira‐Camargo, Miguel Croce, Allan Tanaka, and 12 more

The recombinant proteins, spider silk proteins and enzybiotics, will be expressed in Chlamydomonas reinhardtii strains by nuclear transformation. Each strain will express a different protein, which will contain the N- and C-terminal polymerization domains from native spider silk proteins. These domains are essential to the polymerization step and, subsequently, for the production of a material very similar to spider silk. This material will be evaluated regarding its antimicrobial and mechanical properties, as well as the system productivity. The results may shed some light on silk-based...

10.3897/rio.2.e9342 article EN cc-by Research Ideas and Outcomes 2016-06-23

egonw/SARS-CoV-2-Queries: Edition 1

Egon Willighagen, Marvin Martens, Yasunori, Tiago Lubiana, nunogit, and 2 more

10.5281/zenodo.3977414 article 2020-08-09

Building a Systematic Online Living Evidence Summary of COVID-19 Research

Kaitlyn Hair, Emily S. Sena, Emma Wilson, Gillian L. Currie, Malcolm Macleod, and 60 more

Throughout the global coronavirus pandemic, we have seen an unprecedented volume of COVID-19 research publications. This vast body of evidence continues to grow, making it difficult for research users to keep up with the pace of evolving research findings. To enable the synthesis of this evidence for timely use by researchers, policymakers, and other stakeholders, we developed an automated workflow to collect, categorise, and visualise the evidence from primary COVID-19 research studies. We trained a crowd of volunteer reviewers to annotate studies by relevance to COVID-19, study...

10.32384/jeahil17465 article EN cc-by Journal of EAHIL 2021-06-24

Characterizing domain-specific open educational resources by linking ISCB Communities of Special Interest to Wikipedia

Alastair M. Kilpatrick, Farzana Rahman, Audra Anjum, Sayane Shome, K. M. Salim Andalib, and 21 more

Wikipedia is one of the most important channels for the public communication of science and is frequently accessed as an educational resource in computational biology. Joint efforts between the International Society for Computational Biology (ISCB) and the Computational Biology taskforce of WikiProject Molecular Biology (a group of expert Wikipedia editors) have considerably improved computational biology representation on Wikipedia in recent years. However, there is still an urgent need for further improvement in quality, especially when compared to related scientific fields such as genetics and medicine....

10.1093/bioinformatics/btac236 article EN Bioinformatics 2022-04-14

Bringing PanglaoDB to 5-star Linked Open Data using Wikidata

Tiago Lubiana, João Vitor Ferreira Cavalcante

PanglaoDB is a database of cell-type markers widely used for single-cell RNA sequencing data analysis. However, cell types and genes in the database are encoded by free text, lacking proper identifiers. Wikidata is a freely editable knowledge graph useful for integrating biomedical knowledge. We thus reasoned that porting PanglaoDB’s data to the platform could improve their reusability and overall technical quality (FAIRness). We mapped 188 cell types from PanglaoDB to species-neutral terms on Wikidata and created 376 species-specific cell types for Homo...

10.1101/2024.04.12.589259 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2024-04-15

A reasonable request for true data sharing

Tiago Lubiana, Helder I. Nakaya

10.1016/j.lana.2024.100795 article EN cc-by-nc The Lancet Regional Health - Americas 2024-05-28

Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)

Sabrina Toro, Anna V. Anagnostopoulos, S. Bello, Kai Blumberg, Rhiannon Cameron, and 25 more

Ontologies are fundamental components of informatics infrastructure in domains such as biomedical, environmental, and food sciences, representing consensus knowledge in an accurate and computable form. However, their construction and maintenance demand substantial resources, necessitating collaborative efforts of domain experts, curators, and ontology experts. We present Dynamic Retrieval Augmented Generation of Ontologies using AI (DRAGON-AI), an ontology generation method employing Large Language Models (LLMs) and Retrieval Augmented Generation (RAG). This method can...

10.48550/arxiv.2312.10904 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Unifying the Identification of Biomedical Entities with the Bioregistry

Charles Tapley Hoyt, Meghan A. Balk, Tiffany J Callahan, Daniel Domingo‐Fernández, Melissa Haendel, and 14 more

The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open,...

10.1101/2022.07.08.499378 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2022-07-10
