- Biomedical Text Mining and Ontologies
- Semantic Web and Ontologies
- Service-Oriented Architecture and Web Services
- Research Data Management Practices
- Natural Language Processing Techniques
- Scientific Computing and Data Management
- Human-Automation Interaction and Safety
- Topic Modeling
- Traffic and Road Safety
- Superconducting Materials and Applications
- Atmospheric and Environmental Gas Dynamics
- Genomics and Rare Diseases
- Ga2O3 and related materials
- Cell Image Analysis Techniques
- Gut microbiota and health
- Bioinformatics and Genomic Networks
- Dental Erosion and Treatment
- Women's cancer prevention and management
- Meta-analysis and systematic reviews
- Multiferroics and related materials
- Data Quality and Management
- Rare-earth and actinide compounds
- ZnO doping and properties
- AI-based Problem Solving and Planning
- 3D Surveying and Cultural Heritage
Stanford University
2015-2024
Harvard University
2022-2024
Universidade Federal de Uberlândia
2024
Universidade Federal de Sergipe
2016-2021
San Antonio College
2021
University of Leeds
2020-2021
Netherlands eScience Center
2020
Universidade de Mogi das Cruzes
2019
Apple (Israel)
2019
Universidade Federal do Acre
2019
Abstract We present an analytical study of the quality metadata about samples used in biomedical experiments. The under analysis are stored two well-known databases: BioSample—a repository managed by National Center for Biotechnology Information (NCBI), and BioSamples—a European Bioinformatics Institute (EBI). tested whether 11.4 M sample records repositories populated with values that fulfill stated requirements such values. Our revealed multiple anomalies metadata. Most field names their...
The OWL Reasoner Evaluation competition is an annual (with associated workshop) that pits 2 compliant reasoners against each other on various standard reasoning tasks over naturally occurring problems. 2015 was the third of its sort and had 14 competing in six tracks comprising three (consistency, classification, realisation) two profiles (OWL DL EL). In this paper, we discuss design, execution results with particular attention to lessons learned for benchmarking, comparative experiments,...
Introdução: A síndrome de down é uma anomalia cromossômica que se caracteriza por traços físicos específicos. Consiste em congênita comum a nível mundial cuja prevalência aumenta com idade materna, principalmente partir dos 35 anos, quando os ovócitos envelhecem e perdem qualidade, diminuindo sua capacidade eliminar zigotos Síndrome. Objetivo: Analisar estatísticas macrorregiões brasileiras, nascidos vivos Síndrome Down acordo materna ≥ anos até 39 40 nos 2020-2024. Método: Trata-se um...
Abstract Metadata that are structured using principled schemas and use terms from ontologies essential to making biomedical data findable reusable for downstream analyses. The largest source of metadata describes the experimental protocol, funding, scientific leadership clinical studies is ClinicalTrials.gov. We evaluated whether values in 302,091 trial records adhere expected types ontologies, contain fields required by government regulations, elements could replace free-text elements....
Point cloud datasets provided by LiDAR have become an integral part in many research fields including archaeology, forestry, and ecology. Facilitated technological advances, the volume of these has steadily increased, with modern airborne laser scanning surveys now providing high-resolution, (super-)national scale, multi-terabyte point clouds. However, their wider scientific exploitation is hindered scarcity open source software tools capable handling challenges accessing, processing,...
The National Cancer Institute (NCI) Thesaurus (NCIt) is a biomedical ontology which has been developed for over decade. Nearly every month from 2003 through 2011, the NCI published an updated version of NCIt to Web as OWL (as well in other formats). We collected all 88 versions available and conducted cross-sectional study on this corpus investigate characterize evolution NCIt. In particular, we gathered analysed various axiom entity statistics, carried out reasoner performance test corpus....
Structured data acquisition is a common task that widely performed in biomedicine. However, current solutions for this are far from providing means to structure such way it can be automatically employed decision making (e.g., our example application domain of clinical functional assessment, determining eligibility disability benefits) based on conclusions derived acquired assessment impaired motor function). To use these settings, we need structured exploited by automated reasoning systems,...
Abstract There is evidence that drivers’ behaviour adapts after using different advanced driving assistance systems. For instance, headway during car-following reduces adaptive cruise control. However, little known about whether, and how, will change if they experience automated car-following, how this affected by engagement in non-driving-related tasks (NDRT). The aim of simulator study, conducted as part the H2020 L3Pilot project, was to address topic. We also investigated effect presence...
The literature of human and other host-associated microbiome studies is expanding rapidly, but systematic comparisons among published results signatures differential abundance remain difficult. We present BugSigDB, a community-editable database manually curated microbial from accompanied by information on study geography, health outcomes, host body site experimental, epidemiological statistical methods using controlled vocabulary. initial release the contains >2,500 >600 three species,...
The analysis of changes between OWL ontologies (in the form a diff ) is an important service for ontology engineering. A purely syntactic insufficient to distinguish that have logical impact and those do not. current state art in semantic diffing ignores logically ineffectual lacks any further characterisation even significant changes. We present method based on exhaustive categorisation effectual ontologies. In order verify applicability our approach we apply it 88 versions National Cancer...
Abstract Objective Integrating electronic health record (EHR) data with other resources is essential in rare disease research due to low prevalence. Such integration dependent on the alignment of ontologies used for annotation. The international classification diseases (ICD) annotate clinical diagnoses, while human phenotype ontology (HPO) phenotypes. Although these overlap biomedical entities they describe, extent which are interoperable unknown. We investigate how well aligned and whether...
We present WebProtégé, a tool to develop ontologies represented in the Web Ontology Language (OWL). WebProtégé is cloud-based application that allows users collaboratively edit OWL ontologies, and it available for use at https://webprotege.stanford.edu. currently hosts more than 68,000 ontology projects has over 50,000 user accounts. In this paper, we detail main new features of latest version WebProtégé.
The metadata about scientific experiments are crucial for finding, reproducing, and reusing the data that describe. We present a study of quality stored in BioSample--a repository samples used biomedical managed by U.S. National Center Biomedical Technology Information (NCBI). tested whether 6.6 million BioSample records populated with values fulfill stated requirements such values. Our revealed multiple anomalies analyzed metadata. field names their not standardized or controlled--15%...