Michel Dumontier

ORCID: 0000-0003-4727-9435
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Biomedical Text Mining and Ontologies
  • Semantic Web and Ontologies
  • Scientific Computing and Data Management
  • Research Data Management Practices
  • Bioinformatics and Genomic Networks
  • Computational Drug Discovery Methods
  • Data Quality and Management
  • Genomics and Phylogenetic Studies
  • Service-Oriented Architecture and Web Services
  • Privacy-Preserving Technologies in Data
  • Genetics, Bioinformatics, and Biomedical Research
  • Machine Learning in Healthcare
  • Ethics in Clinical Research
  • Pharmacogenetics and Drug Metabolism
  • Music and Audio Processing
  • Artificial Intelligence in Healthcare
  • Natural Language Processing Techniques
  • Electronic Health Records Systems
  • Big Data and Business Intelligence
  • Gene Regulatory Network Analysis
  • Gene expression and cancer classification
  • Genomics and Rare Diseases
  • Microbial Metabolic Engineering and Bioproduction
  • Pharmacovigilance and Adverse Drug Reactions
  • scientometrics and bibliometrics research

Maastricht University
2016-2025

Economie Publique
2023

Research Institute for Knowledge Systems
2023

Carleton University
2007-2022

Stanford University
2012-2022

Brandenburg-Berliner Institut für Sozialwissenschaftliche Studien
2021

Vrije Universiteit Amsterdam
2018

Stanford Medicine
2015-2018

Rensselaer Polytechnic Institute
2017

Biological E (India)
2017

There is an urgent need to improve the infrastructure supporting reuse of scholarly data. A diverse set stakeholders-representing academia, industry, funding agencies, and publishers-have come together design jointly endorse a concise measureable principles that we refer as FAIR Data Principles. The intent these may act guideline for those wishing enhance reusability their data holdings. Distinct from peer initiatives focus on human scholar, Principles put specific emphasis enhancing ability...

10.1038/sdata.2016.18 article EN cc-by Scientific Data 2016-03-15

The Biomolecular Interaction Network Database (BIND) (http://bind.ca) archives biomolecular interaction, reaction, complex and pathway information. Our aim is to curate the details about molecular interactions that arise from published experimental research provide this information, as well tools enable data analysis, freely researchers worldwide. BIND are curated into a comprehensive machine-readable archive of computable information provides users with methods discover mechanisms. has...

10.1093/nar/gki051 article EN Nucleic Acids Research 2004-12-17

The FAIR Data Principles propose that all scholarly output should be Findable, Accessible, Interoperable, and Reusable.As a set of guiding principles, expressing only the kinds behaviours researchers expect from contemporary data resources, how principles manifest in reality was largely open to interpretation.As support for has spread, so breadth these interpretations.In observing this creeping spread interpretation, several original authors felt it now appropriate revisit Principles,...

10.3233/isu-170824 article EN Information Services & Use 2017-02-17

The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies provide a representation biomedical knowledge from Open Biological Ontologies (OBO) project adds ability this was derived. We here state several applications using it, such as adding semantic expressivity existing databases, building data entry forms,...

10.1371/journal.pone.0154556 article EN public-domain PLoS ONE 2016-04-29
Michael P. Menden Dennis Wang Mike J. Mason Bence Szalai Krishna C. Bulusu and 95 more Yuanfang Guan Thomas Yu Jaewoo Kang Minji Jeon Russ Wolfinger Tin Nguyen Mikhail Zaslavskiy Jordi Abante Barbara Schmitz Abecassis Nanne Aben Delasa Aghamirzaie Tero Aittokallio Farida S. Akhtari Bissan Al‐Lazikani Tanvir Alam Amin Allam Chad H. G. Allen Mariana Pelicano de Almeida Doaa Altarawy Vinícius M. Alves Alicia Amadoz Benedict Anchang Albert A. Antolín Jeremy R. Ash V. Aznar Wail Ba-Alawi Moeen Bagheri Vladimir B. Bajić G. C. Ball Pedro J. Ballester Delora Baptista Christopher Bare Mathilde Bateson Andreas Bender Denis Bertrand Bhagya K. Wijayawardena Keith A. Boroevich Evert Bosdriesz Salim Bougouffa Gergana Bounova Thomas Brouwer Barbara M. Bryant Manuel Calaza Alberto Calderone Stefano Calza Stephen J. Capuzzi José Carbonell‐Caballero Yichao Li Hannah Carter Luisa Castagnoli Remzi Çelebi Gianni Cesareni Hyeokyoon Chang Guocai Chen Hao Chen Huiyuan Chen Lijun Cheng Ariel Chernomoretz Davide Chicco Kwang‐Hyun Cho Sung‐Hwan Cho Daeseon Choi Jaejoon Choi Kwanghun Choi Min‐Soo Choi Martine De Cock Elizabeth A. Coker Isidro Cortés‐Ciriano Miklós Cserzö Cankut Çubuk Charles Curtis Dries Van Daele Cuong Cao Dang Tjeerd M. H. Dijkstra Joaquı́n Dopazo Sorin Drăghici Anastasios Drosou Michel Dumontier Friederike Ehrhart Fatma-Elzahraa Eid Mahmoud ElHefnawi Haitham Elmarakeby Bo van Engelen H. Billur Engin Iwan J. P. de Esch Chris T. Evelo André O. Falcão Sherif Farag Carlos Fernández-Lozano Kathleen M. Fisch Åsmund Flobak Chiara Fornari Amir Foroushani Donatien Chedom Fotso Denis Fourches

Abstract The effectiveness of most cancer targeted therapies is short-lived. Tumors often develop resistance that might be overcome with drug combinations. However, the number possible combinations vast, necessitating data-driven approaches to find optimal patient-specific treatments. Here we report AstraZeneca’s large combination dataset, consisting 11,576 experiments from 910 across 85 molecularly characterized cell lines, and results a DREAM Challenge evaluate computational strategies for...

10.1038/s41467-019-09799-2 article EN cc-by Nature Communications 2019-06-17

The FAIR principles have been widely cited, endorsed and adopted by a broad range of stakeholders since their publication in 2016. By intention, the 15 guiding do not dictate specific technological implementations, but provide guidance for improving Findability, Accessibility, Interoperability Reusability digital resources. This has likely contributed to adoption principles, because individual stakeholder communities can implement own solutions. However, it also resulted inconsistent...

10.1162/dint_r_00024 article EN Data Intelligence 2019-11-01

The Semanticscience Integrated Ontology (SIO) is an ontology to facilitate biomedical knowledge discovery. SIO features a simple upper level comprised of essential types and relations for the rich description arbitrary (real, hypothesized, virtual, fictional) objects, processes their attributes. specifies design patterns describe associate qualities, capabilities, functions, quantities, informational entities including textual, geometrical, mathematical entities, provides specific extensions...

10.1186/2041-1480-5-14 article EN cc-by Journal of Biomedical Semantics 2014-03-06

Despite a large and multifaceted effort to understand the vast landscape of phenotypic data, their current form inhibits productive data analysis. The lack community-wide, consensus-based, human- machine-interpretable language for describing phenotypes genomic environmental contexts is perhaps most pressing scientific bottleneck integration across many key fields in biology, including genomics, systems development, medicine, evolution, ecology, systematics. Here we survey phenomics...

10.1371/journal.pbio.1002033 article EN cc-by PLoS Biology 2015-01-06

Adverse events resulting from drug-drug interactions (DDI) pose a serious health issue. The ability to automatically extract DDIs described in the biomedical literature could further efforts for ongoing pharmacovigilance. Most of neural networks-based methods typically focus on sentence sequence identify these DDIs, however shortest dependency path (SDP) between two entities contains valuable syntactic and semantic information. Effectively exploiting such information may improve DDI...

10.1093/bioinformatics/btx659 article EN cc-by Bioinformatics 2017-10-24

Abstract Transparent evaluations of FAIRness are increasingly required by a wide range stakeholders, from scientists to publishers, funding agencies and policy makers. We propose scalable, automatable framework evaluate digital resources that encompasses measurable indicators, open source tools, participation guidelines, which come together accommodate domain relevant community-defined FAIR assessments. The components the are: (1) Maturity Indicators – community-authored specifications...

10.1038/s41597-019-0184-5 article EN cc-by Scientific Data 2019-09-20

Reproducibility and reusability of research results is an important concern in scientific communication science policy. A foundational element reproducibility the open persistently available presentation data. However, many common approaches for primary data publication use today do not achieve sufficient long-term robustness, openness, accessibility or uniformity. Nor they permit comprehensive exploitation by modern Web technologies. This has led to several authoritative studies...

10.7717/peerj-cs.1 article EN cc-by PeerJ Computer Science 2015-05-27

PubChem is an open repository for chemical structures, biological activities and biomedical annotations. Semantic Web technologies are emerging as increasingly important approach to distribute integrate scientific data. Exposing data services may help enable automated integration management, well facilitate interoperable web applications.This work, one of a series covering the PubChemRDF project, describes translate Substance Compound information into Resource Description Framework (RDF)...

10.1186/s13321-015-0084-4 article EN cc-by Journal of Cheminformatics 2015-07-13

In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such depends on the discipline science humble bricks mortar that make integration possible; identifiers a core component this infrastructure. Drawing our experience work by other groups, we outline 10 lessons have learned about identifier qualities best practices facilitate large-scale integration. Specifically, propose actions...

10.1371/journal.pbio.2001414 article EN cc-by PLoS Biology 2017-06-29

Cheminformatics is the application of informatics techniques to solve chemical problems in silico. There are many areas biology where cheminformatics plays an important role computational research, including metabolism, proteomics, and systems biology. One critical aspect these fields accurate exchange data, which increasingly accomplished through use ontologies. Ontologies formal representations objects their properties using a logic-based ontology language. Many such ontologies currently...

10.1371/journal.pone.0025513 article EN cc-by PLoS ONE 2011-10-03

Although potential drug–drug interactions (PDDIs) are a significant source of preventable drug-related harm, there is currently no single complete PDDI information. In the current study, all publically available sources information that could be identified using comprehensive and broad search were combined into dataset. The dataset merged fourteen different including 5 clinically-oriented sources, 4 Natural Language Processing (NLP) Corpora, Bioinformatics/Pharmacovigilance sources. As...

10.1016/j.jbi.2015.04.006 article EN cc-by-nc-nd Journal of Biomedical Informatics 2015-04-25

In recent years, as newer technologies have evolved around the healthcare ecosystem, more and data been generated. Advanced analytics could power collected from numerous sources, both institutions, or generated by individuals themselves via apps devices, lead to innovations in treatment diagnosis of diseases; improve care given patient; empower citizens participate decision-making process regarding their own health well-being. However, sensitive nature prohibits organizations sharing data....

10.1162/dint_a_00032 article EN Data Intelligence 2019-11-01
Coming Soon ...