Christopher J. O. Baker

ORCID: 0000-0003-4004-6479
Research Areas
  • Biomedical Text Mining and Ontologies
  • Semantic Web and Ontologies
  • Scientific Computing and Data Management
  • Bioinformatics and Genomic Networks
  • Genomics and Phylogenetic Studies
  • Data Quality and Management
  • Service-Oriented Architecture and Web Services
  • Advanced Text Analysis Techniques
  • Genetics, Bioinformatics, and Biomedical Research
  • Topic Modeling
  • Natural Language Processing Techniques
  • Environmental DNA in Biodiversity Studies
  • Microbial Community Ecology and Physiology
  • Healthcare Systems and Public Health
  • Species Distribution and Climate Change
  • Data-Driven Disease Surveillance
  • Advanced Database Systems and Queries
  • Machine Learning in Bioinformatics
  • Electronic Health Records Systems
  • Genomics and Rare Diseases
  • HIV Research and Treatment
  • Wastewater Treatment and Nitrogen Removal
  • Microbial Metabolic Engineering and Bioproduction
  • Computational Drug Discovery Methods
  • HIV/AIDS drug development and treatment

Rothamsted Research
2024

University of New Brunswick
2014-2023

University of Calgary
2021

RELX Group (Netherlands)
2017

McGill University
2015

Yale University
2014

Concordia University
2005-2011

SIB Swiss Institute of Bioinformatics
2011

Institute for Infocomm Research
2007-2008

Iogen Corporation
2001

The Semanticscience Integrated Ontology (SIO) is an ontology to facilitate biomedical knowledge discovery. SIO features a simple upper level comprised of essential types and relations for the rich description of arbitrary (real, hypothesized, virtual, fictional) objects, processes and their attributes. SIO specifies design patterns to describe and associate qualities, capabilities, functions, quantities, and informational entities including textual, geometrical, and mathematical entities, and provides specific extensions...

10.1186/2041-1480-5-14 article EN cc-by Journal of Biomedical Semantics 2014-03-06

Organizational structure for the proposed IsoBank. A central executive group would oversee four subcommittees (SC): information technology, integrative disciplinary, education and training, and analytical expertise. GNIP, Global Network of Isotopes in Precipitation; IAEA, International Atomic Energy Agency; QA/QC, quality assurance/quality control.

10.1073/pnas.1701742114 article EN Proceedings of the National Academy of Sciences 2017-03-21

Competitions in text mining have been used to measure the performance of automatic processing solutions against a manually annotated gold standard corpus (GSC). The preparation of the GSC is time-consuming and costly, and the final corpus consists of at most a few thousand documents with a limited set of semantic groups. To overcome these shortcomings, the CALBC project partners (PPs) produced a large-scale biomedical corpus with four different semantic groups through the harmonisation of annotations from automatic solutions, the first version of the Silver Standard Corpus...

10.1186/2041-1480-2-s5-s11 article EN cc-by Journal of Biomedical Semantics 2011-01-01

Abstract Motivation: Semantic tagging of organism mentions in full-text articles is an important part of literature mining and semantic enrichment solutions. Tagged organism mentions also play a pivotal role in disambiguating other entities in a text, such as proteins. A high-precision system must be able to detect the numerous forms of organism mentions, including common names as well as the traditional taxonomic groups: genus, species and strains. In addition, it must resolve abbreviations and acronyms, assign the scientific name and if possible link the detected...

10.1093/bioinformatics/btr452 article EN Bioinformatics 2011-08-09

The indexing of scientific literature and content is a relevant and contemporary requirement within life science information systems. Navigating information available in legacy formats continues to be a challenge in both enterprise and academic domains. The emergence of semantic web technologies and their fusion with artificial intelligence techniques has provided a new toolkit with which to address these data integration challenges. In the emerging field of lipidomics such navigation challenges are barriers to the translation of results into...

10.1186/1471-2105-9-s1-s5 article EN cc-by BMC Bioinformatics 2008-02-01

The trait approach has already indicated significant potential as a tool in understanding natural variation among species in sensitivity to contaminants in the process of ecological risk assessment. However, to realize its full potential, a defined nomenclature for traits is urgently required, and effort is required to populate databases of species-trait relationships. Recently, there have been advances in the area of information management and discovery in the semantic web. Combined with continuing progress in biological knowledge,...

10.1002/ieam.129 article EN Integrated Environmental Assessment and Management 2010-08-31

Mutation impact extraction is a hitherto unaccomplished task in state-of-the-art mutation extraction systems. Protein mutations and their impacts on protein properties are hidden in the scientific literature, making them poorly accessible for protein engineers and inaccessible for phenotype-prediction systems that currently depend on manually curated genomic variation databases. We present the first rule-based approach for the extraction of mutation impacts on protein properties, categorizing their directionality as positive, negative or neutral. Furthermore, protein and mutation mentions are grounded to...

10.1186/1471-2164-11-s4-s24 article EN cc-by BMC Genomics 2010-12-01

Threatened freshwater ecosystems urgently require improved tools for effective management. Food web analysis is currently under-utilised, yet can be used to generate metrics to support biomonitoring assessments by measuring the stability and robustness of ecosystems. Using a previously developed pipeline, we combined taxonomic outputs from DNA metabarcoding with a text-mining routine to extract trait information directly from the literature. This pipeline allowed us to generate heuristic food webs for sites within the lower...

10.3389/fevo.2019.00395 article EN cc-by Frontiers in Ecology and Evolution 2019-11-25

10.1007/s10796-006-6103-2 article EN Information Systems Frontiers 2006-02-01

Malaria is a leading cause of death in Africa. Many organizations, NGOs, and government agencies are collaborating to prevent, control, and eliminate malaria. In order to succeed in these shared goals, an integrated, consistent knowledge source to empower informed decision-making is required. Malaria surveillance is currently performed using dynamic, interconnected systems which require rapid data exchange between different platforms. An important challenge that must be overcome is the occurrence of dynamic changes in one or more...

10.1109/access.2017.2761232 article EN cc-by IEEE Access 2017-01-01

Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey...

10.12688/f1000research.54159.1 preprint EN cc-by F1000Research 2021-09-07

Abstract Background The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since function is often closely linked to structure, accurate structure-based...

10.1186/1471-2105-12-303 article EN cc-by BMC Bioinformatics 2011-07-26

Clinical Intelligence, as a research and engineering discipline, is dedicated to the development of tools for data analysis for the purposes of clinical research, surveillance, and effective health care management. Self-service ad hoc querying of clinical data is one desirable type of functionality. Since most of the data are currently stored in relational or similar form, ad hoc querying is problematic as it requires specialised technical skills and knowledge of particular data schemas. A possible solution is semantic querying, where the user formulates queries in terms of domain ontologies that...

10.1186/2041-1480-4-9 article EN cc-by Journal of Biomedical Semantics 2013-01-01

Abstract Objectives Automatic job coding tools were developed to reduce the laborious task of manually assigning job codes based on free-text job descriptions in census and survey data sources, including large occupational health studies. The objective of this study is to provide a case study of the comparative performance of job coding and JEM (Job-Exposure Matrix)-assigned exposures agreement using existing coding tools. Methods We compared three automatic job coding tools [AUTONOC, CASCOT (Computer-Assisted Structured Coding Tool), and LabourR], which were selected...

10.1093/annweh/wxad002 article EN cc-by-nc Annals of Work Exposures and Health 2023-02-03

Summary Recently it has been demonstrated that the single-copy malate synthase (MS) and isocitrate lyase (ICL) genes from cucumber are regulated by nutritional status in cell cultures. In this paper a new mesophyll protoplast transient expression system is described in which electroporated MS promoter-GUS reporter gene constructs exhibit the same pattern of expression as the endogenous MS gene. Both MS-GUS and the endogenous MS gene are expressed when protoplasts are cultured for 48 h on a non-metabolizable carbon source such as mannitol or...

10.1046/j.1365-313x.1994.6060893.x article EN The Plant Journal 1994-12-01

The development of text analysis systems targeting the extraction of information about mutations from research publications is an emergent topic in biomedical research. Current systems differ in both scope and approach, thus preventing a meaningful comparison of their performance and therefore possible synergies. To overcome this evaluation bottleneck, we developed a comprehensive framework for the systematic analysis of mutation extraction systems, precisely defining tasks and corresponding evaluation metrics, that will allow a comparison of existing and future applications.

10.1142/s0219720007003193 article EN Journal of Bioinformatics and Computational Biology 2007-12-01

Mutation impact extraction is an important task designed to harvest relevant annotations from scientific documents for reuse in multiple contexts. Our previous work on text mining for mutation impacts resulted in (i) the development of a GATE-based pipeline that mines texts for information about the impacts of mutations on proteins, (ii) the population of this information into our OWL DL ontology, and (iii) establishing an experimental semantic database for storing the results of text mining. This article explores the possibility of using the SADI framework as a medium...

10.1186/1471-2105-12-s4-s6 article EN cc-by BMC Bioinformatics 2011-07-05