NFDI4DS | UHH-SEMS - Publication Details

Christopher J. O. Baker

ORCID: 0000-0003-4004-6479

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5080598287

Research Areas

Biomedical Text Mining and Ontologies
Semantic Web and Ontologies
Scientific Computing and Data Management
Bioinformatics and Genomic Networks
Genomics and Phylogenetic Studies
Data Quality and Management
Service-Oriented Architecture and Web Services
Advanced Text Analysis Techniques
Genetics, Bioinformatics, and Biomedical Research
Topic Modeling
Natural Language Processing Techniques
Environmental DNA in Biodiversity Studies
Microbial Community Ecology and Physiology
Healthcare Systems and Public Health
Species Distribution and Climate Change
Data-Driven Disease Surveillance
Advanced Database Systems and Queries
Machine Learning in Bioinformatics
Electronic Health Records Systems
Genomics and Rare Diseases
HIV Research and Treatment
Wastewater Treatment and Nitrogen Removal
Microbial Metabolic Engineering and Bioproduction
Computational Drug Discovery Methods
HIV/AIDS drug development and treatment

Rothamsted Research
2024

University of New Brunswick
2014-2023

University of Calgary
2021

RELX Group (Netherlands)
2017

McGill University
2015

Yale University
2014

Concordia University
2005-2011

SIB Swiss Institute of Bioinformatics
2011

Institute for Infocomm Research
2007-2008

Iogen Corporation
2001

The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery

OPENALEX - Publications

Michel Dumontier Christopher J. O. Baker Joachim Baran Alison Callahan Leonid Chepelev and 12 more

The Semanticscience Integrated Ontology (SIO) is an ontology to facilitate biomedical knowledge discovery. SIO features a simple upper level comprised of essential types and relations for the rich description arbitrary (real, hypothesized, virtual, fictional) objects, processes their attributes. specifies design patterns describe associate qualities, capabilities, functions, quantities, informational entities including textual, geometrical, mathematical entities, provides specific extensions...

10.1186/2041-1480-5-14 article EN cc-by Journal of Biomedical Semantics 2014-03-06

Why we need a centralized repository for isotopic data

OPENALEX - Publications

Jonathan N. Pauli Seth D. Newsome Joseph A. Cook Chris Harrod Shawn A. Steffan and 22 more

Organizational structure for the proposed IsoBank. A central executive group would oversee four subcommittees (SC): Information technology, integrative disciplinary, education and training, analytical expertise. GNIP, Global Network of Isotopes in Precipitation; IAEA, International Atomic Energy Association; QA/QC, quality assurance/quality control.

10.1073/pnas.1701742114 article EN Proceedings of the National Academy of Sciences 2017-03-21

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus

OPENALEX - Publications

Dietrich Rebholz‐Schuhmann Antonio Jimeno Yepes Chen Li Şenay Kafkas Ian Lewin and 32 more

Competitions in text mining have been used to measure the performance of automatic processing solutions against a manually annotated gold standard corpus (GSC). The preparation GSC is time-consuming and costly final consists at most few thousand documents with limited set semantic groups. To overcome these shortcomings, CALBC project partners (PPs) produced large-scale biomedical four different groups through harmonisation annotations from solutions, first version Silver Standard Corpus...

10.1186/2041-1480-2-s5-s11 article EN cc-by Journal of Biomedical Semantics 2011-01-01

OrganismTagger: detection, normalization and grounding of organism entities in biomedical documents

OPENALEX - Publications

Nona Naderi Thomas Kappler Christopher J. O. Baker René Witte

Abstract Motivation: Semantic tagging of organism mentions in full-text articles is an important part literature mining and semantic enrichment solutions. Tagged also play a pivotal role disambiguating other entities text, such as proteins. A high-precision system must be able to detect the numerous forms mentions, including common names well traditional taxonomic groups: genus, species strains. In addition, resolve abbreviations acronyms, assign scientific name if possible link detected...

10.1093/bioinformatics/btr452 article EN Bioinformatics 2011-08-09

Classifying chemical mode of action using gene networks and machine learning: A case study with the herbicide linuron

OPENALEX - Publications

Anna Ornostay Andrew Cowie Matthew Hindle Christopher J. O. Baker Christopher J. Martyniuk

10.1016/j.cbd.2013.08.001 article EN Comparative Biochemistry and Physiology Part D Genomics and Proteomics 2013-08-09

Towards ontology-driven navigation of the lipid bibliosphere

OPENALEX - Publications

Christopher J. O. Baker Rajaraman Kanagasabai Wee Tiong Ang Anitha Veeramani Hong-Sang Low and 1 more

The indexing of scientific literature and content is a relevant contemporary requirement within life science information systems. Navigating available in legacy formats continues to be challenge both enterprise academic domains. emergence semantic web technologies their fusion with artificial intelligence techniques has provided new toolkit which address these data integration challenges. In the emerging field lipidomics such navigation challenges are barriers translation results into...

10.1186/1471-2105-9-s1-s5 article EN cc-by BMC Bioinformatics 2008-02-01

Toward a knowledge infrastructure for traits‐based ecological risk assessment

OPENALEX - Publications

Donald J. Baird Christopher J. O. Baker Robert B. Brua Mehrdad Hajibabaei Kearon McNicol and 2 more

The trait approach has already indicated significant potential as a tool in understanding natural variation among species sensitivity to contaminants the process of ecological risk assessment. However, realize its full potential, defined nomenclature for traits is urgently required, and effort required populate databases species-trait relationships. Recently, there have been advances area information management discovery semantic web. Combined with continuing progress biological knowledge,...

10.1002/ieam.129 article EN Integrated Environmental Assessment and Management 2010-08-31

Algorithms and semantic infrastructure for mutation impact extraction and grounding

OPENALEX - Publications

Jonas Bergman Laurila Nona Naderi René Witte Alexandre Riazanov Alexandre Kouznetsov and 1 more

Mutation impact extraction is a hitherto unaccomplished task in state of the art mutation systems. Protein mutations and their impacts on protein properties are hidden scientific literature, making them poorly accessible for engineers inaccessible phenotype-prediction systems that currently depend manually curated genomic variation databases.We present first rule-based approach properties, categorizing directionality as positive, negative or neutral. Furthermore mentions grounded to...

10.1186/1471-2164-11-s4-s24 article EN cc-by BMC Genomics 2010-12-01

Network-Based Biomonitoring: Exploring Freshwater Food Webs With Stable Isotope Analysis and DNA Metabarcoding

OPENALEX - Publications

Zacchaeus G. Compson Wendy A. Monk Brian Hayden Alex Bush Zoë G. O’Malley and 7 more

Threatened freshwater ecosystems urgently require improved tools for effective management. Food web analysis is currently under-utilised, yet can be used to generate metrics support biomonitoring assessments by measuring the stability and robustness of ecosystems. Using a previously developed pipeline, we combined taxonomic outputs from DNA metabarcoding with text-mining routine extract trait information directly literature. This pipeline allowed us heuristic food webs sites within lower...

10.3389/fevo.2019.00395 article EN cc-by Frontiers in Ecology and Evolution 2019-11-25

Mutation Mining—A Prospector's Tale

OPENALEX - Publications

Christopher J. O. Baker René Witte

10.1007/s10796-006-6103-2 article EN Information Systems Frontiers 2006-02-01

Semantic web infrastructure for fungal enzyme biotechnologists

OPENALEX - Publications

Christopher J. O. Baker Arash Shaban‐Nejad Xiao Su Volker Haarslev Greg Butler

10.1016/j.websem.2006.05.001 article EN Journal of Web Semantics 2006-07-12

A Malaria Analytics Framework to Support Evolution and Interoperability of Global Health Surveillance Systems

OPENALEX - Publications

Jon Haël Brenas Mohammad Sadnan Al-Manir Christopher J. O. Baker Arash Shaban‐Nejad

Malaria is a leading cause of death in Africa. Many organizations, NGO's, and government agencies are collaborating to prevent, control, eliminate malaria. In order succeed these shared goals, an integrated, consistent knowledge source empower informed decision-making required. surveillance currently performed using dynamic, interconnected, systems which require rapid data exchange between different platforms. An important challenge must overcome the occurrence dynamic changes one or more...

10.1109/access.2017.2761232 article EN cc-by IEEE Access 2017-01-01

Perspectives on automated composition of workflows in the life sciences

OPENALEX - Publications

Anna‐Lena Lamprecht Magnus Palmblad Jon Ison Veit Schwämmle Mohammad Sadnan Al Manir and 27 more

<ns3:p>Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used the life sciences, though their composition has remained a cumbersome manual process due to lack standards for annotation, assembly, and implementation. Recent technological advances returned long-standing vision workflow into focus.</ns3:p><ns3:p> This article summarizes recent Lorentz Center workshop dedicated sciences. We survey...

10.12688/f1000research.54159.1 preprint EN cc-by F1000Research 2021-09-07

Ontology-centric integration and navigation of the dengue literature

OPENALEX - Publications

Menaka Rajapakse Rajaraman Kanagasabai Wee Tiong Ang Anitha Veeramani Mark Schreiber and 1 more

10.1016/j.jbi.2008.04.004 article EN publisher-specific-oa Journal of Biomedical Informatics 2008-04-18

Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics

OPENALEX - Publications

Leonid Chepelev Alexandre Riazanov Alexandre Kouznetsov Hong Sang Low Michel Dumontier and 1 more

Abstract Background The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, deposited numerous online databases. Unfortunately, efforts annotate, classify, analyze these chemical entities have largely remained the hands human curators using manual or semi-automated protocols, leaving many novel unclassified. Since function is often closely linked structure, accurate structure-based...

10.1186/1471-2105-12-303 article EN cc-by BMC Bioinformatics 2011-07-26

Semantic querying of relational data for clinical intelligence: a semantic web services-based approach

OPENALEX - Publications

Alexandre Riazanov Artjom Klein Arash Shaban‐Nejad Gregory Rose Alan J. Forster and 2 more

Clinical Intelligence, as a research and engineering discipline, is dedicated to the development of tools for data analysis purposes clinical research, surveillance, effective health care management. Self-service ad hoc querying one desirable type functionality. Since most are currently stored in relational or similar form, problematic it requires specialised technical skills knowledge particular schemas.A possible solution semantic where user formulates queries terms domain ontologies that...

10.1186/2041-1480-4-9 article EN cc-by Journal of Biomedical Semantics 2013-01-01

From Cues to Nudge: A Knowledge-Based Framework for Surveillance of Healthcare-Associated Infections

OPENALEX - Publications

Arash Shaban‐Nejad Hiroshi Mamiya Alexandre Riazanov Alan J. Forster Christopher J. O. Baker and 2 more

10.1007/s10916-015-0364-6 article EN Journal of Medical Systems 2015-11-04

Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison

OPENALEX - Publications

Wenxin Wan Calvin Ge Melissa C. Friesen Sarah J. Locke D. Russ and 9 more

Abstract Objectives Automatic job coding tools were developed to reduce the laborious task of manually assigning codes based on free-text descriptions in census and survey data sources, including large occupational health studies. The objective this study is provide a case comparative performance JEM (Job-Exposure Matrix)-assigned exposures agreement using existing tools. Methods We compared three automatic [AUTONOC, CASCOT (Computer-Assisted Structured Coding Tool), LabourR], which selected...

10.1093/annweh/wxad002 article EN cc-by-nc Annals of Work Exposures and Health 2023-02-03

Analysis of the cucumber malate synthase gene promoter by transient expression and gel retardation assays

OPENALEX - Publications

Ian A. Graham Christopher J. O. Baker Christopher J. Leaver

Summary Recently it has been demonstrated that the single‐copy malate synthase (MS) and isocitrate lyase (ICL) genes from cucumber are regulated by nutritional status in cell cultures. In this paper a new mesophyll protoplast transient expression system is described which electroporated MS promoter—GUS reporter gene constructs exhibit same pattern of as endogenous gene. Both MS—GUS expressed when protoplasts cultured for 48 h on non‐metabolizable carbon source such mannitol or...

10.1046/j.1365-313x.1994.6060893.x article EN The Plant Journal 1994-12-01

TOWARDS A SYSTEMATIC EVALUATION OF PROTEIN MUTATION EXTRACTION SYSTEMS

OPENALEX - Publications

René Witte Christopher J. O. Baker

The development of text analysis systems targeting the extraction information about mutations from research publications is an emergent topic in biomedical research. Current differ both scope and approach, thus preventing a meaningful comparison their performance therefore possible synergies. To overcome this evaluation bottleneck, we developed comprehensive framework for systematic mutation systems, precisely defining tasks corresponding metrics, that will allow existing future applications.

10.1142/s0219720007003193 article EN Journal of Bioinformatics and Computational Biology 2007-12-01

Deploying mutation impact text-mining software with the SADI Semantic Web Services framework

OPENALEX - Publications

Alexandre Riazanov Jonas Bergman Laurila Christopher J. O. Baker

Mutation impact extraction is an important task designed to harvest relevant annotations from scientific documents for reuse in multiple contexts. Our previous work on text mining mutation impacts resulted (i) the development of a GATE-based pipeline that mines texts information about mutations proteins, (ii) population this into our OWL DL ontology, and (iii) establishing experimental semantic database storing results mining.This article explores possibility using SADI framework as medium...

10.1186/1471-2105-12-s4-s6 article EN cc-by BMC Bioinformatics 2011-07-05

Coming Soon ...