- Scientific Computing and Data Management
- Research Data Management Practices
- Species Distribution and Climate Change
- Distributed and Parallel Computing Systems
- Environmental DNA in Biodiversity Studies
- Semantic Web and Ontologies
- Genomics and Phylogenetic Studies
- Biomedical Text Mining and Ontologies
- Data Quality and Management
- Telemedicine and Telehealth Implementation
- Electronic Health Records Systems
- Advanced Data Storage Technologies
- Forest Ecology and Biodiversity Studies
- Context-Aware Activity Recognition Systems
- Energy Efficient Wireless Sensor Networks
- Chronic Disease Management Strategies
- Image Processing and 3D Reconstruction
- Computational and Text Analysis Methods
- IoT-based Smart Home Systems
- Heart Failure Treatment and Management
- Peer-to-Peer Network Technologies
- Water Quality Monitoring Technologies
- Bioinformatics and Genomic Networks
- Mobile Health and mHealth Applications
- Environmental Monitoring and Data Management
Cardiff University
2013-2022
Harvard University
2021
University of Kansas
2021
Seattle University
2021
Natural History Museum
2019-2021
Bavarian Natural History Collections
2021
Freie Universität Berlin
2019-2021
Royal Botanic Garden Edinburgh
2019-2021
Naturalis Biodiversity Center
2019-2021
Distributed System of Scientific Collections
2021
The Taverna workflow tool suite (http://www.taverna.org.uk) is designed to combine distributed Web Services and/or local tools into complex analysis pipelines. These pipelines can be executed on desktop machines or through larger infrastructure (such as supercomputers, Grids cloud environments), using the Server. In bioinformatics, workflows are typically used in areas of high-throughput omics analyses (for example, proteomics transcriptomics), for evidence gathering methods involving text...
Much biodiversity data is collected worldwide, but it remains challenging to assemble the scattered knowledge forassessing status and trends. The concept of Essential Biodiversity Variables (EBVs) was introduced tostructure monitoring globally, harmonize standardize from disparate sourcesto capture a minimum set critical variables required study, report manage change. Here, weassess challenges ‘Big Data’ approach building global EBV products across taxa spatiotemporalscales, focusing on...
Essential Biodiversity Variables (EBV) are fundamental variables that can be used for assessing biodiversity change over time, determining adherence to policy, monitoring progress towards sustainable development goals, and tracking responses disturbances management interventions. Data from observations or models provide measured estimated EBV values, which we refer as data products, help capture the above processes trends serve a coherent framework documenting in biodiversity. Using primary...
We examine the intersection of FAIR principles (Findable, Accessible, Interoperable and Reusable), challenges opportunities presented by aggregation widely distributed heterogeneous data about biological geological specimens, use Digital Object Architecture (DOA) model components as an approach to solving those that offers adherence integral characteristic. This will be prototyped in Distributed System Scientific Collections (DiSSCo) project, pan-European Research Infrastructure which aims...
Essential biodiversity variables (EBVs) have been proposed by the Group on Earth Observations Biodiversity Observation Network (GEO BON) to identify a minimum set of essential measurements that are required for studying, monitoring and reporting ecosystem change. Despite initial conceptualisation, however, practical implementation EBVs remains challenging. There is much discussion about concept EBVs: which meaningful; data needed available; at spatial, temporal topical scales can be...
The Global Biodiversity Informatics Outlook helps to focus effort and investment towards better understanding of life on Earth our impacts upon it. It proposes a framework that will help harness the immense power information technology an open data culture, gather unprecedented evidence about biodiversity inform decisions. Much progress has been made in past ten years fulfil potential informatics. However, it is dwarfed by scale what still required. (GBIO) offers for reaching much deeper...
Marine biological invasions have increased with the development of global trading, causing homogenization communities and decline biodiversity. A main vector is ballast water exchange from shipping. This study evaluates use ecological niche modelling (ENM) to predict spread 18 non-indigenous species (NIS) along shipping routes their potential habitat suitability (hot/cold spots) in Baltic Sea Northeast Atlantic. Results show that, contrary current risk assessment methods, temperature sea ice...
Making forecasts about biodiversity and giving support to policy relies increasingly on large collections of data held electronically, substantial computational capability capacity analyse, model, simulate predict using such data. However, the physically distributed nature resources expertise in advanced analytical tools creates many challenges for modern scientist. Across wider biological sciences, presenting capabilities Internet (as “Web services”) scientific workflow systems compose them...
More and more herbaria are digitising their collections. Images of specimens made available online to facilitate access them allow extraction information from them. Transcription the data written on is critical for general discoverability enables incorporation into large aggregated research datasets. Different methods, such as crowdsourcing artificial intelligence, being developed optimise transcription, but herbarium pose difficulties in many reasons. To provide developers transcription...
DiSSCo, the Distributed System of Scientific Collections, is a pan-European Research Infrastructure (RI) mobilising, unifying bio- and geo-diversity information connected to specimens held in natural science collections delivering it scientific communities beyond. Bringing together 120 institutions across 21 countries combining earlier investments data interoperability practices with technological advancements digitisation, cloud services semantic linking, DiSSCo makes from available as one...
To examine the evidence base for telemonitoring designed patients who have chronic obstructive pulmonary disease and heart failure, to assess whether fulfils principles of monitoring is ready implementation into routine settings.Qualitative data collection using interviews participation in a multi-path mapping process.Twenty-six purposively selected informants completed semi-structured 24 individuals with expertise relevant clinical informatics domains from academia, industry, policy...
Patients with chronic disease may suffer frequent acute deteriorations and associated increased risk of hospitalisation. Earlier detection these could enable successful intervention, improving patients’ well-being reducing costs; however, current telemonitoring systems do not achieve this effectively. We conducted a qualitative study using stakeholder interviews to define standards care user requirements for improved early telemonitoring. determined that is concept has informed technology or...
Environmental research infrastructures (RIs) support their respective communities by integrating large-scale sensor/observation networks with data curation services, analytical tools and common operational policies. These RIs are developed as pillars of intra-and interdisciplinary research, however comprehension the complex, pathologically interconnected aspects Earth's ecosystem increasingly requires that researchers conduct experiments across infrastructure boundaries. Consequently, almost...
Persistent identifiers (PID) to identify digital representations of physical specimens in natural science collections (i.e., specimens) unambiguously and uniquely on the Internet are one mechanisms for digitally transforming collections-based science. Digital Specimen PIDs contribute building maintaining long-term community trust accuracy authenticity scientific data be managed presented by Distributed System Scientific Collections (DiSSCo) research infrastructure planned Europe commence...
Abstract A key limiting factor in organising and using information from physical specimens curated natural science collections is making that computable, with institutional digitization tending to focus more on imaging the themselves than efficiently capturing computable data about them. Label are traditionally manually transcribed today high cost low throughput, rendering such a task constrained for many collection-holding institutions at current funding levels. We show how computer vision,...
Essential biodiversity variables (EBV) are information products for assessing change. Species populations EBVs one class of that can be used to monitor the spread invasive species. However, systematic, reliable, repeatable procedures process primary data into do not yet exist, and environmental research infrastructures still must improve their capabilities deliver EBV products. Here, we tested ability two mature infrastructures, Global Biodiversity Information Facility Atlas Living...
The aim of the BioVeL project is to provide a seamlessly connected informatics environment that makes it easier for biodiversity scientists carry out in-silico analysis relevant data and pursue experimentation based on composing executing sequences complex digital manipulations modelling tasks. In technologists will work together meet needs demands or ‘e-Science’ create production-quality infrastructure enable pipelining into efficient integrated workflows. Workflows represent way speeding...
We describe an effective approach to automated text digitisation with respect natural history specimen labels. These labels contain much useful data about the including its collector, country of origin, and collection date. Our automatically extracting these takes form a pipeline. Recommendations are made for pipeline's component parts based on state-of-the-art technologies. Optical Character Recognition (OCR) can be used digitise images specimens. However, recognising quickly accurately...
Environmental research infrastructures (RIs) support data-intensive by integrating large-scale sensor/observer networks with dedicated data curation services and analytical tools. However the diversity of scientific disciplines coupled lack an accepted methodology for constructing new RIs inevitably leads to incompatibilities between models, metadata standards service descriptions used different RIs, inhibiting their usefulness interdisciplinary research. In absence a common global ontology...