Matthew B. Jones

ORCID: 0000-0003-0077-4738
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Scientific Computing and Data Management
  • Research Data Management Practices
  • Semantic Web and Ontologies
  • Distributed and Parallel Computing Systems
  • Species Distribution and Climate Change
  • Data Quality and Management
  • Advanced Data Storage Technologies
  • Biomedical Text Mining and Ontologies
  • Data Analysis with R
  • Advanced Database Systems and Queries
  • Environmental DNA in Biodiversity Studies
  • Climate change and permafrost
  • Data Mining Algorithms and Applications
  • Software Engineering Research
  • Advanced Computational Techniques and Applications
  • Methane Hydrates and Related Phenomena
  • Genetics, Bioinformatics, and Biomedical Research
  • Cryospheric studies and observations
  • Geographic Information Systems Studies
  • Innovative Approaches in Technology and Social Development
  • Fire effects on ecosystems
  • Genomics and Phylogenetic Studies
  • Fish Ecology and Management Studies
  • Tree Root and Stability Studies
  • Geological Modeling and Analysis

National Center for Ecological Analysis and Synthesis
2011-2024

University of California, Santa Barbara
2010-2024

University of California, Berkeley
2024

California Digital Library
2024

Baylor College of Medicine
2024

Red Hat (United States)
2023

St George’s University Hospitals NHS Foundation Trust
2022

University of Tennessee at Knoxville
2017

University of New Mexico
2017

State Street (United States)
2005-2015

Abstract Many scientific disciplines are now data and information driven, new knowledge is often gained by scientists putting together analysis discovery ‘pipelines’. A related trend that more communities realize the benefits of sharing their computational services, thus contributing to a distributed community infrastructure (a.k.a. ‘the Grid’). However, this only means an end ideally should not be too concerned with its existence. The goal for focus on development use what we call workflows...

10.1002/cpe.994 article EN Concurrency and Computation Practice and Experience 2005-12-13

Ecology is a synthetic discipline benefiting from open access to data the earth, life, and social sciences. Technological challenges exist, however, due dispersed heterogeneous nature of these data. Standardization methods development robust metadata can increase but are not sufficient. Reproducibility analyses also important, executable workflows addressing this issue by capturing provenance. Sociological challenges, including inadequate rewards for sharing data, must be resolved. The...

10.1126/science.1197962 article EN Science 2011-02-10

Most scientists conduct analyses and run models in several different software hardware environments, mentally coordinating the export import of data from one environment to another. The Kepler scientific workflow system provides domain with an easy-to-use yet powerful for capturing workflows (SWFs). SWFs are a formalization ad-hoc process that scientist may go through get raw publishable results. attempts streamline creation execution so can design, execute, monitor, re-run, communicate...

10.1109/ssdbm.2004.44 article EN 2004-06-21

Most scientists conduct analyses and run models in several different software hardware environments, mentally coordinating the export import of data from one environment to another. The Kepler scientific workflow system provides domain with an easy-to-use yet powerful for capturing workflows (SWFs). SWFs are a formalization ad-hoc process that scientist may go through get raw publishable results. attempts streamline creation execution so can design, execute, monitor, re-run, communicate...

10.1109/ssdm.2004.1311241 article EN 2004-11-12

Summary New analytical tools applied to long‐term data demonstrate that ecological communities are highly dynamic over time. We developed an r package, library(“codyn”) , help ecologists easily implement these metrics and gain broader insights into community dynamics. provides temporal diversity indices stability metrics. All functions designed be implemented multiple replicates. Temporal include species turnover, mean rank shifts rate of change Community calculate overall patterns...

10.1111/2041-210x.12569 article EN Methods in Ecology and Evolution 2016-04-02

Bioinformatics, the application of computational tools to management and analysis biological data, has stimulated rapid research advances in genomics through development data archives such as GenBank, similar progress is just beginning within ecology. One reason for belated adoption informatics approaches ecology breadth ecologically pertinent (from genes biosphere) its highly heterogeneous nature. The variety formats, logical structures, sampling methods create significant challenges....

10.1146/annurev.ecolsys.37.091305.110031 article EN Annual Review of Ecology Evolution and Systematics 2006-08-14

The field of ecology is poised to take advantage emerging technologies that facilitate the gathering, analyzing, and sharing data, methods, results. concept transparency at all stages research process, coupled with free open access code, papers, constitutes “open science.” Despite many benefits an approach science, a number barriers entry exist may prevent researchers from embracing openness in their own work. Here we describe several key shifts mindset underpin transition more science....

10.1890/es14-00402.1 article EN cc-by Ecosphere 2015-07-01

The act of sharing scientific knowledge is rapidly evolving away from traditional articles and presentations to the delivery executable objects that integrate data computational details (e.g., scripts workflows) upon which findings rely. This envisioned coupling process essential advancing science but faces technical institutional barriers. Whole Tale project aims address these barriers by connecting computational, data-intensive research efforts with larger process—transforming discovery...

10.1016/j.future.2017.12.029 article EN cc-by Future Generation Computer Systems 2018-02-10

The scale and magnitude of complex pressing environmental issues lend urgency to the need for integrative reproducible analysis synthesis, facilitated by data-intensive research approaches. However, recent pace technological change has been such that appropriate skills accomplish are lacking among scientists, who more than ever greater access training mentorship in computational skills. Here, we provide a roadmap raising data competencies current next-generation researchers describing...

10.1093/biosci/bix025 article EN cc-by-nc BioScience 2017-03-15

This paper assesses trending AI foundation models, especially emerging computer vision models and their performance in natural landscape feature segmentation. While the term model has quickly garnered interest from geospatial domain, its definition remains vague. Hence, this will first introduce defining characteristics. Built upon tremendous success achieved by Large Language Models (LLMs) as for language tasks, discusses challenges of building artificial intelligence (GeoAI) tasks. To...

10.3390/rs16050797 article EN cc-by Remote Sensing 2024-02-24

Metacat is a network-enabled database framework that lets users store, query, and retrieve XML documents with arbitrary schemas in SQL-compliant relational systems. The system (available from the Knowledge Network for Biocomplexity, http://knb.ecoinformatics.org/) incorporates RDF-like methods packaging data sets to allow researchers customize revise their metadata. It extensible flexible enough preserve utility interpretability working future content standards. solves several key challenges...

10.1109/4236.957896 article EN IEEE Internet Computing 2001-01-01

This section highlights new and emerging areas of technology methodology. Topics may range from hardware software, to statistical analyses technologies that could be used in ecological research. Articles should no longer than a few thousand words, sent the editors, David Inouye (E-mail: inouye@umd.edu) or Sam Scheiner sschein@nsf.gov). Data are at heart empirically based sciences, serving as primary evidence supporting (or refuting) models way our natural world operates. While most...

10.1890/0012-9623-90.2.205 article EN Bulletin of the Ecological Society of America 2009-03-31

In this review, we adopt the definition that 'Data citation is a reference to data for purpose of credit attribution and facilitation access data' (TGDCSP 2013: CIDCR6). Furthermore, should be enabled both humans machines (DCSG 2014). We use discuss how has evolved over last couple decades highlight issues need more research attention. Data not new concept, but it changed considerably since beginning digital age. Basic practice now established slowly increasingly being implemented....

10.5334/dsj-2019-052 article EN cc-by Data Science Journal 2019-01-01

The National Science Foundation's Arctic Data Center is the primary data repository for NSF-funded research conducted in Arctic.There are major challenges discovering and interpreting resources a containing as heterogeneous interdisciplinary those Center.This paper reports on advances cyberinfrastructure at that help address these issues by leveraging semantic technologies enhance repository's adherence to FAIR principles improve Findability, Accessibility, Interoperability, Reusability of...

10.5334/dsj-2024-002 article EN cc-by Data Science Journal 2024-01-01

GeoLink has leveraged linked data principles to create a dataset that allows users seamlessly query and reason over some of the most prominent geoscience metadata repositories in United States. The includes such diverse information as port calls made by oceanographic cruises, physical sample metadata, research project funding staffing, authorship technical reports. been published according best practices for is publicly available via SPARQL Protocol RDF Query Language (SPARQL) end point at...

10.1080/20964471.2018.1469291 article EN cc-by Big Earth Data 2018-04-03
Coming Soon ...