You-Wei Cheah

ORCID: 0000-0003-2241-4901
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Scientific Computing and Data Management
  • Distributed and Parallel Computing Systems
  • Research Data Management Practices
  • Data Quality and Management
  • Atmospheric and Environmental Gas Dynamics
  • Advanced Data Storage Technologies
  • Peatlands and Wetlands Ecology
  • Geochemistry and Geologic Mapping
  • Plant Water Relations and Carbon Dynamics
  • Environmental Monitoring and Data Management
  • Climate variability and models
  • Advanced Database Systems and Queries
  • Advanced Algorithms and Applications
  • Energy Load and Power Forecasting
  • Gas Dynamics and Kinetic Theory
  • Cloud Computing and Resource Management
  • Carbon Dioxide Capture Technologies
  • Big Data and Business Intelligence
  • Fault Detection and Control Systems
  • Semantic Web and Ontologies
  • Remote Sensing in Agriculture
  • Embedded Systems Design Techniques
  • Topic Modeling
  • Species Distribution and Climate Change
  • Sensor Technology and Measurement Systems

Lawrence Berkeley National Laboratory
2016-2024

Indiana University
2011-2014

Indiana University Bloomington
2010-2014

Gilberto Pastorello Carlo Trotta Eleonora Canfora Housen Chu Danielle Christianson and 95 more You-Wei Cheah C. Poindexter Jiquan Chen Abdelrahman Elbashandy Marty Humphrey Peter Isaac Diego Polidori Markus Reichstein Alessio Ribeca Catharine van Ingen Nicolas Vuichard Leiming Zhang B. D. Amiro Christof Ammann M. Altaf Arain Jonas Ardö Timothy J. Arkebauer Stefan K. Arndt Nicola Arriga Marc Aubinet Mika Aurela Dennis Baldocchi Alan Barr Eric Beamesderfer Luca Belelli Marchesini Onil Bergeron Jason Beringer Christian Bernhofer Daniel Berveiller D. P. Billesbach T. Andrew Black Peter D. Blanken Gil Bohrer Julia Boike Paul V. Bolstad Damien Bonal Jean-Marc Bonnefond D. R. Bowling Rosvel Bracho Jason Brodeur Christian Brümmer Nina Buchmann Benoît Burban Sean P. Burns Pauline Buysse Peter Cale M. Cavagna Pierre Cellier Shiping Chen Isaac Chini Torben R. Christensen James Cleverly Alessio Collalti Claudia Consalvo Bruce D. Cook David Cook Carole Coursolle Edoardo Cremonese Peter S. Curtis Ettore D’Andrea Humberto da Rocha Xiaoqin Dai K. J. Davis Bruno De Cinti A. de Grandcourt Anne De Ligne Raimundo Cosme de Oliveira Nicolas Delpierre Ankur R. Desai Carlos Marcelo Di Bella Paul Di Tommasi A. J. Dolman Francisco Domingo Gang Dong Sabina Dore Pierpaolo Duce Éric Dufrêne Allison L. Dunn Jiří Dušek Derek Eamus Uwe Eichelmann Hatim Abdalla M. ElKhidir Werner Eugster Cäcilia Ewenz B. E. Ewers D. Famulari Silvano Fares Iris Feigenwinter Andrew Feitz Rasmus Fensholt Gianluca Filippa M. L. Fischer J. M. Frank Marta Galvagno Mana Gharun

Abstract The FLUXNET2015 dataset provides ecosystem-scale data on CO 2 , water, and energy exchange between the biosphere atmosphere, other meteorological biological measurements, from 212 sites around globe (over 1500 site-years, up to including year 2014). These sites, independently managed operated, voluntarily contributed their create global datasets. Data were quality controlled processed using uniform methods, improve consistency intercomparability across sites. is already being used...

10.1038/s41597-020-0534-3 article EN cc-by Scientific Data 2020-07-09
Gilberto Pastorello Carlo Trotta Eleonora Canfora Housen Chu Danielle Christianson and 95 more You-Wei Cheah C. Poindexter Jiquan Chen Abdelrahman Elbashandy Marty Humphrey Peter Isaac Diego Polidori Markus Reichstein Alessio Ribeca Catharine van Ingen Nicolas Vuichard Leiming Zhang B. D. Amiro Christof Ammann M. Altaf Arain Jonas Ardö Timothy J. Arkebauer Stefan K. Arndt Nicola Arriga Marc Aubinet Mika Aurela Dennis Baldocchi Alan Barr Eric Beamesderfer Luca Belelli Marchesini Onil Bergeron Jason Beringer Christian Bernhofer Daniel Berveiller D. P. Billesbach T. Andrew Black Peter D. Blanken Gil Bohrer Julia Boike Paul V. Bolstad Damien Bonal Jean-Marc Bonnefond D. R. Bowling Rosvel Bracho Jason Brodeur Christian Brümmer Nina Buchmann Benoît Burban Sean P. Burns Pauline Buysse Peter Cale M. Cavagna Pierre Cellier Shiping Chen Isaac Chini Torben R. Christensen James Cleverly Alessio Collalti Claudia Consalvo Bruce D. Cook David Cook Carole Coursolle Edoardo Cremonese Peter S. Curtis Ettore D’Andrea Humberto da Rocha Xiaoqin Dai K. J. Davis Bruno De Cinti A. de Grandcourt Anne De Ligne Raimundo Cosme de Oliveira Nicolas Delpierre Ankur R. Desai Carlos Marcelo Di Bella Paul Di Tommasi A. J. Dolman Francisco Domingo Gang Dong Sabina Dore Pierpaolo Duce Éric Dufrêne Allison L. Dunn Jiří Dušek Derek Eamus Uwe Eichelmann Hatim Abdalla M. ElKhidir Werner Eugster Cäcilia Ewenz B. E. Ewers D. Famulari Silvano Fares Iris Feigenwinter Andrew Feitz Rasmus Fensholt Gianluca Filippa M. L. Fischer J. M. Frank Marta Galvagno Mana Gharun

A Correction to this paper has been published: https://doi.org/10.1038/s41597-021-00851-9.

10.1038/s41597-021-00851-9 article EN cc-by Scientific Data 2021-02-25

AmeriFlux is a network of research sites that measure carbon, water, and energy fluxes between ecosystems the atmosphere using eddy covariance technique to study variety Earth science questions. AmeriFlux's diversity ecosystems, instruments, data-processing routines create challenges for data standardization, quality assurance, sharing across network. To address these challenges, Management Project (AMP) designed implemented BASE pipeline. The pipeline begins with uploaded by site teams,...

10.1038/s41597-023-02531-2 article EN cc-by Scientific Data 2023-09-11

Visualization facilitates the understanding of scientific data both through exploration and explanation visualized data. Provenance also contributes to by containing contributing factors behind a result. The visualization provenance, although supported in existing workflow management systems, generally focuses on small (medium) sized provenance data, lacking techniques deal with big high complexity. This paper discusses developed for including layout algorithm, visual style, graph...

10.1109/hipc.2012.6507517 article EN 2012-12-01
Kyle Delwiche Sara Knox Avni Malhotra Etienne Fluet‐Chouinard Gavin McNicol and 95 more Sarah Féron Zutao Ouyang Dario Papale Carlo Trotta Eleonora Canfora You-Wei Cheah Danielle Christianson M. Carmelita R. Alberto Pavel Alekseychik Mika Aurela Dennis Baldocchi Sheel Bansal David P. Billesbach Gil Bohrer Rosvel Bracho Nina Buchmann David I. Campbell Gerardo Celis Jiquan Chen Weinan Chen Housen Chu Higo J. Dalmagro Sigrid Dengel Ankur R. Desai Matteo Detto A. J. Dolman Elke Eichelmann E. S. Euskirchen D. Famulari Thomas Friborg Kathrin Fuchs Mathias Goeckede Sébastien Gogo Mangaliso J. Gondwe Jordan P. Goodrich Pia Gottschalk Scott L. Graham Martin Heimann Manuel Helbig Carole Helfter Kyle S. Hemes Takashi Hirano David Y. Hollinger Lukas Hörtnagl Hiroki Iwata Adrien Jacotot Joachim Jansen Gerald Jurasinski Minseok Kang Kuno Kasak John King Janina Klatt Franziska Koebsch Ken W. Krauss Derrick Y.F. Lai Ivan Mammarella Giovanni Manca Luca Belelli Marchesini Jaclyn Hatala Matthes Trofim Maximon Lutz Merbold Bhaskar Mitra Timothy H. Morin Eiko Nemitz Mats B. Nilsson Shuli Niu Walter C. Oechel Patricia Y. Oikawa Keisuke Ono Matthias Peichl Olli Peltola Michele L. Reba Andrew D. Richardson W. J. Riley Benjamin R. K. Runkle Youngryel Ryu Torsten Sachs Ayaka Sakabe Camilo Rey‐Sánchez Edward A. G. Schuur Karina V. R. Schäfer Oliver Sonnentag Jed P. Sparks Ellen Stuart‐Haëntjens Cove Sturtevant Ryan C. Sullivan Daphne Szutu Jonathan E. Thom Margaret Torn Eeva‐Stiina Tuittila Jessica A. Turner Masahito Ueyama Alex Valach Rodrigo Vargas Andrej Varlagin

Abstract. Methane (CH4) emissions from natural landscapes constitute roughly half of global CH4 contributions to the atmosphere, yet large uncertainties remain in absolute magnitude and seasonality emission quantities drivers. Eddy covariance (EC) measurements flux are ideal for constraining ecosystem-scale emissions, including their seasonality, due quasi-continuous high temporal resolution measurements, coincident carbon, water, energy fluxes, lack ecosystem disturbance, increased...

10.5194/essd-2020-307 preprint EN cc-by 2021-01-18

The volume and complexity of data produced analyzed in scientific collaborations is growing exponentially. It important to track data-intensive analysis workflows provide context reproducibility as transformed these collaborations. Provenance addresses this need aids scientists by providing the lineage or history how generated, used modified. has traditionally been collected at workflow level often making it hard capture relevant information about resource characteristics difficult for users...

10.1109/bigdata.congress.2013.16 article EN 2013-06-01

It can be natural to believe that many of the traditional issues scale have been eliminated or at least greatly reduced via cloud computing. That is, if one create a seemingly well functioning application operates correctly on small moderate-sized problems, then very nature programming abstractions means same will run as potentially significantly larger problems. In this paper, we present our experiences taking MODISAzure, satellite data processing system built Windows Azure computing...

10.1109/escience.2010.47 article EN 2010-12-01

We live in an era which scientific discovery is increasingly driven by data exploration of massive datasets. Scientists today are envisioning diverse analyses and computations that scale from the desktop to supercomputers, yet often have difficulty designing constructing software architectures accommodate heterogeneous inconsistent at scale. Moreover, computational resource needs can vary widely over time. The grow as science collaboration broadens or additional accumulated; demand large...

10.1177/1094342011414746 article EN The International Journal of High Performance Computing Applications 2011-08-01

Data provenance, a key piece of metadata that describes the lifecycle data product, is crucial in aiding scientists to better understand and facilitate reproducibility reuse scientific results. Provenance collection systems often capture provenance on fly protocol between application tool may not be reliable. As result, can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues provenance. We also establish dimensions are especially critical for evaluation...

10.1109/escience.2012.6404480 article EN 2012-10-01

Data provenance, a form of metadata describing the life cycle data product, is crucial in sharing research data. Research data, when shared over decades, requires recipients to make determination both use and trust. That is, can they data? More importantly, trust it? Knowing are high quality one factor establishing fitness for Provenance be used assert but provenance must known as well. We propose framework assessing provenance. identify issues establish key dimensions, define analysis....

10.1145/2665069 article EN Journal of Data and Information Quality 2014-12-12

Recent emphasis and requirements for open data publication have led to significant increases in availability the Earth sciences, which is critical long-tail integration. Currently, are often published a repository with an identifier citation, similar those papers. Subsequent publications that use expected provide citation reference section of paper. However, format still evolving, particularly regards citing dynamic data, subsets, collections data. Considering motivations both producers...

10.1016/j.ecoinf.2021.101251 article EN cc-by Ecological Informatics 2021-02-16

Data quality control is one of the most time consuming activities within Research Infrastructures (RIs), especially when involving observational data and multiple providers. In this work we report on our ongoing development rogues, a scalable approach to manage issues for RIs. The motivation started with creation FLUXNET2015 dataset, which includes carbon, water, energy fluxes plus micrometeorological ancillary measured in over 200 sites around world. To create an uniform including derived...

10.1109/escience.2017.64 article EN 2017-10-01

The Carbon Capture Simulation Initiative (CCSI) project has developed and deployed scientific infrastructure called the CCSI Toolset. Toolset provides state-of-the-art computational modeling simulation tools to accelerate commercialization of carbon capture technologies from discovery development, demonstration, ultimately widespread deployment hundreds power plants. have potential dramatically reduce emissions end users in industry with a comprehensive, integrated suite leading-edge,...

10.1109/escience.2016.7870924 article EN 2016-10-01

Widely used in studies ranging from ecophysiology dynamics to global estimates using models and remote sensing data, FLUXNET datasets have become key scientific research applications. More frequently updated high-quality data collections are ever more pressing, serving opportunities with new technologies real-world applications including nature-based technological climate solutions, carbon credit verification, support agriculture decision systems, ecological forecasting. The three major...

10.5194/egusphere-egu24-14485 preprint EN 2024-03-09

Qualitative user research is a human-intensive approach that draws upon ethnographic methods from social sciences to develop insights about work practices inform software design and development. Recent advances in data science, particular, natural language processing (NLP), enables the derivation of machine-generated augment existing techniques. Our describes our prototype framework based Jupyter, tool supports interactive science scientific computing, leverages NLP techniques make sense...

10.1109/escience.2018.00076 article EN 2018-10-01
Coming Soon ...