Sara Stoudt

ORCID: 0000-0002-1693-8058
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Scientific Computing and Data Management
  • Species Distribution and Climate Change
  • Research Data Management Practices
  • Statistics Education and Methodologies
  • Big Data and Business Intelligence
  • Plant and animal studies
  • Scientific Measurement and Uncertainty Evaluation
  • Statistical Methods and Bayesian Inference
  • Animal and Plant Science Education
  • Online Learning and Analytics
  • Big Data Technologies and Applications
  • Genetics, Bioinformatics, and Biomedical Research
  • Academic Writing and Publishing
  • Gender, Labor, and Family Dynamics
  • Data Analysis with R
  • Wildlife Ecology and Conservation
  • Lepidoptera: Biology and Taxonomy
  • Computational Physics and Python Applications
  • Healthcare Policy and Management
  • Digital Storytelling and Education
  • Meta-analysis and systematic reviews
  • Sensor Technology and Measurement Systems
  • Statistical Methods and Inference
  • Housing Market and Economics
  • Educational Assessment and Improvement

Bucknell University
2022-2025

Smith College
2016-2022

University of California, Berkeley
2016-2022

Simmons University
2022

FOM University of Applied Sciences for Economics and Management
2022

United States Department of Commerce
2017

National Institute of Standards and Technology
2017

Identifying rates at which birders engage with different species can inform the impact and efficacy of conservation outreach scientific use community-collected biodiversity data. Species that are thought to be “charismatic” often prioritized in conservation, previous researchers have used sociological experiments digital records estimate charisma indirectly. In this study, we take advantage community science efforts as another record human engagement animals reveal observer biases directly,...

10.1073/pnas.2110156119 article EN cc-by-nc-nd Proceedings of the National Academy of Sciences 2022-04-11

Participatory science (or ‘citizen science') records are becoming increasingly useful for wildlife monitoring due to their volume and spatiotemporal coverage. Statistical analysis of these data can be challenging the many sources sampling heterogeneity that need accounted for. Many previous studies characterize variability across entire participatory datasets, such as spatial in effort or species preferences. User‐level behavior is less well studied, but it may just important dataset‐level...

10.1111/oik.10938 article EN cc-by Oikos 2025-03-04

Traditional data science education often omits training on research workflows: the process that moves a scientific investigation from raw to coherent question insightful contribution. In this paper, we elaborate basic principles of reproducible analysis workflow by defining three phases: Exploratory, Refinement, and Polishing Phases. Each phase is roughly centered around audience whom decisions, methodologies, results are being immediately communicated. Importantly, each can also give rise...

10.1371/journal.pcbi.1008770 article EN cc-by PLoS Computational Biology 2021-03-18

The Consultative Committee for Mass and related quantities (ccm), of the International weights measures (cipm), has recently declared readiness community to support redefinition international system units (SI) at next meeting General Conference on Weights Measures (cgpm) scheduled November, 2018.

10.1088/1681-7575/aa966c article EN cc-by Metrologia 2017-10-27

Hierarchical statistical models are important in applied sciences because they capture complex relationships data, especially when variables related by space, time, sampling unit, or other shared features. Existing methods for maximum likelihood estimation that rely on Monte Carlo integration over latent variables, such as Expectation Maximization (MCEM), suffer from drawbacks efficiency and/or generality. We harness a connection between sampling-stepping iterations and stochastic gradient...

10.1080/02664763.2024.2383284 article EN Journal of Applied Statistics 2024-07-24

An errors-in-variables regression method is presented as an alternative to the ordinary least-squares computation currently employed for determining calibration function force measuring instruments from data acquired during calibration. A Monte Carlo uncertainty evaluation also presented. The corresponding (which we call measurement function, often called analysis in gas metrology) necessary subsequent use of calibrated device measure force, and associated evaluation, are derived results....

10.1088/0026-1394/53/3/965 article EN Metrologia 2016-05-20

What are the challenges and best practices for doing data-intensive research in teams, labs, other groups? This paper reports from a discussion which researchers many different disciplines departments shared their experiences on data science domains. The issues we discuss range technical to social, including with getting same computational stack, workflow pipeline management, handoffs, composing well-balanced team, dealing fluid membership, fostering coordination communication, not...

10.31235/osf.io/a7b3m preprint EN 2018-11-15

Abstract Understanding variation in species occupancy is an important task for conservation. When assessing patterns over multiple temporal seasons, it recommended to visit at least a subset of sites times within season during period closure account observation biases. However, logistical constraints can inhibit re‐visitation season, resulting the use single‐visit multi‐season models. Some have suggested that autocorrelation space and/or time provide “fractional replication” separately...

10.1111/2041-210x.14275 article EN cc-by-nc Methods in Ecology and Evolution 2023-12-26

What actions can we take to foster diverse and inclusive workplaces in the broad fields around data science? This paper reports from a discussion which researchers many different disciplines departments raised questions shared their experiences with various aspects diversity, inclusion, equity. The issues discuss include fostering interpersonal small group dynamics, rules codes of conduct, increasing diversity less-representative groups disciplines, organizing events for long-term efforts...

10.31235/osf.io/8gsjz article EN 2019-02-02

The National Academies report, "Reproducibility and Replicability in Science," outlines values broad goals for reproducible replicable scientific research, emphasizing the importance of its availability transparency. Confronted with these goals, we examine what it means research material to be transparent, available, reproducible. As early-career researchers, advocate focus placed on reusable extensible which is naturally achieved when reproducible, open. When understand inner workings a...

10.1162/99608f92.1cc3d72a article EN cc-by Harvard data science review 2020-12-16

Abstract Small datasets comprising observations made under conditions of repeatability or reproducibility pervade the practice measurement science. Many laboratories typically will make only one determination, occasionally they two, and rarely three more replicate determinations same measurand. Interlaboratory comparisons, including key meta-analyses, often involve a handful participants. These limitations pose considerable challenges to production reliable uncertainty evaluations. This...

10.1088/1681-7575/abd372 article EN Metrologia 2020-12-14

A bstract Identifying which species are perceived as charismatic can improve the impact and efficiency of conservation outreach, receive more funding have their needs prioritized (9; 17; 13). Sociological experiments studying animal charisma relied on stated preferences to find correlations between hypothetical “willingness pay” or “empathy” for a species’ size, color, aesthetic appeal (51; 13; 16). Recognizing increasing availability digital records public engagement with animals that...

10.1101/2021.06.05.446577 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2021-06-07

Data scientists rely on many technical skills and the ability to reason about data solve problems.As educators grapple with how prepare students in this new field, they are faced identifying both what a student must know should be able do by end of their science education, also collect evidence those abilities.We present way unite coordinate individual efforts toward training well-rounded scientists: portfolio that highlights strong communication.This structuring classroom assignments...

10.1162/99608f92.3c097160 article EN cc-by Harvard data science review 2021-07-23

10.1080/10724117.2024.2312039 article EN Math Horizons 2024-04-02

In 2022, the Journal of Statistics and Data Science Education (JSDSE) instituted augmented requirements for authors to post deidentified data code underlying their papers. These changes were prompted by an increased focus on reproducibility open science (NASEM 2019). A recent review availability practices noted that "such policies help increase published literature, as well make a larger body available reuse re-analysis" (PLOS ONE, 2024). JSDSE values accessibility it endeavors share...

10.1080/26939169.2024.2364737 preprint EN arXiv (Cornell University) 2024-05-28

Participatory science (or "citizen science") records are becoming increasingly useful for wildlife monitoring due to their volume and spatiotemporal coverage. However, statistical analysis using these data can be challenging the many sources of bias that need corrected. Many previous studies characterize sampling biases across entire participatory datasets, such as spatial heterogeneity in effort or species preferences. User-level behavior is less well studied, but it may just important...

10.22541/au.172510569.98511431/v1 preprint EN Authorea (Authorea) 2024-08-31

Abstract Deborah Nolan and Sara Stoudt present a framework for learning the art of statistical storytelling

10.1111/1740-9713.01469 article EN Significance 2020-12-01

Since coverage intervals are widely used expressions of measurement uncertainty, this contribution reviews as defned in the Guide to Expression Uncertainty Measurement (GUM), and compares them against principal types probabilistic that commonly applied statistics science. Although formally identical conventional confdence for means, GUM interprets more if they were Bayesian credible intervals, or tolerance intervals. We focus, particular, on a common misunderstanding about derived from...

10.6028/jres.126.004 article EN publisher-specific-oa Journal of Research of the National Institute of Standards and Technology 2021-03-03

Turnover is a fact of life for any project, and academic research teams can face particularly high levels people who come go through the duration project. In this article, we discuss challenges turnover some potential practices helping manage it, computational- data-intensive projects. The topics include establishing implementing data management plans, file format standardization, workflow process documentation, clear team roles, check-in check-out procedures.

10.31235/osf.io/wsxru preprint EN 2019-03-06

A traffic generation model is a stochastic of the data flow in communication network. These models are useful during development telecommunication technologies and for analyzing performance capacity various protocols,algorithms, network topologies. We present here two modeling approaches simulating internet traffic. In our models, we simulate length interarrival times individual packets, discrete unit transfer over internet. Our first approach based on fitting to known theoretical...

10.33697/ajur.2016.028 article EN American Journal of Undergraduate Research 2016-08-31
Coming Soon ...