- Data Visualization and Analytics
- Data Analysis with R
- Image and Video Quality Assessment
- Data Quality and Management
- Advanced Text Analysis Techniques
- Multimedia Communication and Technology
- Video Analysis and Summarization
- Data-Driven Disease Surveillance
- Aesthetic Perception and Analysis
- Mental Health Research Topics
- Big Data and Business Intelligence
- Innovative Human-Technology Interaction
- Statistics Education and Methodologies
- Species Distribution and Climate Change
- Time Series Analysis and Forecasting
- Plant Virus Research Studies
- Viral Infections and Vectors
- Data Management and Algorithms
- Anomaly Detection Techniques and Applications
- Human Mobility and Location-Based Analysis
- Participatory Visual Research Methods
- Viral gastroenteritis research and epidemiology
- Computational and Text Analysis Methods
- Genomics and Phylogenetic Studies
- Behavioral and Psychological Studies
Northeastern University
2024
Universidad del Noreste
2023-2024
Independent Sector
2024
DePaul University
2024
Northwestern University
2024
Tableau Software (United States)
2018-2023
University of Washington
2016-2023
Seattle University
2021-2023
Linköping University
2023
University of Utah
2023
Dashboards are one of the most common use cases for data visualization, and their design contexts considerably different from exploratory visualization tools. In this paper, we look at broad scope how dashboards used in practice through an analysis dashboard examples documentation about use. We systematically review literature surrounding use, construct a space dashboards, identify major types. characterize by goals, levels interaction, practices around them. Our framework suggest number...
When making an inference or comparison with uncertain, noisy, incomplete data, measurement error and confidence intervals can be as important for judgment the actual mean values of different groups. These often misunderstood statistical quantities are frequently represented by bar charts bars. This paper investigates drawbacks this standard encoding, considers a set alternatives designed to more effectively communicate implications data general audience, drawing from lessons learned use...
Understanding and accounting for uncertainty is critical to effectively reasoning about visualized data. However, evaluating the impact of an visualization complex due difficulties that people have interpreting challenge defining correct behavior with information. Currently, evaluators must rely on general purpose evaluation frameworks which can be ill-equipped provide guidance unique assessing judgments under uncertainty. To help navigate these complexities, we present a taxonomy...
Many visualization tasks require the viewer to make judgments about aggregate properties of data. Recent work has shown that viewers can perform such effectively, for example efficiently compare maximums or means over ranges However, this also shows effectiveness depends on designs displays. In paper, we explore relationship between aggregation task and design provide guidance matching with designs. We combine prior results from perceptual science graphical perception suggest a set variables...
Understanding uncertainty is critical for many analytical tasks. One common approach to encode data values and independently, using two visual variables. These resulting bivariate maps can be difficult interpret, interference between channels reduce the discriminability of marks. To address this issue, we contribute Value-Suppressing Uncertainty Palettes (VSUPs). VSUPs allocate larger ranges a channel when low, smaller high. This non-uniform budgeting makes more economical use limited...
The visual system can make highly efficient aggregate judgements about a set of objects, with speed roughly independent the number objects considered. While there is rich literature on these mechanisms and their ramifications for summarization tasks, this prior work rarely considers more complex tasks requiring multiple over long periods time, has not considered certain critical aggregation types, such as localization mean value points. In paper, we explore questions using common...
Visualizations often seek to aid viewers in assessing the big picture data, that is, make judgments about aggregate properties of data. In this paper, we present an empirical study a representative judgment task: finding regions maximum average series. We show how theory perceptual averaging suggests visual design other than typically-used line graph. describe experiment assesses participants' ability estimate averages and based on these averages. The confirms color encoding significantly...
Observing trends and predicting future values are common tasks for viewers of bivariate data visualizations. As many charts do not explicitly include trend lines or related statistical summaries, often visually estimate directly from a plot. How reliable the inferences draw when performing such regression by eye? Do particular visualization designs features bias perception? We present series crowdsourced experiments that assess accuracy estimated using eye across variety visualizations,...
Famous examples such as Anscombe's Quartet highlight that one of the core benefits visualizations is allowing people to discover visual patterns might otherwise be hidden by summary statistics. This inspection particularly important in exploratory data analysis, where analysts can use histograms and dot plots identify quality issues. Yet, these are driven parameters histogram bin size or mark opacity have a great deal impact on final appearance chart, but rarely optimized make features...
Visualizations have a potentially enormous influence on how data are used to make decisions across all areas of human endeavor. However, it is not clear this power connects ethical duties: what obligations do we when comes visualizations and visual analytics systems, beyond our duties as scientists engineers? Drawing historical contemporary examples, I address the moral components design use visualizations, identify some ongoing visualization research with dilemmas, propose set additional...
Bar charts with y-axes that don't begin at zero can visually exaggerate effect sizes. However, advice for whether or not to truncate the y-axis be equivocal other visualization types. In this paper we present examples of visualizations where truncation beneficial as well harmful, depending on communicative and analytic intent. We also results a series crowd-sourced experiments in which examine how impacts subjective size across types, explore alternative designs more directly alert viewers...
Key biological properties such as high genetic diversity and evolutionary rate enhance the potential of certain RNA viruses to adapt emerge. Identifying with these in their natural hosts could dramatically improve disease forecasting surveillance. Recently, we discovered two novel members viral family Arteriviridae: simian hemorrhagic fever virus (SHFV)-krc1 SHFV-krc2, infecting a single wild red colobus (Procolobus rufomitratus tephrosceles) Kibale National Park, Uganda. Nearly nothing is...
Sketching allows analysts to specify complex and free-form patterns of interest. Visual query systems can make use sketches locate these interest in large datasets. However, sketching is ambiguous: the same drawing could represent a multitude potential queries. In this work, we investigate ambiguities as they apply visual for time series data. We define class "invariants" - properties that analyst wishes ignore when performing sketch-based query. present results crowd-sourced study, showing...
Abstract Chart construction errors, such as truncated axes or inexpressive visual encodings, can hinder reading a visualization, worse, imply misleading facts about the underlying data. These errors be caught by critical readings of visualizations, but readers must have high level data and design literacy paying close attention. To address this issue, we introduce VisuaLint: technique for surfacing chart in situ. Inspired ubiquitous red wavy underline that indicates spelling mistakes,...
Thematic maps are commonly used for visualizing the density of events in spatial data. However, these can mislead by giving visual prominence to known base rates (such as population densities) or artifacts sample size and normalization outliers arising from smaller, thus more variable, samples). In this work, we adapt Bayesian surprise generate that counter biases. surprise, which has shown promise modeling human attention, weights information with respect how it updates beliefs over a space...
Abstract Data can be aggregated in many ways before being visualized charts, profoundly affecting what a chart conveys. Despite this importance, the type of aggregation is often communicated only via axis titles. In paper, we investigate use animation to disambiguate different types and communicate meaning aggregate operations. We present design rationales for animated transitions depicting operations results an experiment assessing impact these on identification tasks. find that judiciously...
A common research process in visualization is for researchers to collaborate with domain experts solve particular applied data problems. While there existing guidance and expertise around how structure collaborations strengthen contributions, comparatively little on navigate the implications of, power produced through socio-technical entanglements of collaborations. In this paper, we qualitatively analyze reflective interviews past participants from multiple perspectives: graduate students,...
Researchers have proposed many definitions of visualization literacy, targeting various aspects the term. But we yet to fully capture what it really means be literate in visualizations, which has important downstream implications, such as how effectively teach skills younger generations. We ran a meetup at IEEE VIS 2022 that attracted over 30 researchers field, who discussed literacy measure it, can improve develops, and relates other literacies. ACM CHI track record attracting from fields...
Abstract Modern literary scholars must combine access to vast collections of text with the traditional close analysis their field. In this paper, we discuss design and development tools support work. Based on needs scholars, constructed a suite visualization for large tagged (i.e. where one or more words have been annotated as belonging specific category). These unite aspects scholars’ work: scale overview help identify corpus‐wide statistical patterns while fine assist in finding details...
Since the 1960s, simian hemorrhagic fever virus (SHFV; Nidovirales, Arteriviridae) has caused highly fatal outbreaks of viral in captive Asian macaque colonies. However, source(s) these and natural reservoir(s) this remain obscure. Here we report identification two novel, divergent arteriviruses related to SHFV, Mikumi yellow baboon 1 (MYBV-1) Southwest (SWBV-1), wild baboons, respectively, demonstrate recent transmission SWBV-1 among baboons. These findings extend our knowledge genetic...
Dirty data and deceptive design practices can undermine, invert, or invalidate the purported messages of charts graphs. These failures arise silently: a conclusion derived from particular visualization may look plausible unless analyst looks closer discovers an issue with backing data, visual specification, their own assumptions. We term such silent but significant "visualization mirages". describe conceptual model mirages show how they be generated at every stage analytics process. adapt...
ABSTRACT CD8 + T cell responses rapidly select viral variants during acute human immunodeficiency virus (HIV)/simian (SIV) infection. We used pyrosequencing to examine variation within three SIV-derived epitopes (Gag 386-394 GW9, Nef 103-111 RM9, and Rev 59-68 SP10) targeted by immunodominant in acutely infected Mauritian cynomolgus macaques. In animals recognizing all epitopes, SP10 was associated with delayed accumulation of Gag GW9 but had no effect on RM9. This demonstrates that the...
In this paper, we examine the robustness of scagnostics through a series theoretical and empirical studies. First, investigate sensitivity by employing perturbing operations on more than 60M synthetic real-world scatterplots. We found that two scagnostic measures, Outlying Clumpy, are overly sensitive to data binning. To understand how these measures align with human judgments visual features, conducted study 24 participants, which reveals i) humans not small perturbations cause large...
A central tenet of information visualization research and practice is the notion visual variable effectiveness, or perceptual precision at which values are decoded given channels encoding. Formative work from Cleveland & McGill has shown that position along a common axis most effective for comparing individual values. One natural conclusion any chart not dot plot scatterplot deficient should be avoided. In this paper we refute caricature "scatterplots only" argument as way to call new...
A valuable task in text visualization is to have viewers make judgments about that has been annotated (either by hand or some algorithm such as clustering entity extraction). In this work we look at the ability of relative quantities tags (specifically tagged with one a set qualitatively distinct colors), and examine design choices can improve performance extracting statistical information from these texts. We find efficiently accurately estimate proportions tag levels over range situations;...