Katy Williams

ORCID: 0000-0003-0864-1446
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Data Visualization and Analytics
  • Data Analysis with R
  • Parallel Computing and Optimization Techniques
  • Multimedia Communication and Technology
  • Cloud Computing and Resource Management
  • Data Quality and Management
  • Software System Performance and Reliability
  • Software Engineering Research
  • Big Data and Business Intelligence
  • Advanced Data Storage Technologies
  • Scientific Computing and Data Management
  • Online Learning and Analytics
  • Complex Network Analysis Techniques
  • Data Stream Mining Techniques
  • Context-Aware Activity Recognition Systems
  • Big Data Technologies and Applications
  • Advanced Text Analysis Techniques
  • Usability and User Interface Design

Davidson College
2024

University of Arizona
2018-2023

Data workers may have a different mental model of their data that the one reified in code. Understanding organization is necessary for analyzing data, be it through scripting, visualization or abstract thought. More complicated organizations, such as tables with attached hierarchies, tax people's ability to think about and interact data. To better understand ultimately design these situations, we conduct study across team ten people work ing same model. Through interviews sketching, working...

10.48550/arxiv.2501.09862 preprint EN arXiv (Cornell University) 2025-01-16

Common pitfalls in visualization projects include lack of data availability and the domain users' needs focus changing too rapidly for design process to complete. While it is often prudent avoid such projects, we argue can be beneficial engage them some cases as help refine collection, solving a "chicken egg" problem having tools analyze it. We found this case task parallel computing where tooling an open area research. Despite these hurdles, conducted study. Through tightly-coupled...

10.1109/tvcg.2019.2934285 article EN IEEE Transactions on Visualization and Computer Graphics 2019-01-01

Despite advancements in the areas of parallel and distributed computing, complexity programming on High Performance Computing (HPC) resources has deterred many domain experts, especially machine learning artificial intelligence (AI), from utilizing performance benefits such systems. Researchers scientists favor high-productivity languages to avoid inconvenience low-level costs acquiring necessary skills required for at this level. In recent years, Python, with support linear algebra...

10.1109/espm2.2018.00009 preprint EN 2018-11-01

Performance analysis is critical for pinpointing bottlenecks in parallel applications. Several profilers exist to instrument programs on HPC systems and gather performance data. Hatchet an open-source Python library that can read profiling output of several tools, enables the user perform a variety programmatic analyses hierarchical profiles. In this paper, we augment support new features: query language representing call path patterns be used filter calling context tree, visualization...

10.1109/hustprotools51951.2020.00013 article EN 2020-11-01

Two people looking at the same dataset will create different mental models, prioritize attributes, and connect with visualizations. We seek to understand space of data abstractions associated models how well communicate their when sketching. Data have a profound influence on visualization design, yet it's unclear universal they may be not initially influenced by representation. conducted study about from dataset. Rather than presenting tabular data, we presented each participant one three...

10.1145/3544548.3580669 article EN 2023-04-19

We describe an interactive computing environment called JetLag. JetLag implements the following features of Phylanx project: (1) Phylanx, a Python-based asynchronous array toolkit; (2) APEX performance measurement library; (3) visualization framework Traveler; (4) Tapis/Agave Science as Service middleware; and (6) container infrastructure that includes Docker-based Jupyter notebook for client singularity image server.

10.1145/3311790.3396657 article EN Practice and Experience in Advanced Research Computing 2020-07-22

Many data abstraction types, such as networks or set relationships, remain unfamiliar to workers beyond the visualization research community. We conduct a survey and series of interviews about how people describe their data, either directly indirectly. refer latter latent abstractions. Grounded Theory analysis that (1) interprets extent which abstractions exist, (2) reveals far-reaching effects interventionist pursuit can have on workers, (3) describes why when may resist explorations, (4)...

10.1109/tvcg.2020.3030355 article EN IEEE Transactions on Visualization and Computer Graphics 2020-10-30

Interactive visualization can support fluid exploration but is often limited to predetermined tasks. Scripting a vast range of queries may be more cumbersome for free-form exploration. Embedding interactive in scripting environments, such as computational notebooks, provides an opportunity leverage the strengths both direct manipulation and scripting. We investigate design methodology, choices, strategies under this paradigm through study calling context trees used performance analysis,...

10.1109/tvcg.2024.3354561 article EN IEEE Transactions on Visualization and Computer Graphics 2024-01-01

Interactive visualization via direct manipulation has inherent design trade-offs in flexibility, discoverability, and ease-of-use. Scripting languages can support a vast range of user queries tasks, but may be more cumbersome for free-form exploration. Embedding interactive scripting environment, such as computational notebook, provides an opportunity leveraging the strengths both scripting. We conduct study investigating this context calling trees used performance analysis parallel...

10.48550/arxiv.2205.04557 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Many data abstraction types, such as networks or set relationships, remain unfamiliar to workers beyond the visualization research community. We conduct a survey and series of interviews about how people describe their data, either directly indirectly. refer latter latent abstractions. Grounded Theory analysis that (1) interprets extent which abstractions exist, (2) reveals far-reaching effects interventionist pursuit can have on workers, (3) describes why when may resist explorations, (4)...

10.48550/arxiv.2005.04058 preprint EN other-oa arXiv (Cornell University) 2020-01-01
Coming Soon ...