- Data Visualization and Analytics
- Complex Network Analysis Techniques
- Virtual Reality Applications and Impacts
- Scientific Computing and Data Management
- Advanced Text Analysis Techniques
- Image and Video Quality Assessment
- Multimedia Communication and Technology
- Topological and Geometric Data Analysis
- Peer-to-Peer Network Technologies
- Data Management and Algorithms
- Mental Health Research Topics
- Data Quality and Management
- Data Analysis with R
- Human Mobility and Location-Based Analysis
- Graph Theory and Algorithms
- Online Learning and Analytics
- Advanced Database Systems and Queries
- Species Distribution and Climate Change
- Time Series Analysis and Forecasting
- Functional Brain Connectivity Studies
- Computer Graphics and Visualization Techniques
- Cloud Computing and Resource Management
- Topic Modeling
- Interactive and Immersive Displays
- Technology Use by Older Adults
Northeastern University
2017-2025
Universidad del Noreste
2019-2023
Linköping University
2022
Masaryk University
2022
University of Massachusetts Amherst
2022
Georgia Institute of Technology
2020
John Brown University
2020
Prescott Center for the Arts
2019
IBM (United States)
2014-2016
University of Maryland, College Park
2008-2014
We present NodeXL, an extendible toolkit for network overview, discovery and exploration implemented as an add-in to the Microsoft Excel 2007 spreadsheet software. We demonstrate NodeXL data analysis and visualization features with a social media data sample drawn from an enterprise intranet social network. A sequence of NodeXL operations from data import to computation of network statistics and refinement of network visualization through sorting, filtering, and clustering functions is described. These operations reveal sociologically relevant differences in the patterns of interconnection among...
Immersive Analytics is a quickly evolving field that unites several areas such as visualisation, immersive environments, and human-computer interaction to support human data analysis with emerging technologies. This research has thrived over the past years with multiple workshops, seminars, and a growing body of publications, spanning several conferences. Given the rapid advancement of technologies and novel application domains, this paper aims toward a broader agenda to enable widespread adoption. We present 17 key...
Sentiment analysis often relies on a semantic orientation lexicon of positive and negative words. A number of approaches have been proposed for creating such lexicons, but they tend to be computationally expensive, and usually rely on significant manual annotation and large corpora. Most of these methods use WordNet. In contrast, we propose a simple approach to generate a high-coverage semantic orientation lexicon, which includes both individual words and multi-word expressions, using only a Roget-like thesaurus and a handful of affixes. Further, the...
Analyzing networks involves understanding the complex relationships between entities, as well as any attributes they may have. The widely used node-link diagrams excel at this task, but many are difficult to extract meaning from because of inherent complexity and limited screen space. To help address this problem we introduce a technique called motif simplification, in which common patterns of nodes and links are replaced with compact and meaningful glyphs. Well-designed glyphs have several benefits: (1) they require...
Exploring large network datasets, such as scientific collaboration networks, is challenging because they often contain a large number of nodes and edges in several types and with multiple attributes. Analyses of such networks are often long and complex, and may require several sessions by multiple users. Therefore, it is often difficult for users to recall their own exploration history or share it with others. We introduce GraphTrail, an interactive visualization for analyzing networks through node and edge aggregates that captures users' interactions and integrates this...
Type 1 diabetes is a chronic, incurable autoimmune disease affecting millions of Americans in which the body stops producing insulin and blood glucose levels rise. The goal of intensive diabetes management is to lower average blood glucose through frequent adjustments to insulin protocol, diet, and behavior. Manual logs and medical device data are collected by patients, but these multiple sources are presented in disparate visualization designs to the clinician, making temporal inference difficult. We conducted a design study over 18 months with...
Keeping up with rapidly growing research fields, especially when there are multiple interdisciplinary sources, requires substantial effort for researchers, program managers, or venture capital investors. Current theories and tools are directed at finding a paper or website, not at gaining an understanding of the key papers, authors, controversies, and hypotheses. This report presents an effort to integrate statistics, text analytics, and visualization in a coordinated window environment that supports exploration. Our...
Virtual reality (VR) researchers struggle to conduct remote studies. Previous work has focused on working around limitations imposed by traditional crowdsourcing methods. However, the potential for leveraging social VR platforms for HCI evaluations is largely unexplored. These platforms have large VR-ready user populations, distributed synchronous virtual environments, and support for user-generated content. We demonstrate how social VR platforms can be used to practically and ethically produce valid research results by replicating two...
Layered network visualizations assign each node to one of several parallel axes. They can convey sequence or flow data, hierarchies, or multiple data classes, but edge crossings and long edges often impair readability. Layout algorithms can reduce edge crossings and shorten long edges using quick heuristics or optimal methods that prioritize human readability over computation speed. This work uses an optimization metaheuristic to provide the best of both worlds: high-quality layouts within a predetermined execution time. Our adaptation...
Machine learning practitioners often need to compare multiple models to select the best one for their application. However, current methods of comparing models fall short because they rely on aggregate metrics that can be difficult to interpret or do not provide enough information to understand the differences between models. To better support the comparison of models, we propose a set of visualizations of model outputs to enable easier model-to-model comparison. We outline the requirements for using sets of visualizations to compare machine learning models and demonstrate how...
Social Network Analysis (SNA) has evolved as a popular, standard method for modeling meaningful, often hidden structural relationships in communities. Existing SNA tools often involve extensive pre-processing or intensive programming skills that can challenge practitioners and students alike. NodeXL, an open-source template for Microsoft Excel, integrates a library of common network metrics and graph layout algorithms within the familiar spreadsheet format, offering a potentially low-barrier-to-entry...
Analyzing network data can provide valuable insights in many diverse fields. However, designing node-link visualizations that effectively communicate the underlying data is challenging, as for every network there are potentially unintelligible or even misleading layouts. Automated layout algorithms have helped, but frequently generate ineffective visualizations. In order to build awareness of effective visualization strategies, we detail new global readability metrics on a [0,1] continuous scale for node-node...
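As an illustration of what a readability metric on a [0,1] scale can look like, here is a minimal Python sketch for edge crossings in a straight-line node-link layout. The normalization (crossings divided by the number of edge pairs that share no endpoint) and the names `edge_crossing_readability`, `positions`, and `edges` are assumptions made for this example; the truncated abstract does not give the authors' exact definitions.

```python
from itertools import combinations


def _orient(p, q, r):
    """Sign of the cross product (q - p) x (r - p): -1, 0, or 1."""
    v = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (v > 0) - (v < 0)


def _segments_cross(a, b, c, d):
    """True if segments ab and cd properly intersect (collinear overlap is ignored)."""
    return (_orient(a, b, c) * _orient(a, b, d) < 0
            and _orient(c, d, a) * _orient(c, d, b) < 0)


def edge_crossing_readability(positions, edges):
    """Return a score in [0, 1]; 1.0 means no crossings at all.

    positions: dict mapping node id -> (x, y)
    edges: list of (u, v) node-id pairs drawn as straight lines
    Assumed normalization: 1 - crossings / (edge pairs sharing no endpoint).
    """
    crossings = 0
    candidate_pairs = 0
    for (u1, v1), (u2, v2) in combinations(edges, 2):
        if {u1, v1} & {u2, v2}:
            continue  # edges meeting at a shared node cannot properly cross
        candidate_pairs += 1
        if _segments_cross(positions[u1], positions[v1],
                           positions[u2], positions[v2]):
            crossings += 1
    if candidate_pairs == 0:
        return 1.0
    return 1.0 - crossings / candidate_pairs


# Hypothetical layout: four nodes on the corners of a unit square.
square = {"a": (0, 0), "b": (1, 0), "c": (1, 1), "d": (0, 1)}
print(edge_crossing_readability(square, [("a", "c"), ("b", "d")]))  # diagonals cross
print(edge_crossing_readability(square, [("a", "b"), ("c", "d")]))  # parallel sides
```

The pairwise check is quadratic in the number of edges, which is fine for small graphs; a sweep-line approach would be a natural replacement for larger ones.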
Research involving Virtual Reality (VR) headsets is becoming more and more popular. However, scaling VR experiments is challenging as researchers are often limited to using one or a small number of headsets for in-lab studies. One general way to scale experiments is through crowdsourcing so as to have access to a large pool of diverse participants with relatively little expense of time and money. Unfortunately, there is no easy way to crowdsource VR experiments. We demonstrate that it is possible to implement and run crowdsourced VR experiments using a pre-existing massively multiplayer...
Many collaborative data analysis situations benefit from collaborators utilizing different platforms. However, maintaining group awareness between team members using diverging devices is difficult, not least because common ground diminishes. A person using head-mounted VR cannot physically see a user on a desktop computer even while co-located, and the desktop user cannot easily relate to the VR user's 3D workspace. To address this, we propose the "eyes-and-shoes" principles for group awareness and abstract them into four levels of techniques....
Social media, particularly Twitter, provides an abundance of real-time data. To account for this volume, researchers often use automated analysis and visualization techniques to produce a high-level overview of a Twitter stream. Existing techniques for understanding Twitter data make use of hashtags or word-pairs and may ignore the complex trends in discussions over time. To remedy this, we present an application of statistical topic modeling and alignment (binned topic models) to group related tweets into automatically generated topics and TopicFlow,...
Understanding the meaning of existing SQL queries is critical for code maintenance and reuse. Yet SQL can be hard to read, even for expert users or the original creator of a query. We conjecture that it is possible to capture the logical intent of queries in automatically-generated visual diagrams that can help users understand the meaning of queries faster and more accurately than SQL text alone. We present initial steps in that direction with visual diagrams that are based on a first-order logic foundation and can capture the meaning of deeply nested queries. Our diagrams build upon a rich history of diagrammatic reasoning systems...
Temporal event sequence alignment has been used in many domains to visualize nuanced changes and interactions over time. Existing approaches align one or two sentinel events. Overview tasks require examining all alignments of interest using interaction and time or juxtaposition of visualizations. Furthermore, any event attribute overviews are not closely tied to sequence visualizations. We present SEQUENCE BRAIDING, a novel overview visualization for temporal event sequences and attributes using a layered directed acyclic network. SEQUENCE BRAIDING visually...
Node-link visualizations are a familiar and powerful tool for displaying the relationships in a network. The readability of these visualizations highly depends on the spatial layout used for the nodes. In this paper, we focus on computing layered layouts, in which nodes are aligned on a set of parallel axes to better expose hierarchical or sequential relationships. Heuristic-based layouts are widely used as they scale well to larger networks and usually create readable, albeit sub-optimal, visualizations. We instead use an optimization model that...
Increasingly, visualization practitioners are working with, using, and studying private and sensitive data. There can be many stakeholders interested in the resulting analyses, but widespread sharing of the data can cause harm to individuals, companies, and organizations. Practitioners are increasingly turning to differential privacy to enable public data sharing with a guaranteed amount of privacy. Differential privacy algorithms do this by aggregating data statistics with noise, and this now-private data can be released visually with differentially private scatterplots. While visual...
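The idea of aggregating statistics with noise can be sketched in Python as follows. This is a minimal example under stated assumptions: it bins points into a grid and applies the standard Laplace mechanism to each bin count, assuming that adding or removing one record changes one count by at most 1. The abstract does not say which algorithms the paper evaluates, and `private_binned_counts` and its parameters are hypothetical names chosen for illustration.

```python
import random


def private_binned_counts(points, bins, epsilon,
                          x_range=(0.0, 1.0), y_range=(0.0, 1.0), rng=None):
    """Bin 2D points into a bins x bins grid and add Laplace noise to each count.

    Assumes sensitivity 1 (one record affects one bin by at most 1), so the
    Laplace scale is 1 / epsilon. Points outside the ranges are ignored.
    """
    rng = rng or random.Random()
    (x0, x1), (y0, y1) = x_range, y_range
    counts = [[0] * bins for _ in range(bins)]
    for x, y in points:
        if not (x0 <= x <= x1 and y0 <= y <= y1):
            continue
        # Clamp so that points on the upper boundary fall in the last bin.
        i = min(int((x - x0) / (x1 - x0) * bins), bins - 1)
        j = min(int((y - y0) / (y1 - y0) * bins), bins - 1)
        counts[j][i] += 1
    rate = epsilon  # exponential rate = 1 / Laplace scale = epsilon / sensitivity
    # The difference of two i.i.d. exponentials is a Laplace(0, 1/rate) sample.
    return [[c + rng.expovariate(rate) - rng.expovariate(rate) for c in row]
            for row in counts]


# Hypothetical data: three points in the unit square, 2 x 2 grid, epsilon = 1.
noisy = private_binned_counts([(0.1, 0.1), (0.2, 0.3), (0.9, 0.9)],
                              bins=2, epsilon=1.0, rng=random.Random(0))
for row in noisy:
    print([round(v, 2) for v in row])
```

The noisy counts may be negative or fractional; a renderer would typically clamp or round them before mapping them to color, at which point the output is a heatmap-like view rather than individual marks.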
Visualizations of static networks in the form of node-link diagrams have evolved rapidly, though researchers are still grappling with how best to show evolution of nodes over time in these diagrams. This paper introduces NetVisia, a social network visualization system designed to support users in exploring temporal evolution in networks by using heat maps to display node attribute changes over time. NetVisia's novel contributions to network visualizations are to (1) cluster nodes in the heat map by similar metric values instead of topological similarity, and (2) align nodes by events....
Comparing relational languages by their logical expressiveness is well understood. Less well understood is how to compare relational languages by their ability to represent relational query patterns. Indeed, what are query patterns other than "a certain way of writing a query"? And how can query patterns be defined across procedural and declarative languages, irrespective of their syntax? To the best of our knowledge, we provide the first semantic definition of relational query patterns by using a variant of structure-preserving mappings between the relational tables of queries. This formalism allows us to analyze the relative pattern expressiveness of relational language...