Gromit Yeuk-Yin Chan

ORCID: 0000-0003-1356-4406
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Data Visualization and Analytics
  • Explainable Artificial Intelligence (XAI)
  • Scientific Computing and Data Management
  • Stroke Rehabilitation and Recovery
  • Image Retrieval and Classification Techniques
  • Time Series Analysis and Forecasting
  • Topic Modeling
  • Data Management and Algorithms
  • Innovative Human-Technology Interaction
  • Geographic Information Systems Studies
  • Multimedia Communication and Technology
  • Data Analysis with R
  • Natural Language Processing Techniques
  • Video Analysis and Summarization
  • Action Observation and Synchronization
  • Context-Aware Activity Recognition Systems
  • Nerve Injury and Rehabilitation
  • Recommender Systems and Techniques
  • Radiation Effects in Electronics
  • Biomedical Text Mining and Ontologies
  • Advanced Graph Neural Networks
  • Computer Graphics and Visualization Techniques
  • Data Quality and Management
  • Advanced Database Systems and Queries
  • Spinal Cord Injury Research

Adobe Systems (United States)
2023-2025

New York University
2017-2024

Hong Kong University of Science and Technology
2016

University of Hong Kong
2016

To design a successful Multiplayer Online Battle Arena (MOBA) game, the ratio of <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">snowballing</i> and comeback occurrences to all matches played must be maintained at certain level ensure its fairness engagement. Although it is easy identify these two types occurrences, game developers often find difficult determine their causes triggers with so many choices parameters involved. In addition, huge...

10.1109/tvcg.2016.2598415 article EN IEEE Transactions on Visualization and Computer Graphics 2016-08-05

The message a designer wants to convey plays pivotal role in directing the design of an infographic, yet most authoring workflows start with creating visualizations or graphics first without gauging whether they fit message. To address this gap, we propose Epigraphics, web-based system that treats "epigraph" as first-class object, and uses it guide infographic asset creation, editing, syncing. text-based recommend visualizations, graphics, data filters, color palettes, animations. It further...

10.1145/3613904.3642172 preprint EN cc-by-nc 2024-05-11

Visual data stories can effectively convey insights from data, yet their creation often necessitates intricate exploration, insight discovery, narrative organization, and customization to meet the communication objectives of storyteller. Existing automated storytelling techniques, however, tend overlook importance user during story authoring process, limiting system's ability create tailored narratives that reflect user's intentions. We present a novel generation workflow leverages adaptive...

10.1109/tvcg.2023.3327363 article EN IEEE Transactions on Visualization and Computer Graphics 2023-01-01

Bipartite graphs model the key relations in many large scale real-world data: customers purchasing items, legislators voting for bills, people's affiliation with different social groups, faults occurring vehicles, etc. However, it is challenging to visualize bipartite tens of thousands or even more nodes edges. In this paper, we propose a novel visual summarization technique based on minimum description length (MDL) principle. The method simultaneously groups two set and constructs...

10.1109/tvcg.2018.2864826 article EN IEEE Transactions on Visualization and Computer Graphics 2018-08-20

The brachial plexus is a complex network of peripheral nerves that enables sensing from and control the movements arms hand. Nowadays, coordination between muscles to generate simple still not well understood, hindering knowledge how best treat patients with this type nerve injury. To acquire enough information for medical data analysis, physicians conduct motion analysis assessments produce rich dataset electromyographic signals multiple recorded joint during real-world tasks. However,...

10.1109/tvcg.2019.2934280 article EN IEEE Transactions on Visualization and Computer Graphics 2019-08-22

With the increasing sophistication of machine learning models, there are growing trends developing model explanation techniques that focus on only one instance (local explanation) to ensure faithfulness original model. While these provide accurate interpretability various data primitive (e.g., tabular, image, or text), a holistic Explainable Artificial Intelligence (XAI) experience also requires global and dataset enable sensemaking in different granularity. Thus, is vast potential...

10.48550/arxiv.2007.10614 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Understanding the interpretation of machine learning (ML) models has been paramount importance when making decisions with societal impacts, such as transport control, financial activities, and medical diagnosis. While local explanation techniques are popular methods to interpret ML on a single instance, they do not scale understanding model's behavior whole dataset. In this article, we outline challenges needs visually analyzing explanations propose SUBPLEX, visual analytics approach help...

10.1109/mcg.2022.3199727 article EN IEEE Computer Graphics and Applications 2022-08-25

Understanding the interpretation of machine learning (ML) models has been paramount importance when making decisions with societal impacts such as transport control, financial activities, and medical diagnosis. While current model methodologies focus on using locally linear functions to approximate or creating self-explanatory that give explanations each input instance, they do not at subpopulation level, which is understanding interpretations across different subset aggregations in a...

10.48550/arxiv.2007.10609 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, C Lee Giles, Ting-Hao Huang. Proceedings of the 16th International Natural Language Generation Conference. 2023.

10.18653/v1/2023.inlg-main.6 article EN cc-by 2023-01-01

When reading financial news, although there are critics explaining the fluctuation of Economic indexes in articles everyday, news often assert bias on authors' favorite opinions. On other hand, amount published these days is staggering with diversified opinions same issues. Unless acumen for whole environment, audience will find hard to stay objective and identify useful information from mass media. If computer granted ability analyze all generate a narrative that addresses concerns related...

10.1109/bigcomp.2016.7425798 article EN 2016-01-01

The Data Polygamy framework allows users to uncover interesting patterns and interactions in the data exhaust from different components of an urban environment. But analyzing plethora relationships derived by is challenging. In this demo, we show how visualization can help discovery that are potentially allowing query explore relationship set intuitive way. We will demonstrate effectiveness visual interface through case studies, demo visitors also interact with polygamous relationships.

10.1145/3035918.3058741 article EN 2017-05-09

Online visitor behaviors are often modeled as a large sparse matrix, where rows represent visitors and columns behavior. To discover customer segments with different hierarchies, marketers need to cluster the data in splits. Such analyses require clustering algorithm provide real-time responses on user parameter changes, which current techniques cannot support. In this paper, we propose algorithm, density peaks, for large-scale data. It pre-processes input points compute annotations...

10.1145/3366423.3380183 article EN 2020-04-20

With the increasing use of black-box Machine Learning (ML) techniques in critical applications, there is a growing demand for methods that can provide transparency and accountability model predictions. As result, large number local explainability models have been developed popularized. However, machine learning explanations are still hard to evaluate compare due high dimensionality, heterogeneous representations, varying scales, stochastic nature some these methods. Topological Data Analysis...

10.1109/tvcg.2024.3418653 article EN IEEE Transactions on Visualization and Computer Graphics 2024-06-24

With the increasing use of black-box Machine Learning (ML) techniques in critical applications, there is a growing demand for methods that can provide transparency and accountability model predictions. As result, large number local explainability models have been developed popularized. However, machine learning explanations are still hard to evaluate compare due high dimensionality, heterogeneous representations, varying scales, stochastic nature some these methods. Topological Data Analysis...

10.48550/arxiv.2406.15613 preprint EN arXiv (Cornell University) 2024-06-21

Augmented Reality assistance are increasingly popular for supporting users with tasks like assembly and cooking. However, current practice typically provide reactive responses initialized from user requests, lacking consideration of rich contextual user-specific information. To address this limitation, we propose a novel AR system, Satori, that models both states environmental contexts to deliver proactive guidance. Our system combines the Belief-Desire-Intention (BDI) model state-of-the-art...

10.48550/arxiv.2410.16668 preprint EN arXiv (Cornell University) 2024-10-21

Ensuring large language models' (LLMs) responses align with prompt instructions is crucial for application development. Based on our formative study industry professionals, the alignment requires heavy human involvement and tedious trial-and-error especially when there are many in prompt. To address these challenges, we introduce CoPrompter, a framework that identifies misalignment based assessing multiple LLM criteria. It proposes method to generate evaluation criteria questions derived...

10.48550/arxiv.2411.06099 preprint EN arXiv (Cornell University) 2024-11-09

Online marketing platforms often store millions of website visitors' behavior as a large sparse matrix with rows visitors and columns behavior. These allow marketers to conduct Audience Expansion, technique identify new audiences similar the original target audiences. In this paper, we propose method achieve interactive Expansion from visitor data efficiently. Unlike other methods that undergo significant computations upon inputs, our approach provides responses when marketer inputs...

10.1145/3447548.3467179 article EN 2021-08-13

Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid writers by providing good starting that can be refined for better quality. Prior work treated as a vision-to-language task. In this paper, we show it more effectively tackled text summarization task in documents. We fine-tuned PEGASUS, pre-trained abstractive model, to specifically summarize...

10.48550/arxiv.2302.12324 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Detecting structures and components in business emails is vital for editor software to convert third-party so that designers can edit them without needing know how HTML works. In a production environment, the challenge make model easy be understood maintained by different stakeholders. We propose detecting email with collection of constraints written Answer Set Programming (ASP). Hard detect well-defined like layouts, soft incorporate ML custom buttons titles emails. Using constraints,...

10.1145/3544549.3585714 article EN 2023-04-19

Given a large corpus of HTML-based emails (or websites, posters, documents) collected from the web, how can we train model capable learning such rich heterogeneous data for style recommendation tasks as recommending useful design styles or suggesting alternative HTML designs? To address this new task, first decompose each document in into sequence smaller fragments where fragment may consist set entities buttons, images, textual content (titles, paragraphs) and stylistic background-style,...

10.1145/3543873.3587300 article EN 2023-04-28

The ability to extract insights from large amounts of data in a timely manner is crucial problem. Exploratory Data Analysis (EDA) commonly used by analysts uncover using sequence SQL commands and associated visualizations. However, many cases, this process carried out non-programmers who must work within tight time constraints, such as marketing campaign where marketer quickly analyse reach target revenue. This paper presents ApproxEDA - system that combines natural language processing (NLP)...

10.1145/3555041.3589724 article EN 2023-06-04
Coming Soon ...