NFDI4DS | UHH-SEMS - Publication Details

Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance

OPENALEX - Publications

Gagan Bansal Tongshuang Wu Joyce Zhou Raymond Fok Besmira Nushi and 3 more

Many researchers motivate explainable AI with studies showing that human-AI team performance on decision-making tasks improves when the explains its recommendations. However, prior observed improvements from explanations only AI, alone, outperformed both human and best team. Can help lead to complementary performance, where accuracy is higher than either or working solo? We conduct mixed-method user three datasets, an comparable humans helps participants solve a task (explaining itself in...

10.1145/3411764.3445717 article EN 2021-05-06

Beyond Accuracy: The Role of Mental Models in Human-AI Team Performance

OPENALEX - Publications

Gagan Bansal Besmira Nushi Ece Kamar Walter S. Lasecki Daniel S. Weld and 1 more

Decisions made by human-AI teams (e.g., AI-advised humans) are increasingly common in high-stakes domains such as healthcare, criminal justice, and finance. Achieving high team performance depends on more than just the accuracy of AI system: Since human may have different expertise, highest is often reached when they both know how to complement one another. We focus a factor that crucial supporting complementary: human’s mental model capabilities, specifically system’s error boundary (i.e....

10.1609/hcomp.v7i1.5285 article EN Proceedings of the AAAI Conference on Human Computation and Crowdsourcing 2019-10-28

Updates in Human-AI Teams: Understanding and Addressing the Performance/Compatibility Tradeoff

OPENALEX - Publications

Gagan Bansal Besmira Nushi Ece Kamar Daniel S. Weld Walter S. Lasecki and 1 more

AI systems are being deployed to support human decision making in high-stakes domains such as healthcare and criminal justice. In many cases, the form a team, which makes decisions after reviewing AI’s inferences. A successful partnership requires that develops insights into performance of system, including its failures. We study influence updates an system this setting. While can increase predictive performance, they may also lead behavioral changes at odds with user’s prior experiences...

10.1609/aaai.v33i01.33012429 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

The challenge of crafting intelligible intelligence

OPENALEX - Publications

Daniel S. Weld Gagan Bansal

To trust the behavior of complex AI algorithms, especially in mission-critical settings, they must be made intelligible.

10.1145/3282486 article EN Communications of the ACM 2019-05-21

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

OPENALEX - Publications

Qingyun Wu Gagan Bansal Jieyu Zhang Yiran Wu Shaokun Zhang and 5 more

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents can converse with each other accomplish tasks. are customizable, conversable, and operate in various modes employ combinations of LLMs, human inputs, tools. Using AutoGen, also flexibly define agent interaction behaviors. Both natural language computer code be used program flexible conversation patterns for different applications. serves as a generic infrastructure diverse complexities...

10.48550/arxiv.2308.08155 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Understanding the Role of Human Intuition on Reliance in Human-AI Decision-Making with Explanations

OPENALEX - Publications

Valerie Chen Q. Vera Liao Jennifer Wortman Vaughan Gagan Bansal

AI explanations are often mentioned as a way to improve human-AI decision-making, but empirical studies have not found consistent evidence of explanations' effectiveness and, on the contrary, suggest that they can increase overreliance when system is wrong. While many factors may affect reliance support, one important factor how decision-makers reconcile their own intuition---beliefs or heuristics, based prior knowledge, experience, pattern recognition, used make judgments---with information...

10.1145/3610219 article EN cc-by Proceedings of the ACM on Human-Computer Interaction 2023-09-28

Gmail Smart Compose

OPENALEX - Publications

Mia Xu Chen Benjamin N. Lee Gagan Bansal Yuan Cao Shuyuan Zhang and 7 more

In this paper, we present Smart Compose, a novel system for generating interactive, real-time suggestions in Gmail that assists users writing mails by reducing repetitive typing. the design and deployment of such large-scale complicated system, faced several challenges including model selection, performance evaluation, serving other practical issues. At core Compose is neural language model. We leveraged state-of-the-art machine learning techniques training which enabled high-quality...

10.1145/3292500.3330723 article EN 2019-07-25

Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork

OPENALEX - Publications

Gagan Bansal Besmira Nushi Ece Kamar Eric Horvitz Daniel S. Weld

AI practitioners typically strive to develop the most accurate systems, making an implicit assumption that system will function autonomously. However, in practice, systems often are used provide advice people domains ranging from criminal justice and finance healthcare. In such AI-advised decision making, humans machines form a team, where human is responsible for final decisions. But best teammate? We argue "not necessarily" --- predictable performance may be worth slight sacrifice...

10.1609/aaai.v35i13.17359 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

OPENALEX - Publications

Hussein Mozannar Gagan Bansal Adam Fourney Eric Horvitz

Code-recommendation systems, such as Copilot and CodeWhisperer, have the potential to improve programmer productivity by suggesting auto-completing code. However, fully realize their potential, we must understand how programmers interact with these systems identify ways that interaction. To seek insights about human-AI collaboration code recommendations studied GitHub Copilot, a code-recommendation system used millions of daily. We developed CUPS, taxonomy common activities when interacting...

10.1145/3613904.3641936 article EN cc-by 2024-05-11

Hierarchical Summarization: Scaling Up Multi-Document Summarization

OPENALEX - Publications

Janara Christensen Stephen Soderland Gagan Bansal Mausam Mausam

Multi-document summarization (MDS) systems have been designed for short, unstructured summaries of 10-15 documents, and are inadequate larger document collections. We propose a new approach to scaling up called hierarchical summarization, present the first implemented system, SUMMA. SUMMA produces hierarchy relatively short summaries, in which top level provides general overview users can navigate drill down more details on topics interest. optimizes coherence as well coverage salient...

10.3115/v1/p14-1085 article EN cc-by 2014-01-01

Gmail Smart Compose: Real-Time Assisted Writing

OPENALEX - Publications

Mia Xu Chen Benjamin N. Lee Gagan Bansal Yuan Cao Shuyuan Zhang and 7 more

In this paper, we present Smart Compose, a novel system for generating interactive, real-time suggestions in Gmail that assists users writing mails by reducing repetitive typing. the design and deployment of such large-scale complicated system, faced several challenges including model selection, performance evaluation, serving other practical issues. At core Compose is neural language model. We leveraged state-of-the-art machine learning techniques training which enabled high-quality...

10.48550/arxiv.1906.00080 preprint EN other-oa arXiv (Cornell University) 2019-01-01

A Coverage-Based Utility Model for Identifying Unknown Unknowns

OPENALEX - Publications

Gagan Bansal Daniel S. Weld

A classifier’s low confidence in prediction is often indicative of whether its will be wrong; this case, inputs are called known unknowns. In contrast, unknown unknowns (UUs) on which a classifier makes high mistake. Identifying UUs especially important safety-critical domains like medicine (diagnosis) and law (recidivism prediction). Previous work by Lakkaraju et al. (2017) identifying assumes that the utility each revealed UU independent others, rather than considering set holistically....

10.1609/aaai.v32i1.11493 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-25

Automatic number plate detection and recognition using YOLO world

OPENALEX - Publications

Vartika Agarwal Gagan Bansal

10.1016/j.compeleceng.2024.109646 article EN Computers & Electrical Engineering 2024-09-19

Emerging Perspectives in Human-Centered Machine Learning

OPENALEX - Publications

Gonzalo Ramos Jina Suh Soroush Ghorashi Christopher Meek Richard Banks and 4 more

Current Machine Learning (ML) models can make predictions that are as good or better than those made by people. The rapid adoption of this technology puts it at the forefront systems impact lives many, yet consequences not fully understood. Therefore, work intersection people's needs and ML is more relevant ever. This area work, dubbed Human-Centered (HCML), re-thinks research in terms human goals. HCML gathers an interdisciplinary group HCI practitioners, each bringing their unique, related...

10.1145/3290607.3299014 article EN 2019-04-30

Do Explanations Help Users Detect Errors in Open-Domain QA? An Evaluation of Spoken vs. Visual Explanations

OPENALEX - Publications

Ana Valeria González Gagan Bansal Angela Fan Yashar Mehdad Robin Jia and 1 more

While research on explaining predictions of open-domain QA systems (ODQA) is gaining momentum, most works do not evaluate whether these explanations improve user trust.Furthermore, many users interact with ODQA using voice-assistants, yet prior exclusively focus visual displays, risking (as we also show) incorrectly extrapolating the effectiveness across modalities.To better understand strategies in wild, conduct studies that measure help correctly decide when to accept or reject an system's...

10.18653/v1/2021.findings-acl.95 article EN cc-by 2021-01-01

Workshop on Trust and Reliance in AI-Human Teams (TRAIT)

OPENALEX - Publications

Gagan Bansal Zana Buçinca Kenneth Holstein Jessica Hullman Alison Smith and 2 more

As humans increasingly interact (and even collaborate) with AI systems during decision-making, creative exercises, and other tasks, appropriate trust reliance are necessary to ensure proper usage adoption of these systems. Specifically, people should understand when or rely on an algorithm's outputs override them. Significant research focus has aimed define measure in human-AI interaction, design implement interactions that promote calibrate trust. However, conceptualizing reliance,...

10.1145/3544549.3573831 article EN 2023-04-19

Workshop on Trust and Reliance in AI-Human Teams (TRAIT)

OPENALEX - Publications

Gagan Bansal Alison Smith Zana Buçinca Tongshuang Wu Kenneth Holstein and 2 more

As humans increasingly interact (and even collaborate) with AI systems during decision-making, creative exercises, and other tasks, appropriate trust reliance are necessary to ensure proper usage adoption of these systems. Specifically, people should understand when or rely on an algorithm's outputs override them. While significant research focus has aimed measure promote in human-AI interaction, the field lacks synthesized definitions understanding results across contexts. Indeed,...

10.1145/3491101.3503704 article EN CHI Conference on Human Factors in Computing Systems Extended Abstracts 2022-04-27

Secure and Automated Enterprise Revenue Forecasting

OPENALEX - Publications

Jocelyn Barker Amita Gajewar Konstantin Golyaev Gagan Bansal Matt Conners

Revenue forecasting is required by most enterprises for strategic business planning and providing expected future results to investors. However, revenue processes in companies are time-consuming error-prone as they performed manually hundreds of financial analysts. In this paper, we present a novel machine learning based solution that developed forecast 100% Microsoft's (around $85 Billion 2016), now deployed into production an end-to-end automated secure pipeline Azure. Our combines...

10.1609/aaai.v32i1.11385 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-27

Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA

OPENALEX - Publications

Ana Valeria González Gagan Bansal Angela Fan Robin Jia Yashar Mehdad and 1 more

While research on explaining predictions of open-domain QA systems (ODQA) to users is gaining momentum, most works have failed evaluate the extent which explanations improve user trust. few using studies, they employ settings that may deviate from end-user's usage in-the-wild: ODQA ubiquitous in voice-assistants, yet current only evaluates a visual display, and erroneously extrapolate conclusions about performant other modalities. To alleviate these issues, we conduct studies measure whether...

10.48550/arxiv.2012.15075 preprint EN cc-by-sa arXiv (Cornell University) 2020-01-01

Online Discovery of Group Level Events in Time Series

OPENALEX - Publications

X. Chelsea Chen Abdullah Mueen Vijay K. Narayanan Nikos Karampatziakis Gagan Bansal and 1 more

Previous chapter Next Full AccessProceedings Proceedings of the 2014 SIAM International Conference on Data Mining (SDM)Online Discovery Group Level Events in Time SeriesXi C. Chen, Abdullah Mueen, Vijay K Narayanan, Nikos Karampatziakis, Gagan Bansal, and Vipin KumarXi Kumarpp.632 - 640Chapter DOI:https://doi.org/10.1137/1.9781611973440.73PDFBibTexSections ToolsAdd to favoritesExport CitationTrack CitationsEmail SectionsAboutAbstract Recent advances high throughput data collection storage...

10.1137/1.9781611973440.73 article EN 2014-04-28

Moedor Cleaning Robot

OPENALEX - Publications

Anmol Taneja Gagan Bansal Rohil Setia N. Hema

With technologies moving par the normal human effort and thinking, humans try to integrate technology into every aspect of their lives. For a healthy nutritious lifestyle, cleanliness hygiene are one most important requirements. In this paper, we implemented automated cleaning system called Moedor Robot for indoor as well an outdoor application such office, corridor, garden, room, etc. metropolitan cities people forced work long duration sustain city life expenses. situation, will look...

10.1109/ic3.2018.8530503 article EN 2018-08-01