NFDI4DS | UHH-SEMS - Publication Details

Anca D. Dragan

ORCID: 0000-0001-6312-5466

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5005997281

Research Areas

Reinforcement Learning in Robotics
Robot Manipulation and Learning
Social Robot Interaction and HRI
Adversarial Robustness in Machine Learning
Autonomous Vehicle Technology and Safety
Explainable Artificial Intelligence (XAI)
Robotic Path Planning Algorithms
Human-Automation Interaction and Safety
Ethics and Social Impacts of AI
Machine Learning and Algorithms
AI-based Problem Solving and Planning
Anomaly Detection Techniques and Applications
Topic Modeling
Natural Language Processing Techniques
Human Pose and Action Recognition
EEG and Brain-Computer Interfaces
Advanced Bandit Algorithms Research
Decision-Making and Behavioral Economics
Machine Learning and Data Classification
Gaze Tracking and Assistive Technology
Domain Adaptation and Few-Shot Learning
Data Stream Mining Techniques
Tactile and Sensory Interactions
Multimodal Machine Learning Applications
Bayesian Modeling and Causal Inference

University of California, Berkeley
2016-2025

Berkeley College
2018-2024

Carnegie Mellon University
2011-2022

South China University of Technology
2018

Stanford University
2018

Bangladesh University of Engineering and Technology
2016

Fraunhofer Institute for Industrial Mathematics
2009

CHOMP: Covariant Hamiltonian optimization for motion planning

OPENALEX - Publications

Matt Zucker Nathan Ratliff Anca D. Dragan Mihail Pivtoraiko Matthew Klingensmith and 3 more

In this paper, we present CHOMP (covariant Hamiltonian optimization for motion planning), a method trajectory invariant to reparametrization. uses functional gradient techniques iteratively improve the quality of an initial trajectory, optimizing that trades off between smoothness and obstacle avoidance component. can be used locally optimize feasible trajectories, as well solve planning queries, converging low-cost trajectories even when initialized with infeasible ones. It Monte Carlo...

10.1177/0278364913488805 article EN The International Journal of Robotics Research 2013-08-01

Legibility and predictability of robot motion

OPENALEX - Publications

Anca D. Dragan Kenton C.T. Lee Siddhartha S Srinivasa

A key requirement for seamless human-robot collaboration is the robot to make its intentions clear human collaborator. collaborative robot's motion must be legible, or intent-expressive. Legibility often described in literature as and effect of predictable, unsurprising, expected motion. Our central insight that predictability legibility are fundamentally different contradictory properties We develop a formalism mathematically define distinguish formalize two based on inferences between...

10.1109/hri.2013.6483603 article EN 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI) 2013-03-01

Planning for Autonomous Cars that Leverage Effects on Human Actions

OPENALEX - Publications

Dorsa Sadigh S. Shankar Sastry Sanjit A. Seshia Anca D. Dragan

Traditionally, autonomous cars make predictions about other drivers' future trajectories, and plan to stay out of their way.This tends result in defensive opaque behaviors.Our key insight is that an car's actions will actually affect what do response, whether the car aware it or not.Our thesis we can leverage these responses more efficient communicative behaviors.We model interaction between a human driver as dynamical system, which robot's have immediate consequences on state car, but also...

10.15607/rss.2016.xii.029 article EN 2016-06-27

Legibility and predictability of robot motion

OPENALEX - Publications

Anca D. Dragan Kenton C.T. Lee Siddhartha S Srinivasa

10.5555/2447556.2447672 article EN Human-Robot Interaction 2013-03-03

A policy-blending formalism for shared control

OPENALEX - Publications

Anca D. Dragan Siddhartha S Srinivasa

In shared control teleoperation, the robot assists user in accomplishing desired task, making teleoperation easier and more seamless. Rather than simply executing user’s input, which is hindered by inadequacies of interface, attempts to predict intent, it. this work, we are interested scientific underpinnings assistance: propose an intuitive formalism that captures assistance as policy blending, illustrate how some existing techniques for instantiate it, provide a principled analysis its...

10.1177/0278364913490324 article EN The International Journal of Robotics Research 2013-06-01

Cooperative Inverse Reinforcement Learning

OPENALEX - Publications

Dylan Hadfield-Menell Anca D. Dragan Pieter Abbeel Stuart Russell

For an autonomous system to be helpful humans and pose no unwarranted risks, it needs align its values with those of the in environment such a way that actions contribute maximization value for humans. We propose formal definition alignment problem as cooperative inverse reinforcement learning (CIRL). A CIRL is cooperative, partial-information game two agents, human robot; both are rewarded according human's reward function, but robot does not initially know what this is. In contrast...

10.48550/arxiv.1606.03137 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Active Preference-Based Learning of Reward Functions

OPENALEX - Publications

Dorsa Sadigh Anca D. Dragan S. Shankar Sastry Sanjit A. Seshia

Our goal is to efficiently learn reward functions encoding a human's preferences for how dynamical system should act.There are two challenges with this.First, in many problems it difficult people provide demonstrations of the desired trajectory (like high-DOF robot arm motion or an aggressive driving maneuver), even assign much numerical action get.We build on work label ranking and propose from (or comparisons) instead: person provides relative preference between trajectories.Second,...

10.15607/rss.2017.xiii.053 article EN 2017-07-12

Towards Seamless Human-Robot Handovers

OPENALEX - Publications

Kyle Strabala Min Kyung Lee Anca D. Dragan Jodi Forlizzi Siddhartha S Srinivasa and 2 more

A handover is a complex collaboration, where actors coordinate in time and space to transfer control of an object. This coordination comprises two processes: the physical process moving get close enough object, cognitive exchanging information guide transfer. Despite this complexity, we humans are capable performing handovers seamlessly wide variety situations, even when unexpected. suggests common procedure that guides all interactions. Our goal codify procedure.

10.5898/jhri.2.1.strabala article EN Journal of Human-Robot Interaction 2013-03-01

Effects of Robot Motion on Human-Robot Collaboration

OPENALEX - Publications

Anca D. Dragan Shira Bauman Jodi Forlizzi Siddhartha S Srinivasa

Most motion in robotics is purely functional, planned to achieve the goal and avoid collisions. Such great isolation, but collaboration affords a human who watching making inferences about it, trying coordinate with robot task. This paper analyzes benefit of planning that explicitly enables collaborator's on success physical collaboration, as measured by both objective subjective metrics. Results suggest legible motion, clearly express robot's intent, leads more fluent collaborations than...

10.1145/2696454.2696473 article EN 2015-03-02

Hierarchical Game-Theoretic Planning for Autonomous Vehicles

OPENALEX - Publications

Jaime F. Fisac Eli Bronstein Elis Stefansson Dorsa Sadigh S. Shankar Sastry and 1 more

The actions of an autonomous vehicle on the road affect and are affected by those other drivers, whether overtaking, negotiating a merge, or avoiding accident. This mutual dependence, best captured dynamic game theory, creates strong coupling between vehicle's planning its predictions drivers' behavior, constitutes open problem with direct implications safety viability driving technology. Unfortunately, games too computationally demanding to meet real-time constraints in continuous state...

10.1109/icra.2019.8794007 article EN 2022 International Conference on Robotics and Automation (ICRA) 2019-05-01

Information gathering actions over human internal state

OPENALEX - Publications

Dorsa Sadigh S. Shankar Sastry Sanjit A. Seshia Anca D. Dragan

Much of estimation human internal state (goal, intentions, activities, preferences, etc.) is passive: an algorithm observes actions and updates its estimate state. In this work, we embrace the fact that robot affect what humans do, leverage it to improve estimation. We enable robots do active information gathering, by planning probe user in order clarify their For instance, autonomous car will plan nudge into a driver's lane test driving style. Results simulation study suggest gathering...

10.1109/iros.2016.7759036 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2016-10-01

Planning for cars that coordinate with people: leveraging effects on human actions for planning and active information gathering over human internal state

OPENALEX - Publications

Dorsa Sadigh Nick Landolfi S. Shankar Sastry Sanjit A. Seshia Anca D. Dragan

10.1007/s10514-018-9746-1 article EN Autonomous Robots 2018-05-04

Managing extreme AI risks amid rapid progress

OPENALEX - Publications

Yoshua Bengio Geoffrey E. Hinton Andrew Chi-Chih Yao Dawn Song Pieter Abbeel and 20 more

Preparation requires technical research and development, as well adaptive, proactive governance

10.1126/science.adn0117 article EN Science 2024-05-20

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

OPENALEX - Publications

Stephen T. Casper Xander Davies Claudia Shi Thomas Krendl Gilbert Jérémy Scheurer and 27 more

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with goals. RLHF has emerged as the central method used finetune state-of-the-art large language models (LLMs). Despite this popularity, there been relatively little public work systematizing its flaws. In paper, we (1) survey open problems and fundamental limitations of related methods; (2) overview techniques understand, improve, complement in practice; (3) propose auditing disclosure...

10.48550/arxiv.2307.15217 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Formalizing Assistive Teleoperation

OPENALEX - Publications

Anca D. Dragan Siddhartha S Srinivasa

In assistive teleoperation, the robot helps user accomplish desired task, making teleoperation easier and more seamless.Rather than simply executing user's input, which is hindered by inadequacies of interface, attempts to predict intent, assists in accomplishing it.In this work, we are interested scientific underpinnings assistance: formalize assistance under general framework policy blending, show how previous work methods instantiate formalism, provide a principled analysis its main...

10.15607/rss.2012.viii.010 article EN 2012-07-09

The Social Cost of Strategic Classification

OPENALEX - Publications

Smitha Milli John P. Miller Anca D. Dragan Moritz Hardt

Consequential decision-making typically incentivizes individuals to behave strategically, tailoring their behavior the specifics of decision rule. A long line work has therefore sought counteract strategic by designing more conservative boundaries in an effort increase robustness effects covariate shift.

10.1145/3287560.3287576 article EN 2019-01-09

Shared Autonomy via Deep Reinforcement Learning

OPENALEX - Publications

Siddharth Reddy Anca D. Dragan Sergey Levine

In shared autonomy, user input is combined with semi-autonomous control to achieve a common goal.The goal often unknown ex-ante, so prior work enables agents infer the from and assist task.Such methods tend assume some combination of knowledge dynamics environment, user's policy given their goal, set possible goals might target, which limits application real-world scenarios.We propose deep reinforcement learning framework for model-free autonomy that lifts these assumptions.We use...

10.15607/rss.2018.xiv.005 article EN 2018-06-26

Model Reconstruction from Model Explanations

OPENALEX - Publications

Smitha Milli Ludwig Schmidt Anca D. Dragan Moritz Hardt

We show through theory and experiment that gradient-based explanations of a model quickly reveal the itself. Our results speak to tension between desire keep proprietary secret ability offer explanations.

10.1145/3287560.3287562 article EN 2019-01-09

Do You Want Your Autonomous Car To Drive Like You?

OPENALEX - Publications

Chandrayee Basu Qian Yang David Hungerman Mukesh Singhal Anca D. Dragan

With progress in enabling autonomous cars to drive safely on the road, it is time start asking how they should be driving. A common answer that adopting their users' driving style. This makes assumption users want like - aggressive drivers cars, defensive cars. In this paper, we put test. We find tend prefer a significantly more style than own. Interestingly, think own, even though actual tends aggressive. also preferences do depend specific scenario, opening door for new ways of learning preference.

10.1145/2909824.3020250 article EN 2017-03-01

Herb 2.0: Lessons Learned From Developing a Mobile Manipulator for the Home

OPENALEX - Publications

Siddhartha S Srinivasa Dmitry Berenson Maya Çakmak Alvaro Collet Mehmet R. Doğar and 6 more

We present the hardware design, software architecture, and core algorithms of Herb 2.0, a bimanual mobile manipulator developed at Personal Robotics Lab Carnegie Mellon University, Pittsburgh, PA. have 2.0 to perform useful tasks for with people in human environments. exploit two key paradigms environments: that they structure robot can learn, adapt exploit, demand general-purpose capability robotic systems. In this paper, we reveal some everyday environments been able harness manipulation...

10.1109/jproc.2012.2200561 article EN Proceedings of the IEEE 2012-06-21

Efficient Iterative Linear-Quadratic Approximations for Nonlinear Multi-Player General-Sum Differential Games

OPENALEX - Publications

David Fridovich-Keil Ellis Ratner Lasse Peters Anca D. Dragan Claire J. Tomlin

Many problems in robotics involve multiple decision making agents. To operate efficiently such settings, a robot must reason about the impact of its decisions on behavior other Differential games offer an expressive theoretical framework for formulating these types multi-agent problems. Unfortunately, most numerical solution techniques scale poorly with state dimension and are rarely used real-time applications. For this reason, it is common to predict future agents solve resulting...

10.1109/icra40945.2020.9197129 article EN 2020-05-01

Generating Legible Motion

OPENALEX - Publications

Anca D. Dragan Siddhartha S Srinivasa

Legible motion --- that communicates its intent to a human observer is crucial for enabling seamless human-robot collaboration. In this paper, we propose functional gradient optimization technique autonomously generating legible motion. Our algorithm optimizes legibility metric inspired by the psychology of action interpretation in humans, resulting trajectories purposefully deviate from what an would expect order better convey intent. A trust region constraint on ensures does not become too...

10.1184/r1/6554969.v1 article EN Robotics: Science and Systems 2013-06-01

Probabilistically Safe Robot Planning with Confidence-Based Human Predictions

OPENALEX - Publications

Jaime F. Fisac Andrea Bajcsy Sylvia Herbert David Fridovich-Keil Steven Wang and 2 more

In order to safely operate around humans, robots can employ predictive models of human motion. Unfortunately, these cannot capture the full complexity behavior and necessarily introduce simplifying assumptions. As a result, predictions may degrade whenever observed departs from assumed structure, which have negative implications for safety. this paper, we observe that how rational actions appear under particular model be viewed as an indicator model's ability describe human's current By...

10.15607/rss.2018.xiv.069 article EN 2018-06-26

Enabling robots to communicate their objectives

OPENALEX - Publications

Sandy H. Huang David Held Pieter Abbeel Anca D. Dragan

10.1007/s10514-018-9771-0 article EN Autonomous Robots 2018-06-18

Coming Soon ...