Alexander von Rohr

ORCID: 0000-0002-0005-0310
Research Areas
  • Gaussian Processes and Bayesian Inference
  • Fault Detection and Control Systems
  • Advanced Bandit Algorithms Research
  • Advanced Control Systems Optimization
  • Control Systems and Identification
  • Machine Learning and Data Classification
  • Advanced Multi-Objective Optimization Algorithms
  • Machine Learning and Algorithms
  • Anomaly Detection Techniques and Applications
  • Neural dynamics and brain function
  • Molecular Communication and Nanonetworks
  • Reinforcement Learning in Robotics
  • Robot Manipulation and Learning
  • Micro and Nano Robotics
  • Mental Health Research Topics
  • Microfluidic and Bio-sensing Technologies
  • Human Motion and Animation
  • Advanced Statistical Process Monitoring
  • Advanced Manufacturing and Logistics Optimization
  • Human Pose and Action Recognition
  • Modular Robots and Swarm Intelligence
  • Intelligent Tutoring Systems and Adaptive Learning

RWTH Aachen University
2021-2025

Max Planck Institute for Intelligent Systems
2019-2022

Ingenieurgesellschaft Auto und Verkehr (Germany)
2022

Max Planck Society
2019-2021

University of Lübeck
2018

Soft microrobots based on photoresponsive materials and controlled by light fields can generate a variety of different gaits. This inherent flexibility can be exploited to maximize their locomotion performance in a given environment and to adapt them to changing conditions. However, because of the lack of accurate models and the intrinsic variability among microrobots, analytical control design is not possible. Common data-driven approaches, on the other hand, require running prohibitive numbers of experiments and lead to very...

10.1109/iros.2018.8594092 article EN 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018-10-01
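
The sample-efficiency problem described above is typically attacked with a surrogate-based optimizer. Below is a minimal, hypothetical sketch of such a loop: a Gaussian-process surrogate with a UCB acquisition tunes two assumed gait parameters, with the function locomotion_speed standing in for the real microrobot experiment. This illustrates the general approach, not the specific algorithm of the paper.

    # Minimal Bayesian-optimization loop for gait-parameter tuning (illustrative only).
    # The objective below is a stand-in for a real microrobot locomotion experiment.
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF, ConstantKernel

    rng = np.random.default_rng(0)

    def locomotion_speed(params):
        """Hypothetical experiment: noisy locomotion speed for gait parameters in [0, 1]^2."""
        freq, amplitude = params
        return np.sin(3 * freq) * np.cos(2 * amplitude) + 0.05 * rng.standard_normal()

    # Candidate grid over two gait parameters (e.g., actuation frequency and amplitude).
    grid = np.array([[f, a] for f in np.linspace(0, 1, 25) for a in np.linspace(0, 1, 25)])

    X = rng.uniform(size=(3, 2))                      # a few initial random experiments
    y = np.array([locomotion_speed(x) for x in X])

    gp = GaussianProcessRegressor(kernel=ConstantKernel() * RBF([0.2, 0.2]),
                                  alpha=1e-3, normalize_y=True)

    for _ in range(20):                               # small experiment budget
        gp.fit(X, y)
        mean, std = gp.predict(grid, return_std=True)
        x_next = grid[np.argmax(mean + 2.0 * std)]    # UCB acquisition
        X = np.vstack([X, x_next])
        y = np.append(y, locomotion_speed(x_next))

    print("best gait parameters:", X[np.argmax(y)], "estimated speed:", y.max())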

Changing conditions or environments can cause system dynamics to vary over time. To ensure optimal control performance, controllers should adapt to these changes. When the underlying cause and time of change are unknown, we need to rely on online data for this adaptation. In this paper, we use time-varying Bayesian optimization (TVBO) to tune controllers in changing environments, using appropriate prior knowledge about the objective and its changes. Two properties are characteristic of many controller tuning problems: First, they exhibit incremental and lasting...

10.1109/cdc51059.2022.9992649 article EN 2022 IEEE 61st Conference on Decision and Control (CDC) 2022-12-06
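
One common way to encode the prior knowledge mentioned above is a spatio-temporal surrogate: the Gaussian process takes (controller parameter, time) as input, and its temporal length-scale controls how quickly old data is discounted. The sketch below is a generic illustration with an assumed toy cost, not the paper's exact model.

    # Spatio-temporal surrogate for time-varying controller tuning (illustrative sketch).
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    # Inputs are (controller gain, time step); the temporal length-scale controls forgetting.
    kernel = RBF(length_scale=[0.3, 5.0])             # dim 0: gain, dim 1: time
    gp = GaussianProcessRegressor(kernel=kernel, alpha=1e-2, normalize_y=True)

    rng = np.random.default_rng(1)

    def closed_loop_cost(gain, t):
        """Hypothetical cost whose optimum drifts slowly with time t."""
        return (gain - 0.5 - 0.3 * np.sin(0.05 * t)) ** 2 + 0.01 * rng.standard_normal()

    gains = np.linspace(0.0, 1.0, 101)
    X, y = [], []
    for t in range(60):
        if len(X) >= 3:
            gp.fit(np.array(X), np.array(y))
            # Predict the cost of every candidate gain *at the current time t*.
            mean, std = gp.predict(np.column_stack([gains, np.full_like(gains, t)]),
                                   return_std=True)
            gain = gains[np.argmin(mean - 2.0 * std)]  # lower confidence bound (minimization)
        else:
            gain = rng.uniform()
        X.append([gain, t])
        y.append(closed_loop_cost(gain, t))

    print("last chosen gain:", round(float(X[-1][0]), 3))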

Reinforcement learning (RL) aims to find an optimal policy by interacting with an environment. Consequently, learning complex behavior requires a vast number of samples, which can be prohibitive in practice. Nevertheless, instead of systematically reasoning and actively choosing informative samples, gradients for local search are often obtained from random perturbations. These samples yield high-variance estimates and hence are sub-optimal in terms of sample complexity. Actively selecting informative samples is at the core of Bayesian optimization,...

10.48550/arxiv.2106.11899 preprint EN other-oa arXiv (Cornell University) 2021-01-01
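
A toy sketch of the underlying idea, replacing random-perturbation gradient estimates with a probabilistic surrogate fitted to actively gathered local samples: the gradient of the GP posterior mean drives the parameter update. The two-parameter policy and the step sizes are assumptions for illustration; this is not the paper's exact algorithm.

    # Toy local policy search with a GP surrogate gradient (illustrative, not the paper's algorithm).
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(2)

    def episode_return(theta):
        """Hypothetical noisy return of a 2-parameter policy."""
        return -np.sum((theta - np.array([0.7, -0.2])) ** 2) + 0.02 * rng.standard_normal()

    theta = np.zeros(2)
    for it in range(30):
        # Sample a small batch of perturbations around the current parameters.
        X = theta + 0.1 * rng.standard_normal((8, 2))
        y = np.array([episode_return(x) for x in X])
        gp = GaussianProcessRegressor(kernel=RBF(0.2), alpha=1e-3, normalize_y=True).fit(X, y)

        # Gradient of the GP posterior mean at theta via central finite differences.
        eps = 1e-3
        grad = np.zeros(2)
        for i in range(2):
            e = np.zeros(2); e[i] = eps
            grad[i] = (gp.predict([theta + e])[0] - gp.predict([theta - e])[0]) / (2 * eps)

        theta = theta + 0.5 * grad                   # gradient-ascent step on the surrogate

    print("final policy parameters:", theta.round(3))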

When learning to ride a bike, a child falls down a number of times before achieving the first success. As falling usually has only mild consequences, it can be seen as a tolerable failure in exchange for a faster learning process, as it provides rich information about an undesired behavior. In the context of Bayesian optimization under unknown constraints (BOC), typical strategies for safe exploration are to explore conservatively and avoid failures by all means. On the other side of the spectrum, non-conservative BOC algorithms that allow failing may...

10.48550/arxiv.2005.07443 preprint EN other-oa arXiv (Cornell University) 2020-01-01
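
The trade-off described above can be illustrated with a generic constrained-BO loop: expected improvement weighted by the estimated probability of not failing, plus a hard budget on allowed failures that switches the search to conservative behavior once exhausted. The toy objective, constraint, and thresholds below are assumptions, not the paper's acquisition function.

    # Generic constrained Bayesian optimization with a budget of allowed failures (sketch).
    import numpy as np
    from scipy.stats import norm
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(3)
    objective = lambda x: -(x - 0.3) ** 2             # unknown reward to maximize
    constraint = lambda x: 0.7 - x                    # a failure occurs whenever this is negative

    grid = np.linspace(0.0, 1.0, 201).reshape(-1, 1)
    X = rng.uniform(size=(3, 1))
    y_obj = objective(X).ravel()
    y_con = constraint(X).ravel()
    failures, budget = 0, 3

    gp_obj = GaussianProcessRegressor(kernel=RBF(0.2), alpha=1e-4, normalize_y=True)
    gp_con = GaussianProcessRegressor(kernel=RBF(0.2), alpha=1e-4, normalize_y=True)

    for _ in range(25):
        gp_obj.fit(X, y_obj); gp_con.fit(X, y_con)
        mu, std = gp_obj.predict(grid, return_std=True)
        mu_c, std_c = gp_con.predict(grid, return_std=True)
        best = y_obj[y_con >= 0].max() if (y_con >= 0).any() else y_obj.min()
        z = (mu - best) / np.maximum(std, 1e-9)
        ei = (mu - best) * norm.cdf(z) + std * norm.pdf(z)       # expected improvement
        p_feasible = norm.cdf(mu_c / np.maximum(std_c, 1e-9))    # estimated chance of no failure
        # While failures remain in the budget, risky candidates are allowed; afterwards they are not.
        acq = ei * p_feasible if failures < budget else ei * (p_feasible > 0.95)
        x_next = grid[np.argmax(acq)]
        X = np.vstack([X, x_next])
        y_obj = np.append(y_obj, objective(x_next))
        y_con = np.append(y_con, constraint(x_next))
        failures += int(y_con[-1] < 0)

    print("failures used:", failures, "| best feasible x:",
          float(X[y_con >= 0][np.argmax(y_obj[y_con >= 0])][0]))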

How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on data-efficient policy improvements. The algorithm learns directly on the robot and treats simulation as an additional information source to speed up the learning process. At the core of the algorithm, a probabilistic model learns the dependence between policy parameters and objective, not only by performing experiments on the robot, but also by leveraging data from...

10.1109/tro.2025.3539192 preprint EN arXiv (Cornell University) 2024-11-21
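
One common way to let a simulator act as an additional information source, sketched below, is to use it as the prior mean of the surrogate and learn only the residual between simulated and real returns from robot experiments. The toy sim_return and real_return functions are assumptions for illustration; this is not necessarily the model structure used in the paper.

    # Simulator-as-prior surrogate for data-efficient policy search (illustrative sketch).
    # The GP models the residual between real-robot returns and (cheap, biased) simulator returns.
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(10)

    sim_return = lambda theta: -(theta - 0.45) ** 2           # cheap but biased simulator
    real_return = lambda theta: -(theta - 0.55) ** 2 + 0.01 * rng.standard_normal()

    grid = np.linspace(0, 1, 201).reshape(-1, 1)
    X = rng.uniform(size=(2, 1))                              # only two robot experiments so far
    y = np.array([real_return(float(x[0])) for x in X])

    # GP on the residual: real return minus simulated return.
    gp = GaussianProcessRegressor(kernel=RBF(0.3), alpha=1e-3).fit(X, y - sim_return(X).ravel())

    for _ in range(8):                                        # small budget of robot experiments
        mu_res, std = gp.predict(grid, return_std=True)
        mu = sim_return(grid).ravel() + mu_res                # simulator acts as the prior mean
        theta_next = grid[np.argmax(mu + 2.0 * std)]
        X = np.vstack([X, theta_next])
        y = np.append(y, real_return(float(theta_next[0])))
        gp.fit(X, y - sim_return(X).ravel())

    print("best robot-evaluated policy parameter:", float(X[np.argmax(y)][0]))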

Controller tuning is crucial for closed-loop performance but often involves manual adjustments. Although Bayesian optimization (BO) has been established as a data-efficient method for automated tuning, applying it to large and high-dimensional search spaces remains challenging. We extend a recently proposed local variant of BO to include crash constraints, where the controller can only be successfully evaluated in an a-priori unknown feasible region. We demonstrate its efficiency through simulations...

10.1515/auto-2023-0181 article EN cc-by at - Automatisierungstechnik 2024-04-01
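
A rough sketch of how crash constraints can be handled inside a local search: candidates are proposed around the incumbent, a classifier trained on past successes and crashes filters out likely-infeasible points, and only successful evaluations feed the objective surrogate. The toy evaluate_controller function and all thresholds are assumptions, not the extension proposed in the paper.

    # Local search with crash constraints: a feasibility classifier filters candidates (sketch).
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor, GaussianProcessClassifier
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(4)

    def evaluate_controller(theta):
        """Hypothetical evaluation: returns (cost, crashed). Crashes outside an unknown region."""
        if np.linalg.norm(theta) > 1.2:
            return None, True
        return float(np.sum((theta - 0.4) ** 2)), False

    incumbent = np.zeros(2)
    X, cost, crashed = [incumbent.copy()], [evaluate_controller(incumbent)[0]], [0]

    for _ in range(40):
        candidates = incumbent + 0.3 * rng.standard_normal((64, 2))   # local trust region
        if 0 < sum(crashed) < len(crashed):
            clf = GaussianProcessClassifier(kernel=RBF(0.5)).fit(np.array(X), np.array(crashed))
            ok = clf.predict_proba(candidates)[:, 1] < 0.3            # keep likely-feasible ones
            candidates = candidates[ok] if ok.any() else candidates
        succ = np.array(crashed) == 0
        gp = GaussianProcessRegressor(kernel=RBF(0.5), alpha=1e-3, normalize_y=True)
        gp.fit(np.array(X)[succ], np.array(cost)[succ])
        mu, std = gp.predict(candidates, return_std=True)
        theta = candidates[np.argmin(mu - std)]                       # optimistic for minimization
        c, fail = evaluate_controller(theta)
        X.append(theta); crashed.append(int(fail)); cost.append(np.nan if fail else c)
        if not fail and c < min(v for v in cost[:-1] if not np.isnan(v)):
            incumbent = theta

    print("incumbent controller parameters:", incumbent.round(3))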

Deep Reinforcement Learning (DRL) in simulation often results in brittle and unrealistic learning outcomes. To push the agent towards more desirable solutions, prior information can be injected into the learning process through, for instance, reward shaping, expert data, or motion primitives. We propose an additional inductive bias for robot learning: latent actions learned from demonstration as priors in the action space. We show that these priors can be learned from only a single open-loop gait cycle using a simple autoencoder. Using these priors combined with...

10.48550/arxiv.2410.03246 preprint EN arXiv (Cornell University) 2024-10-04
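
A minimal sketch of the "latent actions from a single gait cycle" idea: a small autoencoder is trained on the action sequence of one open-loop gait cycle, and its decoder then maps a low-dimensional latent command back to full joint actions, which a downstream policy could use as its action space. The synthetic gait, network sizes, and latent dimension are assumptions for illustration.

    # Learning a latent action space from one open-loop gait cycle with a small autoencoder (sketch).
    import torch
    import torch.nn as nn

    torch.manual_seed(0)

    # Synthetic stand-in for one recorded gait cycle: 50 time steps, 12 joint actions each.
    t = torch.linspace(0, 2 * torch.pi, 50).unsqueeze(1)
    phases = torch.linspace(0, 1, 12).unsqueeze(0)
    gait_cycle = torch.sin(t + 2 * torch.pi * phases)          # shape (50, 12)

    latent_dim = 3
    encoder = nn.Sequential(nn.Linear(12, 32), nn.ReLU(), nn.Linear(32, latent_dim))
    decoder = nn.Sequential(nn.Linear(latent_dim, 32), nn.ReLU(), nn.Linear(32, 12))

    opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-2)
    for epoch in range(500):
        recon = decoder(encoder(gait_cycle))
        loss = nn.functional.mse_loss(recon, gait_cycle)
        opt.zero_grad(); loss.backward(); opt.step()

    print("reconstruction MSE:", loss.item())
    # A downstream RL policy could now output `latent_dim`-dimensional actions,
    # which the frozen decoder expands to full 12-dimensional joint commands.
    z = torch.zeros(1, latent_dim)
    print("decoded joint action for a zero latent command:", decoder(z).detach().numpy().round(2))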

Diffusion models have recently gained popularity for policy learning in robotics due to their ability to capture high-dimensional and multimodal distributions. However, diffusion policies are inherently stochastic and typically trained offline, limiting their ability to handle unseen and dynamic conditions where novel constraints not represented in the training data must be satisfied. To overcome this limitation, we propose diffusion predictive control with constraints (DPCC), an algorithm for diffusion-based control with explicit state and action constraints that can deviate...

10.48550/arxiv.2412.09342 preprint EN arXiv (Cornell University) 2024-12-12
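
The mechanism described above can be illustrated with a toy reverse-diffusion loop in which each intermediate action trajectory is projected onto a simple box constraint before the next denoising step, so the final sample satisfies constraints that were never part of training. The hand-made denoiser below is a placeholder, not a trained diffusion policy or the DPCC algorithm itself.

    # Toy illustration of constraint projection inside a reverse-diffusion sampling loop.
    # The "denoiser" is a hand-made placeholder, not a trained diffusion policy.
    import numpy as np

    rng = np.random.default_rng(5)
    horizon, action_dim = 16, 2
    target = np.tile(np.array([0.8, -0.5]), (horizon, 1))      # actions the fake model "knows"

    def denoiser(actions, step, n_steps):
        """Placeholder denoising step: nudge the noisy trajectory towards the training target."""
        return actions + (target - actions) / (n_steps - step + 1)

    def project(actions, low, high):
        """Project the action trajectory onto box constraints (novel at test time)."""
        return np.clip(actions, low, high)

    n_steps = 30
    actions = rng.standard_normal((horizon, action_dim))       # start from pure noise
    low, high = np.array([-1.0, -1.0]), np.array([0.5, 0.5])   # test-time constraints

    for k in range(n_steps):
        actions = denoiser(actions, k, n_steps)                 # one reverse-diffusion update
        actions = project(actions, low, high)                   # enforce constraints at every step
        if k < n_steps - 1:
            actions += 0.05 * rng.standard_normal(actions.shape)  # remaining stochasticity

    print("max action per dimension:", actions.max(axis=0))     # respects the 0.5 upper bound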

We consider the problem of sequentially optimizing a time-varying objective function using time-varying Bayesian optimization (TVBO). To cope with stale data arising from time variations, current approaches to TVBO require prior knowledge of a constant rate of change. However, in practice, the rate of change is usually unknown. We propose an event-triggered algorithm, ET-GP-UCB, that treats the problem as static until it detects changes in the objective online and then resets the dataset. This allows the algorithm to adapt to the realized temporal changes without the need for...

10.48550/arxiv.2208.10790 preprint EN other-oa arXiv (Cornell University) 2022-01-01
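
A sketch of the event-trigger idea: the objective is treated as static, and whenever a new observation falls far outside the GP's confidence interval, a change is declared and the dataset is reset. The trigger statistic and toy objective below are simplifications, not the exact ET-GP-UCB test.

    # Event-triggered dataset reset for a time-varying objective (illustrative sketch).
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(6)

    def objective(x, t):
        """Hypothetical objective whose optimum jumps at t = 40."""
        opt = 0.2 if t < 40 else 0.8
        return -(x - opt) ** 2 + 0.01 * rng.standard_normal()

    grid = np.linspace(0, 1, 201).reshape(-1, 1)
    X, y, resets = [], [], []

    for t in range(80):
        if len(X) >= 3:
            gp = GaussianProcessRegressor(kernel=RBF(0.2), alpha=1e-3, normalize_y=True)
            gp.fit(np.array(X), np.array(y))
            mu, std = gp.predict(grid, return_std=True)
            x_next = float(grid[np.argmax(mu + 2.0 * std), 0])   # GP-UCB on the static model
            mu_x, std_x = gp.predict([[x_next]], return_std=True)
        else:
            x_next, mu_x, std_x = float(rng.uniform()), None, None
        y_next = objective(x_next, t)
        # Event trigger: observation inconsistent with the static GP -> reset the dataset.
        if mu_x is not None and abs(y_next - mu_x[0]) > 3.0 * std_x[0] + 0.05:
            X, y = [], []
            resets.append(t)
        X.append([x_next]); y.append(y_next)

    print("reset events at time steps:", resets)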

Robust controllers ensure stability in feedback loops designed under uncertainty but at the cost of performance. Model uncertainty in time-invariant systems can be reduced by recently proposed learning-based methods, which improve the performance of robust controllers using data. However, in practice, many systems also exhibit uncertainty in the form of changes over time, e.g., due to weight shifts or wear and tear, leading to decreased performance or instability of the learned controller. We propose an event-triggered learning algorithm that decides when to learn in the face of uncertainty in an LQR problem with rare...

10.1109/cdc51059.2022.9993350 article EN 2022 IEEE 61st Conference on Decision and Control (CDC) 2022-12-06
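
A rough sketch of an event-triggered learning loop for LQR: the one-step prediction error of the current model is monitored, and when it rises well above the noise level, the dynamics are re-identified from recent data and the gain is redesigned. The scalar system, the simple threshold trigger, and the least-squares identification are placeholders, not the trigger derived in the paper.

    # Event-triggered model update and LQR redesign for a scalar system (illustrative sketch).
    import numpy as np
    from scipy.linalg import solve_discrete_are

    rng = np.random.default_rng(7)
    Q, R = np.array([[1.0]]), np.array([[1.0]])

    def lqr_gain(a, b):
        """Discrete-time LQR gain for the scalar system x+ = a x + b u."""
        A, B = np.array([[a]]), np.array([[b]])
        P = solve_discrete_are(A, B, Q, R)
        return ((B.T @ P @ A) / (R + B.T @ P @ B)).item()

    a_true, b_true = 0.9, 0.5
    a_model, b_model = 0.9, 0.5                        # current (initially exact) model
    K = lqr_gain(a_model, b_model)

    x, data, triggers = 0.0, [], []
    for t in range(400):
        if t == 200:
            a_true = 1.6                               # unannounced change, e.g. wear and tear
        u = -K * x + 0.05 * rng.standard_normal()      # feedback plus a little excitation
        x_next = a_true * x + b_true * u + 0.01 * rng.standard_normal()
        data.append((x, u, x_next))
        # Event trigger: one-step prediction error far above the noise level.
        if abs(x_next - (a_model * x + b_model * u)) > 0.05 and len(data) >= 20:
            recent = np.array(data[-20:])
            theta, *_ = np.linalg.lstsq(recent[:, :2], recent[:, 2], rcond=None)
            a_model, b_model = theta                   # re-identified model
            K = lqr_gain(a_model, b_model)             # redesigned gain
            triggers.append(t)
            data = []                                  # learn from fresh data next time
        x = x_next

    print("learning triggered at steps:", triggers)
    print("final model (a, b):", round(a_model, 2), round(b_model, 2))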

We propose a data-driven control method for systems with aleatoric uncertainty, for example, robot fleets with variations between agents. Our method leverages shared trajectory data to increase the robustness of the designed controller and thus facilitate the transfer to new agents without the need for prior parameter or uncertainty estimations. In contrast to existing work on experience transfer for performance, our approach focuses on robustness and uses data collected from multiple realizations to guarantee generalization to unseen ones. The method is based on scenario optimization combined...

10.48550/arxiv.2306.16973 preprint EN other-oa arXiv (Cornell University) 2023-01-01
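
A sketch of the scenario idea described above: data from several realizations of the fleet yields sampled models (scenarios), and a single controller gain is chosen to minimize the worst-case cost across all of them. The scalar dynamics and the brute-force gain search below are simplifications for illustration.

    # Scenario-based controller design from multiple system realizations (illustrative sketch).
    import numpy as np

    rng = np.random.default_rng(8)

    # Hypothetical fleet: each robot is a scalar system x+ = a x + b u with agent-to-agent variation.
    scenarios = [(0.9 + 0.1 * rng.standard_normal(), 0.5 + 0.05 * rng.standard_normal())
                 for _ in range(25)]

    def closed_loop_cost(k, a, b, horizon=50):
        """Quadratic cost of the static feedback u = -k x on one scenario (infinite if unstable)."""
        x, cost = 1.0, 0.0
        for _ in range(horizon):
            u = -k * x
            cost += x ** 2 + u ** 2
            x = a * x + b * u
            if abs(x) > 1e6:
                return np.inf
        return cost

    gains = np.linspace(0.0, 3.0, 301)
    # Pick the gain that minimizes the worst-case cost over all observed realizations.
    worst_case = [max(closed_loop_cost(k, a, b) for a, b in scenarios) for k in gains]
    k_star = gains[int(np.argmin(worst_case))]
    print("scenario-robust gain:", round(float(k_star), 2), "worst-case cost:", round(min(worst_case), 2))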

Failures are challenging for learning to control physical systems since they risk damage, time-consuming resets, and often provide little gradient information. Adding safety constraints to exploration typically requires a lot of prior knowledge and domain expertise. We present a safety measure which implicitly captures how the system dynamics relate to a set of failure states. Not only can this measure be used as a safety function, but it also allows us to directly compute safe state-action pairs. Further, we show a model-free approach to learn this measure by...

10.48550/arxiv.1910.02835 preprint EN cc-by arXiv (Cornell University) 2019-01-01
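
A generic toy illustration of a safety value over state-action pairs on a small chain: actions that lead to failure states get value zero, and the values of other actions shrink the closer the dynamics get to failure, so thresholding the table yields safe state-action pairs. This construction is an assumption for illustration and not the measure defined in the paper.

    # Toy tabular "safety value" over state-action pairs (generic sketch, not the paper's measure).
    import numpy as np

    n_states, n_actions, gamma = 20, 2, 0.9
    failure = {0, 19}                                 # hypothetical failure states at the boundary

    def step(s, a):
        """Deterministic toy chain: action 0 moves left, action 1 moves right."""
        return max(0, min(n_states - 1, s + (1 if a == 1 else -1)))

    S = np.ones((n_states, n_actions))
    for _ in range(200):                              # fixed-point iteration
        for s in range(n_states):
            if s in failure:
                S[s, :] = 0.0
                continue
            for a in range(n_actions):
                s_next = step(s, a)
                if s_next in failure:
                    S[s, a] = 0.0
                else:
                    # Discounted survival value under a uniformly random policy.
                    S[s, a] = (1 - gamma) + gamma * S[s_next].mean()

    print("safety values near the left failure state (states 1-4):")
    print(S[1:5].round(2))                            # low values flag proximity to failure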

Changing conditions or environments can cause system dynamics to vary over time. To ensure optimal control performance, controllers should adapt to these changes. When the underlying cause and time of change are unknown, we need to rely on online data for this adaptation. In this paper, we use time-varying Bayesian optimization (TVBO) to tune controllers in changing environments, using appropriate prior knowledge about the objective and its changes. Two properties are characteristic of many controller tuning problems: First, they exhibit incremental and lasting...

10.48550/arxiv.2207.11120 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Robust controllers ensure stability in feedback loops designed under uncertainty but at the cost of performance. Model uncertainty in time-invariant systems can be reduced by recently proposed learning-based methods, which improve the performance of robust controllers using data. However, in practice, many systems also exhibit uncertainty in the form of changes over time, e.g., due to weight shifts or wear and tear, leading to decreased performance or instability of the learned controller. We propose an event-triggered learning algorithm that decides when to learn in the face of uncertainty in an LQR problem with rare...

10.48550/arxiv.2207.14252 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Automated bin-picking is a prerequisite for fully automated manufacturing and warehouses. To successfully pick an item from an unstructured bin, the robot needs to first detect possible grasps for the objects, decide on which object to remove, and consequently plan and execute a feasible trajectory to retrieve the chosen object. Over the last years, significant progress has been made towards solving these problems. However, when multiple robot arms are cooperating, the decision and planning problems become exponentially harder. We propose an integrated...

10.48550/arxiv.2211.11089 preprint EN cc-by-nc-nd arXiv (Cornell University) 2022-01-01

Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications, robustness to uncertainty remains an important challenge. Since Bayesian methods quantify the uncertainty of the learning results, it is natural to incorporate these uncertainties into a robust design. In contrast to most state-of-the-art approaches that consider worst-case estimates,...

10.48550/arxiv.2105.07668 preprint EN other-oa arXiv (Cornell University) 2021-01-01
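
The premise above, propagating Bayesian model uncertainty into the design instead of committing to a single worst-case model, can be illustrated by sampling plausible dynamics from a GP posterior and scoring each candidate controller on all of them. The scalar system, the gridded gain search, and the one-step posterior sampling below are assumptions for illustration, not the method of the paper.

    # Evaluating controller gains on dynamics sampled from a GP posterior (illustrative sketch).
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    rng = np.random.default_rng(9)

    # Learn the unknown scalar dynamics x+ = f(x, u) from a handful of noisy transitions.
    f_true = lambda x, u: 0.9 * x + 0.4 * u
    X_train = rng.uniform(-1, 1, size=(15, 2))                  # columns: state x, input u
    y_train = np.array([f_true(x, u) for x, u in X_train]) + 0.01 * rng.standard_normal(15)
    gp = GaussianProcessRegressor(kernel=RBF(1.0), alpha=1e-4, normalize_y=True).fit(X_train, y_train)

    def expected_cost(k, n_samples=20, horizon=20):
        """Average quadratic cost of u = -k x over dynamics drawn from the GP posterior."""
        costs = []
        for s in range(n_samples):
            x, cost = 1.0, 0.0
            for t in range(horizon):
                u = -k * x
                # One-step transition sampled from the posterior; a simplification of a full
                # consistent function draw, but enough to propagate model uncertainty.
                x = float(gp.sample_y(np.array([[x, u]]), random_state=s * horizon + t)[0, 0])
                cost += x ** 2 + u ** 2
            costs.append(cost)
        return np.mean(costs)

    gains = np.linspace(0.0, 2.0, 11)
    k_best = gains[int(np.argmin([expected_cost(k) for k in gains]))]
    print("gain with lowest expected cost under model uncertainty:", round(float(k_best), 2))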