Daniel Hennes

ORCID: 0000-0002-3646-5286
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Reinforcement Learning in Robotics
  • Robotics and Sensor-Based Localization
  • Robotic Path Planning Algorithms
  • Artificial Intelligence in Games
  • Game Theory and Applications
  • Sports Analytics and Performance
  • Modular Robots and Swarm Intelligence
  • Auction Theory and Applications
  • Advanced Bandit Algorithms Research
  • Evolutionary Algorithms and Applications
  • Spacecraft Dynamics and Control
  • Space Satellite Systems and Control
  • Evolutionary Game Theory and Cooperation
  • Distributed Control Multi-Agent Systems
  • Anomaly Detection Techniques and Applications
  • Insect and Arachnid Ecology and Behavior
  • Astro and Planetary Science
  • Sports Performance and Training
  • Experimental Behavioral Economics Studies
  • Domain Adaptation and Few-Shot Learning
  • Complex Systems and Time Series Analysis
  • Advanced Vision and Imaging
  • Optimization and Search Problems
  • Generative Adversarial Networks and Image Synthesis
  • Intelligent Tutoring Systems and Adaptive Learning

DeepMind (United Kingdom)
2020-2024

Google (United Kingdom)
2024

Google (United States)
2019-2021

University of Stuttgart
2018-2019

European Space Research and Technology Centre
2014-2018

German Research Centre for Artificial Intelligence
2016-2017

European Space Agency
2015-2016

University of Bremen
2016

Maastricht University
2007-2013

Eindhoven University of Technology
2008-2009

The interaction of multiple autonomous agents gives rise to highly dynamic and nondeterministic environments, contributing the complexity in applications such as automated financial markets, smart grids, or robotics. Due sheer number situations that may arise, it is not possible foresee program optimal behaviour for all beforehand. Consequently, becomes essential success system can learn their adapt new circumstances. past two decades have seen emergence reinforcement learning, both single...

10.1613/jair.4818 article EN cc-by Journal of Artificial Intelligence Research 2015-08-17

We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up a human expert level. is one few iconic board games that Artificial Intelligence (AI) has not yet mastered. This popular enormous tree on order $10^{535}$ nodes, i.e., $10^{175}$ times larger than Go. It additional complexity requiring decision-making under information, similar Texas hold'em poker, which significantly smaller (on $10^{164}$ nodes). Decisions in are...

10.1126/science.add4679 article EN Science 2022-12-01

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning search/planning games. supports n-player (single- multi- agent) zero-sum, cooperative general-sum, one-shot sequential, strictly turn-taking simultaneous-move, perfect imperfect information games, as well traditional multiagent such (partially- fully- observable) grid worlds social dilemmas. also includes tools to analyze dynamics other common evaluation metrics. This document serves both...

10.48550/arxiv.1908.09453 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The rapid progress in artificial intelligence (AI) and machine learning has opened unprecedented analytics possibilities various team individual sports, including baseball, basketball, tennis. More recently, AI techniques have been applied to football, due a huge increase data collection by professional teams, increased computational power, advances learning, with the goal of better addressing new scientific challenges involved analysis both players’ coordinated teams’ behaviors. research...

10.1613/jair.1.12505 article EN cc-by Journal of Artificial Intelligence Research 2021-05-06

Learning to combine control at the level of joint torques with longer-term goal-directed behavior is a long-standing challenge for physically embodied artificial agents. Intelligent in physical world unfolds across multiple spatial and temporal scales: Although movements are ultimately executed instantaneous muscle tensions or torques, they must be selected serve goals that defined on much longer time scales often involve complex interactions environment other Recent research has...

10.1126/scirobotics.abo0235 article EN Science Robotics 2022-08-31

Abstract Identifying key patterns of tactics implemented by rival teams, and developing effective responses, lies at the heart modern football. However, doing so algorithmically remains an open research challenge. To address this unmet need, we propose TacticAI, AI football assistant developed evaluated in close collaboration with domain experts from Liverpool FC. We focus on analysing corner kicks, as they offer coaches most direct opportunities for interventions improvements. TacticAI...

10.1038/s41467-024-45965-x article EN cc-by Nature Communications 2024-03-19

This paper describes a multi-robot collision avoidance system based on the velocity obstacle paradigm. In contrast to previous approaches, we alleviate strong requirement for perfect sensing (i.e. global positioning) using Adaptive Monte-Carlo Localization per-agent level. While such methods as Optimal Reciprocal Collision Avoidance guarantee local collision-free motion large number of robots, given knowledge positions and speeds, realistic implementation requires further extensions deal...

10.5555/2343576.2343597 article EN 2012-06-04

We present a multi-mobile robot collision avoidance system based on the velocity obstacle paradigm. Current positions and velocities of surrounding robots are translated to an efficient geometric representation determine safe motions. Each uses on-board localization local communication build its surroundings. Our close error-bounded convex approximation density distribution results in collision-free paths under uncertainty. While many algorithms approximated by circumscribed radii, we use...

10.1109/iros.2012.6386125 article EN 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2012-10-01

The space close to our planet is getting more and polluted. Orbiting debris are posing an increasing threat operational orbits the cascading effect, known as Kessler syndrome, may result in a future where risk of orbiting at some altitudes will be unacceptable. Many argue that density Low Earth Orbit (LEO) has already reached level sufficient trigger such effect. An obvious consequence we soon have actively clean from debris. Such mission involve complex combinatorial decision choose which...

10.1145/2739480.2754727 article EN 2015-07-07

The use of soft robots in future space exploration is still a far-fetched idea, but an attractive one. Soft are inherently compliant mechanisms that well suited for locomotion on rough terrain as often faced extra-planetary environments. Depending the particular application and requirements, best shape (or body morphology) strategy such will vary substantially. Recent developments robotics evolutionary optimization showed possibility to simultaneously evolve morphology simulated trials....

10.1145/2739480.2754731 article EN 2015-07-07

This paper investigates a population-based training regime based on game-theoretic principles called Policy-Spaced Response Oracles (PSRO). PSRO is general in the sense that it (1) encompasses well-known algorithms such as fictitious play and double oracle special cases, (2) principle applies to general-sum, many-player games. Despite this, prior studies of have been focused two-player zero-sum games, wherein Nash equilibria are tractably computable. In moving from games more settings,...

10.48550/arxiv.1909.12823 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Although multi-robot systems have received substantial research attention in recent years, coordination still remains a difficult task. Especially, when dealing with spatially distributed tasks and many robots, central control quickly becomes infeasible due to the exponential explosion number of joint actions states. We propose general algorithm that allows for control, overcomes growth by aggregating effect other agents system into probabilistic model, called subjective approximations, then...

10.5555/2772879.2773265 article EN Adaptive Agents and Multi-Agents Systems 2015-05-04

We investigate the use of deep artificial neural networks to approximate optimal state-feedback control continuous time, deterministic, non-linear systems. The are trained in a supervised manner using trajectories generated by solving problem via Hermite-Simpson transcription method. find that able represent with high accuracy and precision well outside training area. consider dynamical models under different cost functions result both smooth discontinuous (bang-bang) solutions. In...

10.1109/ssci.2016.7850105 article EN 2021 IEEE Symposium Series on Computational Intelligence (SSCI) 2016-12-01

Recent progress in deep reinforcement learning has enabled simulated agents to learn complex behavior policies from scratch, but their data complexity often prohibits real-world applications. The process can be sped up by expert demonstrations those costly acquire. We demonstrate that it is possible employ model-free combined with planning quickly generate informative for a manipulation task. In particular, we use an approximate trajectory optimization approach global exploration based on...

10.1109/lra.2019.2928212 article EN IEEE Robotics and Automation Letters 2019-07-12

The on-chip fabrication and manipulation of microstructures are expected to be applied for single cell analysis system such as measurement tools. In this paper, we previously present a methodology fabricating assembling inside microfluidic channel. By the illumination patterned UV-ray through mask under microscope, with arbitrary shape made photo-crosslinkable resin device. fabricated at desired place channel manipulated by optical tweezers. Based on technique which can manipulate multiple...

10.1109/iros.2011.6094571 article EN 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2011-09-01

The focus of this paper is on situational awareness airborne agents capable 6D motion, in particular multi-rotor UAVs. We propose the fusion 2D laser range finder, altitude, and attitude sensor data order to perform simultaneous localization mapping (SLAM) indoors. In contrast other planar finder based SLAM approaches, we a 3D instead map. To represent environment an octree map used. Our scan registration algorithm derived from Hector SLAM. evaluate performance our system simulation real...

10.1109/icuas.2013.6564688 article EN 2022 International Conference on Unmanned Aircraft Systems (ICUAS) 2013-05-01

Since early in robotics the performance of odometry techniques has been constant research for mobile robots. This is due to its direct influence on localization. The pose error grows unbounded dead-reckoning systems and uncertainty negative impacts localization mapping (i.e. SLAM). terms residuals, i.e. difference between expected real state, related statistical or probabilistic motion models. A novel approach model errors using Gaussian processes (GPs) presented. methodology trains a GP...

10.1109/icra.2017.7989670 article EN 2017-05-01

We consider the problem of optimally transferring a spacecraft from starting to target asteroid. introduce novel approximations for important quantities characterizing optimal transfer in case short times (asteroid hops). propose and study detail phasing value φ, maximum initial mass m* arrival m <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">f</sub> . The new require orders magnitude less computational effort with respect state-of-the-art...

10.1109/ssci.2016.7850107 article EN 2021 IEEE Symposium Series on Computational Intelligence (SSCI) 2016-12-01

In the present work, we propose an active tactile exploration framework to obtain a surface model of unknown object utilizing multiple contacts simultaneously. To incorporate these contacts, strategy is based on differential entropy underlying Gaussian process implicit model, which formalizes with within information theoretic context and additionally allows for nonmyopic multi-step planning. contrast many previous approaches, robot continuously slides along its end-effectors gather stimuli,...

10.1109/icra.2019.8793773 article EN 2022 International Conference on Robotics and Automation (ICRA) 2019-05-01

The on-chip fabrication and manipulation of microstructures are expected to be applied for single cell analysis system such as measurement tools. In this paper, we previously present a methodology fabricating assembling inside microfluidic channel. By the illumination patterned UV-ray through mask under microscope, with arbitrary shape made photo-crosslinkable resin device. fabricated at desired place channel manipulated by optical tweezers. Based on technique which can manipulate multiple...

10.1109/iros.2011.6048189 article EN 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2011-09-01

Self-supervised learning is a reliable mechanism in which robot uses an original, trusted sensor cue for training to recognize additional, complementary cue. We study the first time self-supervised how robot’s behavior should be organized, so that can keep performing its task case original becomes unavailable. this persistent form of context flying has avoid obstacles based on distance estimates from visual stereo vision. Over it will learn also estimate distances monocular appearance cues....

10.1177/1756829318756355 article EN cc-by-nc International Journal of Micro Air Vehicles 2018-05-16
Coming Soon ...