NFDI4DS | UHH-SEMS - Publication Details

Diederik M. Roijers

ORCID: 0000-0002-2825-2491

About

Contact & Profiles

OpenAlex ID: A5081436755

Research Areas

Reinforcement Learning in Robotics
Advanced Multi-Objective Optimization Algorithms
Advanced Bandit Algorithms Research
Game Theory and Applications
Auction Theory and Applications
Artificial Intelligence in Games
Evolutionary Algorithms and Applications
Adaptive Dynamic Programming Control
Smart Grid Energy Management
Experimental Behavioral Economics Studies
COVID-19 epidemiological studies
Optimization and Search Problems
Bayesian Modeling and Causal Inference
Machine Learning and Algorithms
Metaheuristic Optimization Algorithms Research
Economic theories and models
Water resources management and optimization
Influenza Virus Research Studies
Advanced Control Systems Optimization
Modular Robots and Swarm Intelligence
Adversarial Robustness in Machine Learning
Digital Games and Media
Simulation Techniques and Applications
Gaussian Processes and Bayesian Inference
Process Optimization and Integration

Vrije Universiteit Brussel
2017-2024

University of Applied Sciences Utrecht
2019-2023

Amsterdam University of Applied Sciences
2023

Vrije Universiteit Amsterdam
2018-2021

University of Amsterdam
2013-2021

Utrecht University
2012-2020

Amsterdam UMC Location Vrije Universiteit Amsterdam
2018

University of Surrey
2018

University of Oxford
2016-2017

Amsterdam University of the Arts
2014

A Survey of Multi-Objective Sequential Decision-Making

OPENALEX - Publications

Diederik M. Roijers Peter Vamplew Shimon Whiteson Richard Dazeley

Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct...

10.1613/jair.3987 article EN cc-by Journal of Artificial Intelligence Research 2013-10-18

A practical guide to multi-objective reinforcement learning and planning

Conor F. Hayes Roxana Rădulescu Eugenio Bargiacchi Johan Källström Matthew D Macfarlane and 13 more

Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes only a single objective, or that multiple objectives can be adequately handled via a simple linear combination. Such approaches may oversimplify the underlying problem and hence produce suboptimal results. This paper serves as a guide to the application of multi-objective...

10.1007/s10458-022-09552-y article EN cc-by Autonomous Agents and Multi-Agent Systems 2022-04-01

Multi-Objective Decision Making

Diederik M. Roijers Shimon Whiteson

Many real-world decision problems have multiple objectives. For example, when choosing a medical treatment plan, we want to maximize the efficacy of treatment, but also minimize side effects.

10.2200/s00765ed1v01y201704aim034 article EN Synthesis lectures on artificial intelligence and machine learning 2017-04-20

Coordination of Electric Vehicle Charging Through Multiagent Reinforcement Learning

Felipe Leno da Silva Cyntia Eico Hayama Nishida Diederik M. Roijers Anna Helena Reali Costa

The number of Electric Vehicle (EV) owners is expected to significantly increase in the near future, since EVs are regarded as valuable assets both for transportation and energy storage purposes. However, recharging a large fleet of EVs during peak hours may overload transformers in the distribution grid. Although several methods have been proposed to flatten peak-hour loads and recharge EVs as fairly as possible in the available time, these typically focus either on a single type of tariff or on making strong assumptions regarding the distribution grid. In...

10.1109/tsg.2019.2952331 article EN IEEE Transactions on Smart Grid 2019-11-07

Towards Personalised Gaming via Facial Expression Recognition

Paris Mavromoustakos Blom Sander Bakkes Chek Tien Tan Shimon Whiteson Diederik M. Roijers and 2 more

In this paper we propose an approach for personalising the space in which a game is played (i.e., levels) dependent on classifications of the user's facial expression, to the end of tailoring the affective game experience to the individual user. Our approach is aimed at online game personalisation, i.e., the game experience is personalised during actual play of the game. A key insight of this paper is that game personalisation techniques can leverage novel computer vision-based techniques to unobtrusively infer player experiences automatically based on facial expression analysis. Specifically, to the end of tailoring the affective game experience to the individual user, in this paper we (1) leverage the proven InSight...

10.1609/aiide.v10i1.12707 article EN Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment 2021-06-29

Multi-Objective Deep Reinforcement Learning

Hossam Mossalam Yannis Assael Diederik M. Roijers Shimon Whiteson

We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional multi-objective decision problems where the relative importances of the objectives are not known a priori. Using features from the high-dimensional inputs, DOL computes the convex coverage set containing all potential optimal solutions of the convex combinations of the objectives. To our knowledge, this is the first time that deep reinforcement learning has succeeded in learning multi-objective policies. In addition, we provide a testbed with two experiments to be used as a benchmark for deep multi-objective reinforcement learning.

10.48550/arxiv.1610.02707 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Open-Ended Learning: A Conceptual Framework Based on Representational Redescription

Stéphane Doncieux David Filliat Natalia Díaz-Rodríguez Timothy M. Hospedales Richard J. Duro and 5 more

Reinforcement learning (RL) aims at building a policy that maximizes a task-related reward within a given domain. When the domain is known, i.e., when its states, actions and reward are defined, Markov Decision Processes (MDPs) provide a convenient theoretical framework to formalize RL. But in an open-ended learning process, an agent or robot must solve an unbounded sequence of tasks that are not known in advance, and the corresponding MDPs cannot be built at design time. This defines the main challenges of open-ended learning: how can the agent learn how to behave...

10.3389/fnbot.2018.00059 article EN cc-by Frontiers in Neurorobotics 2018-09-25

Time efficiency in optimization with a Bayesian-evolutionary algorithm

Gongjin Lan Jakub M. Tomczak Diederik M. Roijers A. E. Eiben

10.1016/j.swevo.2021.100970 article EN Swarm and Evolutionary Computation 2021-08-25

Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances

Jie Jiang Qiuqiang Kong Mark D. Plumbley Nigel Gilbert Mark Hoogendoorn and 1 more

Energy disaggregation, a.k.a. Non-Intrusive Load Monitoring, aims to separate the energy consumption of individual appliances from the readings of a mains power meter measuring the total energy consumption of, e.g., a whole house. Energy consumption of individual appliances can be useful in many applications, e.g., providing appliance-level feedback to the end users to help them understand their energy consumption and ultimately save energy. Recently, with the availability of large-scale energy consumption datasets, various neural network models such as convolutional neural networks and recurrent neural networks have been investigated to solve...

10.1145/3441300 article EN ACM Transactions on Knowledge Discovery from Data 2021-05-03

Scalar reward is not enough: a response to Silver, Singh, Precup and Sutton (2021)

Peter Vamplew Benjamin J. Smith Johan Källström Gabriel de Oliveira Ramos Roxana Rădulescu and 7 more

The recent paper “Reward is Enough” by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all intelligence, both natural and artificial, and provides a suitable basis for the creation of artificial general intelligence. We contest the underlying assumption of Silver et al. that such reward can be scalar-valued. In this paper we explain why scalar rewards are insufficient to account for some aspects of both biological and computational intelligence, and argue in favour of explicitly multi-objective models...

10.1007/s10458-022-09575-5 article EN cc-by Autonomous Agents and Multi-Agent Systems 2022-07-16

Deep multi-objective reinforcement learning for utility-based infrastructural maintenance optimization

Jesse van Remmerden Maurice Kenter Diederik M. Roijers C.P. Andriotis Yingqian Zhang and 1 more

In this paper, we introduce multi-objective deep centralized multi-agent actor-critic (MO-DCMAC), a multi-objective reinforcement learning method for infrastructural maintenance optimization, an area traditionally dominated by single-objective reinforcement learning (RL) approaches. Previous single-objective RL methods combine multiple objectives, such as probability of collapse and cost, into a singular reward signal through reward-shaping. In contrast, MO-DCMAC can optimize a policy for multiple objectives directly, even when the utility function is...

10.1007/s00521-024-10954-0 article EN cc-by Neural Computing and Applications 2025-01-10

Multi-objective multi-agent decision making: a utility-based analysis and survey

Roxana Rădulescu Patrick Mannion Diederik M. Roijers Ann Nowé

10.1007/s10458-019-09433-x article EN Autonomous Agents and Multi-Agent Systems 2019-12-09

Learning directed locomotion in modular robots with evolvable morphologies

Gongjin Lan Matteo De Carlo Fuda van Diggelen Jakub M. Tomczak Diederik M. Roijers and 1 more

10.1016/j.asoc.2021.107688 article EN Applied Soft Computing 2021-07-09

Computing Convex Coverage Sets for Faster Multi-objective Coordination

Diederik M. Roijers Shimon Whiteson Frans A. Oliehoek

In this article, we propose new algorithms for multi-objective coordination graphs (MO-CoGs). Key to the efficiency of these algorithms is that they compute a convex coverage set (CCS) instead of a Pareto coverage set (PCS). Not only is a CCS a sufficient solution set for a large class of problems, it also has important characteristics that facilitate more efficient solutions. We propose two main algorithms for computing a CCS in MO-CoGs. Convex multi-objective variable elimination (CMOVE) computes a CCS by performing a series of agent eliminations, which can be seen as solving a series of local multi-objective subproblems....

10.1613/jair.4550 article EN cc-by Journal of Artificial Intelligence Research 2015-03-31

Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making

Luisa Zintgraf Diederik M. Roijers Sjoerd Linders Catholijn M. Jonker Ann Nowé

In multi-objective decision planning and learning, much attention is paid to producing optimal solution sets that contain an optimal policy for every possible user preference profile. We argue that the step that follows, i.e., determining which policy to execute by maximising the user's intrinsic utility function over this (possibly infinite) set, is under-studied. This paper aims to fill this gap. We build on previous work on Gaussian processes and pairwise comparisons for preference modelling, extend it to the multi-objective decision support scenario, and propose new ordered preference elicitation...

10.48550/arxiv.1802.07606 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Linear support for multi-objective coordination graphs

Diederik M. Roijers Shimon Whiteson Frans A. Oliehoek

Many real-world decision problems require making trade-offs among multiple objectives. However, in some cases, the relative importance of these objectives is not known when the problem is solved, precluding the use of single-objective methods. Instead, multi-objective methods, which compute the set of all potentially useful solutions, are required. This paper proposes variable elimination linear support (VELS), a new multi-objective algorithm for multi-agent coordination that exploits loose couplings to compute the convex coverage set (CCS):...

10.5555/2615731.2617454 article EN 2014-05-05

A utility-based analysis of equilibria in multi-objective normal-form games

Roxana Rădulescu Patrick Mannion Yijie Zhang Diederik M. Roijers Ann Nowé

In multi-objective multi-agent systems (MOMASs), agents explicitly consider the possible trade-offs between conflicting objective functions. We argue that compromises between competing objectives in MOMASs should be analyzed on the basis of the utility that these compromises have for the users of a system, where an agent’s utility function maps their payoff vectors to scalar utility values. This utility-based approach naturally leads to two different optimization criteria for agents in a MOMAS: expected scalarized returns (ESRs) and scalarized expected returns (SERs). In this article, we...

10.1017/s0269888920000351 article EN The Knowledge Engineering Review 2020-01-01