Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity
FOS: Computer and information sciences
Artificial Intelligence (cs.AI)
Machine Learning (cs.LG)
Robotics (cs.RO)
DOI:
10.13140/rg.2.2.21510.34884
Publication Date:
2023-01-01
AUTHORS (2)
ABSTRACT
Quality-Diversity is a branch of stochastic optimization that is often applied to problems from the Reinforcement Learning and control domains in order to construct repertoires of well-performing policies/skills that exhibit diversity with respect to a behavior space. Such archives are usually composed of a finite number of reactive agents, each associated with a unique behavior descriptor, and instantiating behavior descriptors outside of that coarsely discretized space is not straightforward. While a few recent works suggest solutions to that issue, the generated trajectory is not easily customizable beyond the specification of a target behavior descriptor. We propose to jointly solve those problems in environments where semantic information about static scene elements is available, by leveraging a Large Language Model to augment the repertoire with natural language descriptions of trajectories and training a policy conditioned on those descriptions. Thus, our method allows a user not only to specify an arbitrary target behavior descriptor, but also to provide the model with a high-level textual prompt to shape the generated trajectory. We also propose an LLM-based approach to evaluating the performance of such generative agents. Furthermore, we develop a benchmark based on simulated robot navigation in a 2D maze, which we use for experimental validation.

16 pages, 9 figures, 2 tables
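To make the conditioning interface described in the abstract concrete, the following is a minimal sketch, not the authors' Decision Transformer implementation, of a policy that receives an observation, a target behavior descriptor, and an embedding of a natural-language prompt. All names (DescribedPolicy, embed_text), dimensions, and the MLP architecture are hypothetical assumptions; the text encoder is stubbed with a seeded random embedding so the example stays self-contained, whereas the paper's setting would use an LLM or sentence encoder and a sequence model over trajectories.

```python
# Minimal sketch of a language- and behavior-descriptor-conditioned policy.
# Hypothetical names/dimensions; not the paper's architecture.
import torch
import torch.nn as nn


class DescribedPolicy(nn.Module):
    """Maps (observation, target behavior descriptor, text embedding) to an action."""

    def __init__(self, obs_dim: int, bd_dim: int, text_dim: int, act_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + bd_dim + text_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, act_dim),
            nn.Tanh(),  # bounded continuous actions
        )

    def forward(self, obs, target_bd, text_emb):
        # Concatenate all conditioning signals with the observation.
        return self.net(torch.cat([obs, target_bd, text_emb], dim=-1))


def embed_text(prompt: str, dim: int = 32) -> torch.Tensor:
    # Stand-in for an LLM / sentence-encoder embedding of the user's prompt
    # (seeded random vector so the sketch runs without external models).
    torch.manual_seed(abs(hash(prompt)) % (2**31))
    return torch.randn(dim)


if __name__ == "__main__":
    policy = DescribedPolicy(obs_dim=4, bd_dim=2, text_dim=32, act_dim=2)
    obs = torch.zeros(4)                      # e.g. (x, y, vx, vy) in a 2D maze
    target_bd = torch.tensor([0.8, 0.2])      # desired final position in behavior space
    text = embed_text("reach the goal while hugging the left wall")
    action = policy(obs, target_bd, text)
    print(action)
```

Under this reading, the user-facing interface is simply the pair (target behavior descriptor, textual prompt): the descriptor pins down where in behavior space the trajectory should end, while the prompt shapes how the trajectory gets there.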