Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

FOS: Computer and information sciences; Computer Science - Machine Learning; Computer Science - Computation and Language; Artificial Intelligence (cs.AI); Computer Science - Artificial Intelligence; Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI: 10.3929/ethz-b-000651806
Publication Date: 2023-01-01