😼Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

https://arxiv.org/abs/2307.15217

Last updated