"Open Problems and Fundamental Limitations of Reinforcement Learning from ..."

Stephen Casper et al. (2023)

> Home

Details and statistics

DOI: —

access: open

type: Journal Article

metadata version: 2024-08-02

- view
  - electronic edition @ openreview.net (open access)
- export record
  dblp key:
  - journals/tmlr/CasperDSGSRFKLF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/CasperDSGSRFKLF23
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. Trans. Mach. Learn. Res. 2023 (2023)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.