"A novel Q-learning algorithm with function approximation for constrained ..."
K. Lakshmanan, Shalabh Bhatnagar (2012)
- K. Lakshmanan, Shalabh Bhatnagar: A novel Q-learning algorithm with function approximation for constrained Markov decision processes. Allerton Conference 2012: 400-405