


default search action
"Policy Evaluation and Temporal-Difference Learning in Continuous Time and ..."
Yanwei Jia, Xun Yu Zhou (2022)
- Yanwei Jia, Xun Yu Zhou:
Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach. J. Mach. Learn. Res. 23: 154:1-154:55 (2022)

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.