"Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming."

Dimitri P. Bertsekas, Huizhen Yu (2012)

DOI: 10.1287/MOOR.1110.0532

access: closed

type: Journal Article

metadata version: 2022-10-02