"Approximate Q-learning and SARSA(0) under the ε-greedy Policy: a ..."

Aditya Gopalan, Gugan Thoppe (2022)

Details and statistics

DOI: 10.48550/ARXIV.2205.13617

access: open

type: Informal or Other Publication

metadata version: 2022-05-31