"A Reinforcement Learning Method for Maximizing Undiscounted Rewards."

Anton Schwartz (1993)

DOI: 10.1016/B978-1-55860-307-3.50045-9

access: closed

type: Conference or Workshop Paper

metadata version: 2019-06-24