default search action
Machine Learning, Volume 49, 2002
Volume 49, Number 1, October 2002
- Fredrik A. Dahl:
The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information. 5-37 - Michael D. Lee:
A Simple Method for Generating Additive Clustering Models with Limited Complexity. 39-58 - Shaul Markovitch, Dan Rosenstein:
Feature Generation Using General Constructor Functions. 59-98
Volume 49, Number 2-3, November-December 2002
- Satinder Singh:
Introduction. 107-109 - Hui Tong, Timothy X. Brown:
Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks. 111-139 - Amy McGovern, J. Eliot B. Moss, Andrew G. Barto:
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts. 141-160 - Dirk Ormoneit, Saunak Sen:
Kernel-Based Reinforcement Learning. 161-178 - John N. Tsitsiklis, Benjamin Van Roy:
On Average Versus Discounted Reward Temporal-Difference Learning. 179-191 - Michael J. Kearns, Yishay Mansour, Andrew Y. Ng:
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes. 193-208 - Michael J. Kearns, Satinder Singh:
Near-Optimal Reinforcement Learning in Polynomial Time. 209-232 - Justin A. Boyan:
Technical Update: Least-Squares Temporal Difference Learning. 233-246 - José del R. Millán, Daniele Posenato, Eric Dedieu:
Continuous-Action Q-Learning. 247-265 - Oliver Mihatsch, Ralph Neuneier:
Risk-Sensitive Reinforcement Learning. 267-290 - Rémi Munos, Andrew W. Moore:
Variable Resolution Discretization in Optimal Control. 291-323 - David J. Foster, Peter Dayan:
Structure in the Space of Value Functions. 325-346
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.