![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
Neurocomputing, Volume 263
Volume 263, November 2017
- Madalina M. Drugan
, Marco A. Wiering
, Peter Vamplew
, Madhu Chetty
:
Special issue on multi-objective reinforcement learning. 1-2
- Simone Parisi, Matteo Pirotta, Jan Peters:
Manifold-based multi-objective policy search with sample reuse. 3-14 - Manuela Ruiz-Montiel, Lawrence Mandow, José-Luis Pérez-de-la-Cruz:
A temporal difference method for multi-objective reinforcement learning. 15-25 - Peter Vamplew
, Rustam Issabekov, Richard Dazeley
, Cameron Foale
, Adam Berry
, Tim Moore
, Douglas C. Creighton:
Steering approaches to Pareto-optimal multiobjective reinforcement learning. 26-38
- Thommen George Karimpanal
, Erik Wilhelm
:
Identification and off-policy learning of multiple objectives using adaptive clustering. 39-47 - Tim Brys, Anna Harutyunyan, Peter Vrancx, Ann Nowé
, Matthew E. Taylor:
Multi-objectivization and ensembles of shapings in reinforcement learning. 48-59
- Patrick Mannion
, Sam Devlin
, Karl Mason
, Jim Duggan, Enda Howley
:
Policy invariance under reward transformations for multi-objective reinforcement learning. 60-73 - Peter Vamplew
, Richard Dazeley
, Cameron Foale
:
Softmax exploration strategies for multiobjective reinforcement learning. 74-86
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.