Edouard Leurent


2020 – today

2024
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangKSADMGLGDRF24
Kaiwen Wang, Rahul Kidambi, Ryan Sullivan, Alekh Agarwal, Christoph Dann, Andrea Michi, Marco Gelmi, Yunxuan Li, Raghav Gupta, Kumar Dubey, Alexandre Ramé, Johan Ferret, Geoffrey Cideron, Le Hou, Hongkun Yu, Amr Ahmed, Aranyak Mehta, Léonard Hussenot, Olivier Bachem, Edouard Leurent:
Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning. EMNLP (Findings) 2024: 2153-2186
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15762
Kaiwen Wang, Rahul Kidambi, Ryan Sullivan, Alekh Agarwal, Christoph Dann, Andrea Michi, Marco Gelmi, Yunxuan Li, Raghav Gupta, Avinava Dubey, Alexandre Ramé, Johan Ferret, Geoffrey Cideron, Le Hou, Hongkun Yu, Amr Ahmed, Aranyak Mehta, Léonard Hussenot, Olivier Bachem, Edouard Leurent:
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning. CoRR abs/2407.15762 (2024)
2023
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/nature/MankowitzMZGSPL23
Daniel J. Mankowitz, Andrea Michi, Anton Zhernov, Marco Gelmi, Marco Selvi, Cosmin Paduraru, Edouard Leurent, Shariq Iqbal, Jean-Baptiste Lespiau, Alex Ahern, Thomas Köppe, Kevin Millikin, Stephen Gaffney, Sophie Elster, Jackson Broshear, Chris Gamble, Kieran Milan, Robert Tung, Minjae Hwang, A. Taylan Cemgil, Mohammadamin Barekatain, Yujia Li, Amol Mandhane, Thomas Hubert, Julian Schrittwieser, Demis Hassabis, Pushmeet Kohli, Martin A. Riedmiller, Oriol Vinyals, David Silver:
Faster sorting algorithms discovered using deep reinforcement learning. Nat. 618(7964): 257-263 (2023)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-07440
Pengming Wang, Mikita Sazanovich, Berkin Ilbeyi, Phitchaya Mangpo Phothilimthana, Manish Purohit, Han Yang Tay, Ngân Vu, Miaosen Wang, Cosmin Paduraru, Edouard Leurent, Anton Zhernov, Julian Schrittwieser, Thomas Hubert, Robert Tung, Paula Kurylowicz, Kieran Milan, Oriol Vinyals, Daniel J. Mankowitz:
Optimizing Memory Mapping Using Deep Reinforcement Learning. CoRR abs/2305.07440 (2023)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-09175
Tom Zahavy, Vivek Veeriah, Shaobo Hou, Kevin Waugh, Matthew Lai, Edouard Leurent, Nenad Tomasev, Lisa Schut, Demis Hassabis, Satinder Singh:
Diversifying AI: Towards Creative Chess with AlphaZero. CoRR abs/2308.09175 (2023)
2022
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/robosoft/ScheggDCLSPD22
Pierre Schegg, Jérémie Dequidt, Eulalie Coevoet, Edouard Leurent, Rémi Sabatier, Philippe Preux, Christian Duriez:
Automated Planning for Robotic Guidewire Navigation in the Coronary Arteries. RoboSoft 2022: 239-246
2021
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/alt/KaufmannMDJLV21
Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson, Edouard Leurent, Michal Valko:
Adaptive Reward-Free Exploration. ALT 2021: 865-891
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MenardDJKLV21
Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson, Emilie Kaufmann, Edouard Leurent, Michal Valko:
Fast active learning for pure exploration in reinforcement learning. ICML 2021: 7599-7608
2020
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/hal/Leurent20
Edouard Leurent:
Safe and Efficient Reinforcement Learning for Behavioural Planning in Autonomous Driving. (Apprentissage par renforcement sûr et efficace pour la planification comportementale en conduite autonome). University of Lille, France, 2020
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/acml/LeurentM20
Edouard Leurent, Odalric-Ambrym Maillard:
Monte-Carlo Graph Search: the Value of Merging Similar States. ACML 2020: 577-592
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/cdc/LeurentEM20
Edouard Leurent, Denis V. Efimov, Odalric-Ambrym Maillard:
Robust-Adaptive Interval Predictive Control for Linear Uncertain Systems. CDC 2020: 1429-1434
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/nips/JonssonKMDLV20
Anders Jonsson, Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Edouard Leurent, Michal Valko:
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity. NeurIPS 2020
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeurentME20
Edouard Leurent, Odalric-Ambrym Maillard, Denis V. Efimov:
Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs. NeurIPS 2020
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-10816
Edouard Leurent, Denis V. Efimov, Odalric-Ambrym Maillard:
Robust Estimation, Prediction and Control with Linear Dynamics and Generic Costs. CoRR abs/2002.10816 (2020)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-05879
Anders Jonsson, Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Edouard Leurent, Michal Valko:
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity. CoRR abs/2006.05879 (2020)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-06294
Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson, Edouard Leurent, Michal Valko:
Adaptive Reward-Free Exploration. CoRR abs/2006.06294 (2020)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-10401
Edouard Leurent, Denis V. Efimov, Odalric-Ambrym Maillard:
Robust-Adaptive Interval Predictive Control for Linear Uncertain Systems. CoRR abs/2007.10401 (2020)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13442
Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson, Emilie Kaufmann, Edouard Leurent, Michal Valko:
Fast active learning for pure exploration in reinforcement learning. CoRR abs/2007.13442 (2020)

2010 – 2019

2019
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/cdc/LeurentERP19
Edouard Leurent, Denis V. Efimov, Tarek Raïssi, Wilfrid Perruquetti:
Interval Prediction for Continuous-Time Systems with Parametric Uncertainties. CDC 2019: 7049-7054
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/nips/CarraraLLUMP19
Nicolas Carrara, Edouard Leurent, Romain Laroche, Tanguy Urvoy, Odalric-Ambrym Maillard, Olivier Pietquin:
Budgeted Reinforcement Learning in Continuous State Space. NeurIPS 2019: 9295-9305
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/pkdd/LeurentM19
Edouard Leurent, Odalric-Ambrym Maillard:
Practical Open-Loop Optimistic Planning. ECML/PKDD (3) 2019: 69-85
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00220
Edouard Leurent, Yann Blanco, Denis V. Efimov, Odalric-Ambrym Maillard:
Approximate Robust Control of Uncertain Dynamical Systems. CoRR abs/1903.00220 (2019)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-01004
Nicolas Carrara, Edouard Leurent, Romain Laroche, Tanguy Urvoy, Odalric-Ambrym Maillard, Olivier Pietquin:
Scaling up budgeted reinforcement learning. CoRR abs/1903.01004 (2019)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04700
Edouard Leurent, Odalric-Ambrym Maillard:
Practical Open-Loop Optimistic Planning. CoRR abs/1904.04700 (2019)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04727
Edouard Leurent, Denis V. Efimov, Tarek Raïssi, Wilfrid Perruquetti:
Interval Prediction for Continuous-Time Systems with Parametric Uncertainties. CoRR abs/1904.04727 (2019)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-12250
Edouard Leurent, Jean Mercat:
Social Attention for Autonomous Decision-Making in Dense Traffic. CoRR abs/1911.12250 (2019)
