Nadav Merlis

2020 – today

2024
[c13]
persistent URL: https://dblp.org/rec/conf/aistats/BaudryMMRP24
Dorian Baudry, Nadav Merlis, Mathieu Benjamin Molina, Hugo Richard, Vianney Perchet:
Multi-armed bandits with guaranteed revenue per arm. AISTATS 2024: 379-387
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-11637
Nadav Merlis, Dorian Baudry, Vianney Perchet:
The Value of Reward Lookahead in Reinforcement Learning. CoRR abs/2403.11637 (2024)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-16581
Itai Shufaro, Nadav Merlis, Nir Weinberger, Shie Mannor:
On Bits and Bandits: Quantifying the Regret-Information Trade-off. CoRR abs/2405.16581 (2024)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-02258
Nadav Merlis:
Reinforcement Learning with Lookahead Information. CoRR abs/2406.02258 (2024)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-11316
Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet:
Improved Algorithms for Contextual Dynamic Pricing. CoRR abs/2406.11316 (2024)
2023
[c12]
persistent URL: https://dblp.org/rec/conf/atal/KhannaTMMT23
Pranav Khanna, Guy Tennenholtz, Nadav Merlis, Shie Mannor, Chen Tessler:
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning. AAMAS 2023: 2430-2432
[c11]
persistent URL: https://dblp.org/rec/conf/icml/MerlisRSOMP23
Nadav Merlis, Hugo Richard, Flore Sentenac, Corentin Odic, Mathieu Molina, Vianney Perchet:
On Preemption and Learning in Stochastic Scheduling. ICML 2023: 24478-24516
[c10]
persistent URL: https://dblp.org/rec/conf/icml/TennenholtzMSMB23
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
Reinforcement Learning with History Dependent Dynamic Contexts. ICML 2023: 34011-34053
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-02061
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
Reinforcement Learning with History-Dependent Dynamic Contexts. CoRR abs/2302.02061 (2023)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-18333
Guy Tennenholtz, Martin Mladenov, Nadav Merlis, Craig Boutilier:
Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics. CoRR abs/2305.18333 (2023)
2022
[c9]
persistent URL: https://dblp.org/rec/conf/nips/TennenholtzMSMS22
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. NeurIPS 2022
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-15376
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. CoRR abs/2205.15376 (2022)
2021
[c8]
persistent URL: https://dblp.org/rec/conf/aaai/EfroniMM21
Yonathan Efroni, Nadav Merlis, Shie Mannor:
Reinforcement Learning with Trajectory Feedback. AAAI 2021: 7288-7295
[c7]
persistent URL: https://dblp.org/rec/conf/aaai/MerlisM21
Nadav Merlis, Shie Mannor:
Lenient Regret for Multi-Armed Bandits. AAAI 2021: 8950-8957
[c6]
persistent URL: https://dblp.org/rec/conf/icml/EfroniMSM21
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. ICML 2021: 2937-2947
[c5]
persistent URL: https://dblp.org/rec/conf/icml/PeerTMM21
Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir:
Ensemble Bootstrapping for Q-Learning. ICML 2021: 8454-8463
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-03400
Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. CoRR abs/2102.03400 (2021)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-00445
Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir:
Ensemble Bootstrapping for Q-Learning. CoRR abs/2103.00445 (2021)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-05724
Nadav Merlis, Yonathan Efroni, Shie Mannor:
Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits. CoRR abs/2110.05724 (2021)
2020
[c4]
persistent URL: https://dblp.org/rec/conf/colt/MerlisM20
Nadav Merlis, Shie Mannor:
Tight Lower Bounds for Combinatorial Multi-Armed Bandits. COLT 2020: 2830-2857
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2002-05392
Nadav Merlis, Shie Mannor:
Tight Lower Bounds for Combinatorial Multi-Armed Bandits. CoRR abs/2002.05392 (2020)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-03959
Nadav Merlis, Shie Mannor:
Lenient Regret for Multi-Armed Bandits. CoRR abs/2008.03959 (2020)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2008-06036
Yonathan Efroni, Nadav Merlis, Shie Mannor:
Reinforcement Learning with Trajectory Feedback. CoRR abs/2008.06036 (2020)

2010 – 2019

2019
[c3]
persistent URL: https://dblp.org/rec/conf/colt/MerlisM19
Nadav Merlis, Shie Mannor:
Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem. COLT 2019: 2465-2489
[c2]
persistent URL: https://dblp.org/rec/conf/nips/EfroniMGM19
Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. NeurIPS 2019: 12203-12213
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-03125
Nadav Merlis, Shie Mannor:
Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem. CoRR abs/1905.03125 (2019)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-11527
Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. CoRR abs/1905.11527 (2019)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-01062
Chen Tessler, Nadav Merlis, Shie Mannor:
Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients. CoRR abs/1910.01062 (2019)
2018
[c1]
persistent URL: https://dblp.org/rec/conf/nips/ZahavyHMMM18
Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J. Mankowitz, Shie Mannor:
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning. NeurIPS 2018: 3566-3577
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1809-02121
Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J. Mankowitz, Shie Mannor:
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning. CoRR abs/1809.02121 (2018)
