Andrea Zanette

2020 – today

2024
[c18]
  - https://dblp.org/rec/conf/icml/ZhouZPLK24
Yifei Zhou, Andrea Zanette, Jiayi Pan, Sergey Levine, Aviral Kumar:
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL. ICML 2024
[c17]
  - https://dblp.org/rec/conf/nips/SunHZYQYWBZ24
Hanshi Sun, Momin Haider, Ruiqi Zhang, Huitao Yang, Jiahao Qiu, Ming Yin, Mengdi Wang, Peter L. Bartlett, Andrea Zanette:
Fast Best-of-N Decoding via Speculative Rejection. NeurIPS 2024
[i17]
  - https://dblp.org/rec/journals/corr/abs-2402-15703
Ruiqi Zhang, Yuexiang Zhai, Andrea Zanette:
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement. CoRR abs/2402.15703 (2024)
[i16]
  - https://dblp.org/rec/journals/corr/abs-2402-19446
Yifei Zhou, Andrea Zanette, Jiayi Pan, Sergey Levine, Aviral Kumar:
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL. CoRR abs/2402.19446 (2024)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2410-20290
Hanshi Sun, Momin Haider, Ruiqi Zhang, Huitao Yang, Jiahao Qiu, Ming Yin, Mengdi Wang, Peter L. Bartlett, Andrea Zanette:
Fast Best-of-N Decoding via Speculative Rejection. CoRR abs/2410.20290 (2024)
2023
[c16]
  - https://dblp.org/rec/conf/icml/Zanette23
Andrea Zanette:
When is Realizability Sufficient for Off-Policy Reinforcement Learning? ICML 2023: 40637-40668
[c15]
  - https://dblp.org/rec/conf/nips/ZhangZ23
Ruiqi Zhang, Andrea Zanette:
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data. NeurIPS 2023
[i14]
  - https://dblp.org/rec/journals/corr/abs-2307-04354
Ruiqi Zhang, Andrea Zanette:
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data. CoRR abs/2307.04354 (2023)
2022
[c14]
  - https://dblp.org/rec/conf/icml/ZanetteW22
Andrea Zanette, Martin J. Wainwright:
Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning. ICML 2022: 25920-25954
[c13]
  - https://dblp.org/rec/conf/nips/ZanetteW22
Andrea Zanette, Martin J. Wainwright:
Bellman Residual Orthogonalization for Offline Reinforcement Learning. NeurIPS 2022
[i13]
  - https://dblp.org/rec/journals/corr/abs-2203-12786
Andrea Zanette, Martin J. Wainwright:
Bellman Residual Orthogonalization for Offline Reinforcement Learning. CoRR abs/2203.12786 (2022)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2206-00796
Andrea Zanette, Martin J. Wainwright:
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning. CoRR abs/2206.00796 (2022)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2211-05311
Andrea Zanette:
When is Realizability Sufficient for Off-Policy Reinforcement Learning? CoRR abs/2211.05311 (2022)
2021
[c12]
  - https://dblp.org/rec/conf/colt/ZanetteCA21
Andrea Zanette, Ching-An Cheng, Alekh Agarwal:
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation. COLT 2021: 4473-4525
[c11]
  - https://dblp.org/rec/conf/icml/Zanette21
Andrea Zanette:
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL. ICML 2021: 12287-12297
[c10]
  - https://dblp.org/rec/conf/nips/ZanetteWB21
Andrea Zanette, Martin J. Wainwright, Emma Brunskill:
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning. NeurIPS 2021: 13626-13640
[c9]
  - https://dblp.org/rec/conf/nips/ZanetteDLB21
Andrea Zanette, Kefan Dong, Jonathan N. Lee, Emma Brunskill:
Design of Experiments for Stochastic Contextual Linear Bandits. NeurIPS 2021: 22720-22731
[i10]
  - https://dblp.org/rec/journals/corr/abs-2103-12923
Andrea Zanette, Ching-An Cheng, Alekh Agarwal:
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation. CoRR abs/2103.12923 (2021)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2107-09912
Andrea Zanette, Kefan Dong, Jonathan N. Lee, Emma Brunskill:
Design of Experiments for Stochastic Contextual Linear Bandits. CoRR abs/2107.09912 (2021)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2108-08812
Andrea Zanette, Martin J. Wainwright, Emma Brunskill:
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning. CoRR abs/2108.08812 (2021)
2020
[c8]
  - https://dblp.org/rec/conf/aistats/ZanetteBBPL20
Andrea Zanette, David Brandfonbrener, Emma Brunskill, Matteo Pirotta, Alessandro Lazaric:
Frequentist Regret Bounds for Randomized Least-Squares Value Iteration. AISTATS 2020: 1954-1964
[c7]
  - https://dblp.org/rec/conf/icml/ZanetteLKB20
Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill:
Learning Near Optimal Policies with Low Inherent Bellman Error. ICML 2020: 10978-10989
[c6]
  - https://dblp.org/rec/conf/nips/ZanetteLKB20
Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill:
Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration. NeurIPS 2020
[i7]
  - https://dblp.org/rec/journals/corr/abs-2003-00153
Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill:
Learning Near Optimal Policies with Low Inherent Bellman Error. CoRR abs/2003.00153 (2020)
[i6]
  - https://dblp.org/rec/journals/corr/abs-2008-07737
Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill:
Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration. CoRR abs/2008.07737 (2020)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2012-08005
Andrea Zanette:
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL. CoRR abs/2012.08005 (2020)

2010 – 2019

2019
[c5]
  - https://dblp.org/rec/conf/icml/ZanetteB19
Andrea Zanette, Emma Brunskill:
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds. ICML 2019: 7304-7312
[c4]
  - https://dblp.org/rec/conf/nips/ZanetteLKB19
Andrea Zanette, Alessandro Lazaric, Mykel J. Kochenderfer, Emma Brunskill:
Limiting Extrapolation in Linear Approximate Value Iteration. NeurIPS 2019: 5616-5625
[c3]
  - https://dblp.org/rec/conf/nips/ZanetteKB19
Andrea Zanette, Mykel J. Kochenderfer, Emma Brunskill:
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model. NeurIPS 2019: 5626-5635
[i4]
  - https://dblp.org/rec/journals/corr/abs-1901-00210
Andrea Zanette, Emma Brunskill:
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds. CoRR abs/1901.00210 (2019)
[i3]
  - https://dblp.org/rec/journals/corr/abs-1911-00567
Andrea Zanette, David Brandfonbrener, Matteo Pirotta, Alessandro Lazaric:
Frequentist Regret Bounds for Randomized Least-Squares Value Iteration. CoRR abs/1911.00567 (2019)
[i2]
  - https://dblp.org/rec/journals/corr/abs-1911-00954
Andrea Zanette, Emma Brunskill:
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs. CoRR abs/1911.00954 (2019)
2018
[c2]
  - https://dblp.org/rec/conf/icml/ZanetteB18
Andrea Zanette, Emma Brunskill:
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs. ICML 2018: 5732-5740
[c1]
  - https://dblp.org/rec/conf/pkdd/ZanetteZK18
Andrea Zanette, Junzi Zhang, Mykel J. Kochenderfer:
Robust Super-Level Set Estimation Using Gaussian Processes. ECML/PKDD (2) 2018: 276-291
[i1]
  - https://dblp.org/rec/journals/corr/abs-1811-09977
Andrea Zanette, Junzi Zhang, Mykel J. Kochenderfer:
Robust Super-Level Set Estimation using Gaussian Processes. CoRR abs/1811.09977 (2018)
