Michael Gimelfarb

Mike Gimelfarb

Journal Articles

2024
[j1]
Ayal Taitler, Ron Alford, Joan Espasa, Gregor Behnke, Daniel Fiser, Michael Gimelfarb, Florian Pommerening, Scott Sanner, Enrico Scala, Dominik Schreiber, Javier Segovia-Aguas, Jendrik Seipp:
The 2023 International Planning Competition. AI Mag. 45(2): 280-296 (2024)

Conference and Workshop Papers

2024
[c8]
Michael Gimelfarb, Ayal Taitler, Scott Sanner:
JaxPlan and GurobiPlan: Optimization Baselines for Replanning in Discrete and Mixed Discrete-Continuous Probabilistic Domains. ICAPS 2024: 230-238
2023
[c7]
Jihwan Jeong, Xiaoyu Wang, Michael Gimelfarb, Hyunwoo Kim, Baher Abdulhai, Scott Sanner:
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization. ICLR 2023
2022
[c6]
Noah Patton, Jihwan Jeong, Mike Gimelfarb, Scott Sanner:
A Distributional Framework for Risk-Sensitive End-to-End Planning in Continuous MDPs. AAAI 2022: 9894-9901
2021
[c5]
Mike Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Bayesian Experience Reuse for Learning from Multiple Demonstrators. IJCAI 2021: 2425-2431
[c4]
Michael Gimelfarb, André Barreto, Scott Sanner, Chi-Guhn Lee:
Risk-Aware Transfer in Reinforcement Learning using Successor Features. NeurIPS 2021: 17298-17310
[c3]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Contextual policy transfer in reinforcement learning domains via deep mixtures-of-experts. UAI 2021: 1787-1797
2019
[c2]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Epsilon-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy Exploration in Model-Free Reinforcement Learning. UAI 2019: 476-485
2018
[c1]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Reinforcement Learning with Multiple Experts: A Bayesian Model Combination Approach. NeurIPS 2018: 9549-9559

Informal and Other Publications

2024
[i9]
Michael Gimelfarb, Ayal Taitler, Scott Sanner:
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs. CoRR abs/2401.12243 (2024)
2023
[i8]
Michael Gimelfarb, Michael Jong Kim:
Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions. CoRR abs/2305.07844 (2023)
2022
[i7]
Jihwan Jeong, Xiaoyu Wang, Michael Gimelfarb, Hyunwoo Kim, Baher Abdulhai, Scott Sanner:
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization. CoRR abs/2210.03802 (2022)
[i6]
Ayal Taitler, Michael Gimelfarb, Sriram Gopalakrishnan, Xiaotian Liu, Scott Sanner:
pyRDDLGym: From RDDL to Gym Environments. CoRR abs/2211.05939 (2022)
2021
[i5]
Michael Gimelfarb, André Barreto, Scott Sanner, Chi-Guhn Lee:
Risk-Aware Transfer in Reinforcement Learning using Successor Features. CoRR abs/2105.14127 (2021)
[i4]
Noah Patton, Jihwan Jeong, Michael Gimelfarb, Scott Sanner:
RAPTOR: End-to-end Risk-Aware MDP Planning and Policy Learning by Backpropagation. CoRR abs/2106.07260 (2021)
2020
[i3]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Contextual Policy Reuse using Deep Mixture Models. CoRR abs/2003.00203 (2020)
[i2]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
Bayesian Experience Reuse for Learning from Multiple Demonstrators. CoRR abs/2006.05725 (2020)
[i1]
Michael Gimelfarb, Scott Sanner, Chi-Guhn Lee:
ε-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy Exploration in Model-Free Reinforcement Learning. CoRR abs/2007.00869 (2020)
