Ming Yin 0003
Person information
- affiliation: University of California, Santa Barbara, CA, USA
Other persons with the same name
- Ming Yin — disambiguation page
- Ming Yin 0001 — Purdue University, West Lafayette, IN, USA
- Ming Yin 0002 — Guangdong University of Technology, School of Automation, Guangzhou, China (and 1 more)
- Ming Yin 0004 — Sichuan University, Chengdu, China
- Ming Yin 0005 — Northwestern Polytechnical University, Xi'an, China
- Ming Yin 0006 — University of Science and Technology of China, Hefei, China
- Ming Yin 0007 — Chengdu University of Technology, Chengdu, China
2020 – today
2024
- [j1] Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Toward General Function Approximation in Nonstationary Reinforcement Learning. IEEE J. Sel. Areas Inf. Theory 5: 190-206 (2024)
- [c15] Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games. ICML 2024
- [c14] Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Towards General Function Approximation in Nonstationary Reinforcement Learning. ISIT 2024: 1-6
- [i20] Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup: Offline Multitask Representation Learning for Reinforcement Learning. CoRR abs/2403.11574 (2024)
- [i19] Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang: Transfer Q Star: Principled Decoding for LLM Alignment. CoRR abs/2405.20495 (2024)
- [i18] Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei: Relative-Translation Invariant Wasserstein Distance. CoRR abs/2409.02416 (2024)

2023
- [c13] Ming Yin, Mengdi Wang, Yu-Xiang Wang: Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient. ICLR 2023
- [c12] Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Non-stationary Reinforcement Learning under General Function Approximation. ICML 2023: 9976-10007
- [c11] Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang: Offline Reinforcement Learning with Closed-Form Policy Improvement Operators. ICML 2023: 20485-20528
- [c10] Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yian Ma: Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation. NeurIPS 2023
- [c9] Chong Liu, Ming Yin, Yu-Xiang Wang: No-Regret Linear Bandits beyond Realizability. UAI 2023: 1294-1303
- [i17] Dan Qiao, Ming Yin, Yu-Xiang Wang: Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs. CoRR abs/2302.12456 (2023)
- [i16] Chong Liu, Ming Yin, Yu-Xiang Wang: No-Regret Linear Bandits beyond Realizability. CoRR abs/2302.13252 (2023)
- [i15] Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Non-stationary Reinforcement Learning under General Function Approximation. CoRR abs/2306.00861 (2023)
- [i14] Sunil Madhow, Dan Xiao, Ming Yin, Yu-Xiang Wang: Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data. CoRR abs/2306.14063 (2023)
- [i13] Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang: Model-Free Algorithm with Improved Sample Efficiency for Zero-Sum Markov Games. CoRR abs/2308.08858 (2023)
- [i12] Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma: Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation. CoRR abs/2310.18919 (2023)

2022
- [c8] Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang: Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism. ICLR 2022
- [c7] Dan Qiao, Ming Yin, Ming Min, Yu-Xiang Wang: Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost. ICML 2022: 18031-18061
- [c6] Ming Yin, Wenjing Chen, Mengdi Wang, Yu-Xiang Wang: Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality. UAI 2022: 2278-2288
- [i11] Dan Qiao, Ming Yin, Ming Min, Yu-Xiang Wang: Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost. CoRR abs/2202.06385 (2022)
- [i10] Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang: Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism. CoRR abs/2203.05804 (2022)
- [i9] Ming Yin, Wenjing Chen, Mengdi Wang, Yu-Xiang Wang: Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality. CoRR abs/2206.04921 (2022)
- [i8] Kaiqi Zhang, Ming Yin, Yu-Xiang Wang: Why Quantization Improves Generalization: NTK of Binary Weight Neural Networks. CoRR abs/2206.05916 (2022)
- [i7] Ming Yin, Mengdi Wang, Yu-Xiang Wang: Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient. CoRR abs/2210.00750 (2022)
- [i6] Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang: Offline Reinforcement Learning with Closed-Form Policy Improvement Operators. CoRR abs/2211.15956 (2022)

2021
- [c5] Ming Yin, Yu Bai, Yu-Xiang Wang: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning. AISTATS 2021: 1567-1575
- [c4] Ming Yin, Yu-Xiang Wang: Towards Instance-Optimal Offline Reinforcement Learning with Pessimism. NeurIPS 2021: 4065-4078
- [c3] Ming Yin, Yu Bai, Yu-Xiang Wang: Near-Optimal Offline Reinforcement Learning via Double Variance Reduction. NeurIPS 2021: 7677-7688
- [c2] Ming Yin, Yu-Xiang Wang: Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings. NeurIPS 2021: 12890-12903
- [i5] Ming Yin, Yu Bai, Yu-Xiang Wang: Near-Optimal Offline Reinforcement Learning via Double Variance Reduction. CoRR abs/2102.01748 (2021)
- [i4] Ming Yin, Yu-Xiang Wang: Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings. CoRR abs/2105.06029 (2021)
- [i3] Ming Yin, Yu-Xiang Wang: Towards Instance-Optimal Offline Reinforcement Learning with Pessimism. CoRR abs/2110.08695 (2021)

2020
- [c1] Ming Yin, Yu-Xiang Wang: Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning. AISTATS 2020: 3948-3958
- [i2] Ming Yin, Yu-Xiang Wang: Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning. CoRR abs/2001.10742 (2020)
- [i1] Ming Yin, Yu Bai, Yu-Xiang Wang: Near Optimal Provable Uniform Convergence in Off-Policy Evaluation for Reinforcement Learning. CoRR abs/2007.03760 (2020)
last updated on 2025-01-21 00:20 CET by the dblp team
all metadata released as open data under CC0 1.0 license