default search action

combined dblp search
author search
venue search
publication search

ask others

Hongming Zhang 0003

> Home > Persons

Person information

affiliation: Chinese Academy of Sciences, Center for Research on Intelligent System and Engineering, Beijing, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-00913
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-00913
Hongming Zhang, Fengshuo Bai, Chenjun Xiao, Chao Gao, Bo Xu, Martin Müller:
β-DQN: Improving Deep Q-Learning By Evolving the Behavior. CoRR abs/2501.00913 (2025)
2024
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/KohankhakiAZWG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/KohankhakiAZWG024
Farnaz Kohankhaki, Kiarash Aghakasiri, Hongming Zhang, Ting-Han Wei, Chao Gao, Martin Müller:
Monte Carlo Tree Search in the Presence of Transition Uncertainty. AAAI 2024: 20151-20158
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0003XGW0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0003XGW0024
Hongming Zhang, Chenjun Xiao, Chao Gao, Han Wang, Bo Xu, Martin Müller:
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration. NeurIPS 2024
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18688
Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Ying Wen, Yaodong Yang, Bo Xu, Lei Han:
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation. CoRR abs/2405.18688 (2024)
2023
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BaiZTWW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BaiZTWW023
Fengshuo Bai, Hongming Zhang, Tianyang Tao, Zhiheng Wu, Yanna Wang, Bo Xu:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. AAAI 2023: 6728-6736
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZhangXW00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhangXW00023
Hongming Zhang, Chenjun Xiao, Han Wang, Jun Jin, Bo Xu, Martin Müller:
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-11348
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11348
Farnaz Kohankhaki, Kiarash Aghakasiri, Hongming Zhang, Ting-Han Wei, Chao Gao, Martin Müller:
Monte Carlo Tree Search in the Presence of Transition Uncertainty. CoRR abs/2312.11348 (2023)
2022
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08234
Jun Jin, Hongming Zhang, Jun Luo:
Build generally reusable agent-environment interaction models. CoRR abs/2211.08234 (2022)
2021
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DingYZHLGMD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DingYZHLGMD21
Zihan Ding, Tianyang Yu, Hongming Zhang, Yanhua Huang, Guo Li, Quancheng Guo, Luo Mai, Hao Dong:
Efficient Reinforcement Learning Development with RLzoo. ACM Multimedia 2021: 3759-3762
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-09889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-09889
Hongming Zhang, Ke Sun, Bo Xu, Linglong Kong, Martin Müller:
A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning. CoRR abs/2109.09889 (2021)
2020
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-08644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-08644
Zihan Ding, Tianyang Yu, Yanhua Huang, Hongming Zhang, Luo Mai, Hao Dong:
RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library. CoRR abs/2009.08644 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/ZhangC0CLW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/ZhangC0CLW19
Hongming Zhang, Fangjuan Cheng, Bo Xu, Feng Chen, Jiachen Liu, Wei Wu:
RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space. IJCNN 2019: 1-8
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-06758
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-06758
Jiancheng Long, Hongming Zhang, Tianyang Yu, Bo Xu:
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning. CoRR abs/1908.06758 (2019)
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-06502
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-06502
Cheng Zeng, Hongming Zhang:
A Logarithmic Barrier Method For Proximal Policy Optimization. CoRR abs/1812.06502 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.