Yuanheng Zhu
2020 – today
- 2024
- [j32]Minsong Liu, Yuanheng Zhu, Yaran Chen, Dongbin Zhao:
Enhancing Reinforcement Learning via Transformer-Based State Predictive Representations. IEEE Trans. Artif. Intell. 5(9): 4364-4375 (2024)
- [j31]Haoran Li, Yaocheng Zhang, Haowei Wen, Yuanheng Zhu, Dongbin Zhao:
Stabilizing Diffusion Model for Robotic Control With Dynamic Programming and Transition Feasibility. IEEE Trans. Artif. Intell. 5(9): 4585-4594 (2024)
- [j30]Boyu Li, Haoran Li, Yuanheng Zhu, Dongbin Zhao:
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning. IEEE Trans. Cogn. Dev. Syst. 16(4): 1611-1621 (2024)
- [j29]Guangzheng Hu, Yuanheng Zhu, Haoran Li, Dongbin Zhao:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game. IEEE Trans. Emerg. Top. Comput. Intell. 8(6): 4033-4045 (2024)
- [c24]Jiajun Chai, Yuqian Fu, Dongbin Zhao, Yuanheng Zhu:
Aligning Credit for Multi-Agent Cooperation via Model-based Counterfactual Imagination. AAMAS 2024: 281-289
- [i13]Guangzheng Hu, Yuanheng Zhu, Haoran Li, Dongbin Zhao:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game. CoRR abs/2402.00738 (2024)
- [i12]Yuanyang Zhu, Zhi Wang, Yuanheng Zhu, Chunlin Chen, Dongbin Zhao:
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning. CoRR abs/2408.00309 (2024)
- [i11]Zhi Wang, Li Zhang, Wenhao Wu, Yuanheng Zhu, Dongbin Zhao, Chunlin Chen:
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement. CoRR abs/2410.11448 (2024)
- 2023
- [j28]Yuanheng Zhu, Dongbin Zhao:
Vision-based control in the open racing car simulator with deep and reinforcement learning. J. Ambient Intell. Humaniz. Comput. 14(12): 15673-15685 (2023)
- [j27]Minsong Liu, Luntong Li, Shuai Hao, Yuanheng Zhu, Dongbin Zhao:
Soft Contrastive Learning With Q-Irrelevance Abstraction for Reinforcement Learning. IEEE Trans. Cogn. Dev. Syst. 15(3): 1463-1473 (2023)
- [j26]Zhentao Tang, Yuanheng Zhu, Dongbin Zhao, Simon M. Lucas:
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition. IEEE Trans. Games 15(1): 5-15 (2023)
- [j25]Yuanheng Zhu, Weifan Li, Mengchen Zhao, Jianye Hao, Dongbin Zhao:
Empirical Policy Optimization for n-Player Markov Games. IEEE Trans. Cybern. 53(10): 6443-6455 (2023)
- [j24]Jiajun Chai, Weifan Li, Yuanheng Zhu, Dongbin Zhao, Zhe Ma, Kewu Sun, Jishiyu Ding:
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios. IEEE Trans. Neural Networks Learn. Syst. 34(4): 2093-2104 (2023)
- [j23]Guangzheng Hu, Yuanheng Zhu, Dongbin Zhao, Mengchen Zhao, Jianye Hao:
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 34(8): 3966-3978 (2023)
- [j22]Jiajun Chai, Wenzhang Chen, Yuanheng Zhu, Zong-xin Yao, Dongbin Zhao:
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat. IEEE Trans. Syst. Man Cybern. Syst. 53(9): 5417-5429 (2023)
- [c23]Yuming Chen, Yuanheng Zhu:
Policy Representation Opponent Shaping via Contrastive Learning. ICONIP (9) 2023: 124-135
- [c22]Guangzheng Hu, Haoran Li, Shasha Liu, Yuanheng Zhu, Dongbin Zhao:
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks. IJCNN 2023: 1-8
- [c21]Weifan Li, Yuanheng Zhu, Dongbin Zhao:
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning. IJCNN 2023: 1-8
- [i10]Guangzheng Hu, Haoran Li, Shasha Liu, Mingjun Ma, Yuanheng Zhu, Dongbin Zhao:
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks. CoRR abs/2303.12319 (2023)
- [i9]Runyu Lu, Yuanheng Zhu, Dongbin Zhao:
Score-Based Equilibrium Learning in Multi-Player Finite Games with Imperfect Information. CoRR abs/2306.00350 (2023)
- 2022
- [j21]Yuanheng Zhu, Dongbin Zhao:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games. IEEE Trans. Neural Networks Learn. Syst. 33(3): 1228-1241 (2022)
- [j20]Xiong Yang, Yuanheng Zhu, Na Dong, Qinglai Wei:
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs. IEEE Trans. Neural Networks Learn. Syst. 33(10): 5830-5844 (2022)
- [c20]Yuqian Fu, Jiajun Chai, Yuanheng Zhu, Dongbin Zhao:
LILAC: Learning a Leader for Cooperative Reinforcement Learning. CoG 2022: 49-55
- [c19]Luntong Li, Zhiming Zhou, Jiajun Chai, Zhen Liu, Yuanheng Zhu, Jianqiang Yi:
Learning Continuous 3-DoF Air-to-Air Close-in Combat Strategy using Proximal Policy Optimization. CoG 2022: 616-619
- [i8]Jiajun Chai, Weifan Li, Yuanheng Zhu, Dongbin Zhao, Zhe Ma, Kewu Sun, Jishiyu Ding:
UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios. CoRR abs/2203.14477 (2022)
- [i7]Jiajun Chai, Yuanheng Zhu, Dongbin Zhao:
NVIF: Neighboring Variational Information Flow for Large-Scale Cooperative Multi-Agent Scenarios. CoRR abs/2207.00964 (2022)
- [i6]Jiajun Chai, Wenzhang Chen, Yuanheng Zhu, Zong-xin Yao, Dongbin Zhao:
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat. CoRR abs/2212.03830 (2022)
- 2021
- [j19]Yuanheng Zhu, Dongbin Zhao, Haibo He:
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors. IEEE Trans Autom. Sci. Eng. 18(3): 1097-1108 (2021)
- [c18]Rongqin Liang, Yuanheng Zhu, Zhentao Tang, Mu Yang, Xiaolong Zhu:
Proximal Policy Optimization with Elo-based Opponent Selection and Combination with Enhanced Rolling Horizon Evolution Algorithm. CoG 2021: 1-4
- [i5]Yuanheng Zhu, Dongbin Zhao, Mengchen Zhao, Dong Li:
Empirical Policy Optimization for n-Player Markov Games. CoRR abs/2110.08979 (2021)
- 2020
- [j18]Yuanheng Zhu, Haibo He, Dongbin Zhao:
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control. IEEE Trans. Intell. Transp. Syst. 21(11): 4516-4525 (2020)
- [j17]Yuanheng Zhu, Dongbin Zhao, Haibo He:
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control. IEEE Trans. Syst. Man Cybern. Syst. 50(11): 3959-3971 (2020)
- [j16]Yuanheng Zhu, Dongbin Zhao, Haibo He:
Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies. IEEE Trans. Veh. Technol. 69(4): 3615-3627 (2020)
- [c17]Minsong Liu, Yuanheng Zhu, Dongbin Zhao:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game. IJCNN 2020: 1-7
- [c16]Kun Shao, Yuanheng Zhu, Zhentao Tang, Dongbin Zhao:
Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward. IJCNN 2020: 1-8
- [i4]Zhentao Tang, Yuanheng Zhu, Dongbin Zhao, Simon M. Lucas:
Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition. CoRR abs/2003.13949 (2020)
- [i3]Guangzheng Hu, Yuanheng Zhu, Dongbin Zhao, Mengchen Zhao, Jianye Hao:
Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint. CoRR abs/2010.04978 (2020)
2010 – 2019
- 2019
- [j15]Yuanheng Zhu, Dongbin Zhao, Zhiguang Zhong:
Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics. IEEE Trans. Control. Syst. Technol. 27(4): 1772-1779 (2019)
- [j14]Kun Shao, Yuanheng Zhu, Dongbin Zhao:
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning. IEEE Trans. Emerg. Top. Comput. Intell. 3(1): 73-84 (2019)
- [j13]Yuanheng Zhu, Dongbin Zhao, Xiangjun Li, Ding Wang:
Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems. IEEE Trans. Smart Grid 10(4): 4235-4244 (2019)
- [c15]Yuanheng Zhu, Haibo He, Dongbin Zhao, Zhongsheng Hou:
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming. IJCNN 2019: 1-8
- [c14]Weifan Li, Yuanheng Zhu, Dongbin Zhao:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games. SSCI 2019: 57-63
- [i2]Kun Shao, Zhentao Tang, Yuanheng Zhu, Nannan Li, Dongbin Zhao:
A Survey of Deep Reinforcement Learning in Video Games. CoRR abs/1912.10944 (2019)
- 2018
- [j12]Yuanheng Zhu, Dongbin Zhao:
Comprehensive comparison of online ADP algorithms for continuous-time optimal control. Artif. Intell. Rev. 49(4): 531-547 (2018)
- [j11]Yuanheng Zhu, Dongbin Zhao, Xiong Yang, Qichao Zhang:
Policy Iteration for H∞ Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming. IEEE Trans. Cybern. 48(2): 500-509 (2018)
- [c13]Kun Shao, Dongbin Zhao, Nannan Li, Yuanheng Zhu:
Learning Battles in ViZDoom via Deep Reinforcement Learning. CIG 2018: 1-4
- [c12]Yuanheng Zhu, Dongbin Zhao:
Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator. ICONIP (3) 2018: 326-334
- [c11]Kun Shao, Dongbin Zhao, Yuanheng Zhu, Qichao Zhang:
Visual Navigation with Actor-Critic Deep Reinforcement Learning. IJCNN 2018: 1-6
- [c10]Zhentao Tang, Kun Shao, Yuanheng Zhu, Dong Li, Dongbin Zhao, Tingwen Huang:
A Review of Computational Intelligence for StarCraft AI. SSCI 2018: 1167-1173
- [c9]Dong Li, Dongbin Zhao, Qichao Zhang, Yuanheng Zhu:
An Autonomous Driving Experience Platform with Learning-Based Functions. SSCI 2018: 1174-1179
- [i1]Kun Shao, Yuanheng Zhu, Dongbin Zhao:
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning. CoRR abs/1804.00810 (2018)
- 2017
- [j10]Qichao Zhang, Dongbin Zhao, Yuanheng Zhu:
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs. Neurocomputing 238: 377-386 (2017)
- [j9]Yuanheng Zhu, Dongbin Zhao, Haibo He, Junhong Ji:
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming. IEEE Trans. Ind. Electron. 64(5): 4101-4109 (2017)
- [j8]Yuanheng Zhu, Dongbin Zhao, Xiangjun Li:
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data. IEEE Trans. Neural Networks Learn. Syst. 28(3): 714-725 (2017)
- [j7]Qichao Zhang, Dongbin Zhao, Yuanheng Zhu:
Event-Triggered H∞ Control for Continuous-Time Nonlinear System via Concurrent Learning. IEEE Trans. Syst. Man Cybern. Syst. 47(7): 1071-1081 (2017)
- [c8]Kun Shao, Yuanheng Zhu, Dongbin Zhao:
Cooperative reinforcement learning for multiple units combat in starCraft. SSCI 2017: 1-6
- 2016
- [j6]Dongbin Zhao, Qichao Zhang, Ding Wang, Yuanheng Zhu:
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics. IEEE Trans. Cybern. 46(3): 854-865 (2016)
- [c7]Qichao Zhang, Dongbin Zhao, Yuanheng Zhu, Xi Chen:
Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations. IJCNN 2016: 4533-4538
- [c6]Dongbin Zhao, Yuanheng Zhu, Le Lv, Yaran Chen, Qichao Zhang:
Convolutional fitted Q iteration for vision-based control problems. IJCNN 2016: 4539-4544
- [c5]Dongbin Zhao, Haitao Wang, Kun Shao, Yuanheng Zhu:
Deep reinforcement learning with experience replay based on SARSA. SSCI 2016: 1-6
- 2015
- [j5]Yuanheng Zhu, Dongbin Zhao, Haibo He, Junhong Ji:
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems. Cogn. Comput. 7(6): 763-771 (2015)
- [j4]Yuanheng Zhu, Dongbin Zhao, Derong Liu:
Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems. Neurocomputing 149: 124-131 (2015)
- [j3]Yuanheng Zhu, Dongbin Zhao:
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle. Neural Comput. Appl. 26(4): 775-787 (2015)
- [j2]Dongbin Zhao, Yuanheng Zhu:
MEC - A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems. IEEE Trans. Neural Networks Learn. Syst. 26(2): 346-356 (2015)
- [c4]Dong Li, Dongbin Zhao, Yuanheng Zhu, Zhongpu Xia:
Thermal comfort control based on MEC algorithm for HVAC systems. IJCNN 2015: 1-6
- 2014
- [j1]Dongbin Zhao, Zhaohui Hu, Zhongpu Xia, Cesare Alippi, Yuanheng Zhu, Ding Wang:
Full-range adaptive cruise control based on supervised adaptive dynamic programming. Neurocomputing 125: 57-67 (2014)
- [c3]Yuanheng Zhu, Dongbin Zhao:
A data-based online reinforcement learning algorithm with high-efficient exploration. ADPRL 2014: 1-6
- 2013
- [c2]Yuanheng Zhu, Dongbin Zhao:
Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems. ICONIP (2) 2013: 242-249
- 2012
- [c1]Dongbin Zhao, Yuanheng Zhu, Haibo He:
Neural and fuzzy dynamic programming for under-actuated systems. IJCNN 2012: 1-7
last updated on 2024-12-04 21:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license