default search action
Tianpei Yang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j8]Carl Orge Retzlaff, Srijita Das, Christabel Wayllace, Payam Mousavi, Mohammad Afshari, Tianpei Yang, Anna Saranti, Alessa Angerschmid, Matthew E. Taylor, Andreas Holzinger:
Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities. J. Artif. Intell. Res. 79: 359-415 (2024) - [j7]Claire Glanois, Paul Weng, Matthieu Zimmer, Dong Li, Tianpei Yang, Jianye Hao, Wulong Liu:
A survey on interpretable reinforcement learning. Mach. Learn. 113(8): 5847-5890 (2024) - [j6]Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang:
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8762-8782 (2024) - [c28]Jizhou Wu, Jianye Hao, Tianpei Yang, Xiaotian Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAAI 2024: 15934-15942 - [c27]Tianpei Yang, Heng You, Jianye Hao, Yan Zheng, Matthew E. Taylor:
A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning. AAAI 2024: 16352-16360 - [c26]Yihong Chen, Cong Wang, Tianpei Yang, Meng Wang, Yingfeng Chen, Jifei Zhou, Chaoyi Zhao, Xinfeng Zhang, Zeng Zhao, Changjie Fan, Zhipeng Hu, Rong Xiong, Long Zeng:
Mastering Robot Control through Point-based Reinforcement Learning with Pre-training. AAMAS 2024: 2198-2200 - [c25]Hao Zhang, Tianpei Yang, Yan Zheng, Jianye Hao, Matthew E. Taylor:
PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning. AAMAS 2024: 2585-2587 - [i16]Qianxi Li, Yingyue Cao, Jikun Kang, Tianpei Yang, Xi Chen, Jun Jin, Matthew E. Taylor:
LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models. CoRR abs/2401.00907 (2024) - [i15]Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor:
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning. CoRR abs/2404.13061 (2024) - 2023
- [j5]Tianpei Yang, Weixun Wang, Jianye Hao, Matthew E. Taylor, Yong Liu, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Chunxu Ren, Ye Huang, Jiangcheng Zhu, Yang Gao:
ASN: action semantics network for multiagent reinforcement learning. Auton. Agents Multi Agent Syst. 37(2): 45 (2023) - [c24]David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Wenbin Song, Feifei Tong, Matthew E. Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, Yaodong Yang:
Learning to Shape Rewards Using a Game of Two Partners. AAAI 2023: 11604-11612 - [c23]Jizhou Wu, Tianpei Yang, Xiaotian Hao, Jianye Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAMAS 2023: 2460-2462 - [c22]Siqi Chen, Qisong Sun, Heng You, Tianpei Yang, Jianye Hao:
Transfer Learning based Agent for Automated Negotiation. AAMAS 2023: 2895-2898 - [c21]Yuanqiang Yu, Tianpei Yang, Yongliang Lv, Yan Zheng, Jianye Hao:
T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler. IJCNN 2023: 1-8 - [c20]Siqi Chen, Tianpei Yang, Heng You, Jianing Zhao, Jianye Hao, Gerhard Weiss:
Transfer Reinforcement Learning Based Negotiating Agent Framework. PAKDD (2) 2023: 386-397 - 2022
- [j4]Chengwei Zhang, Kangjie Zheng, Yu Tian, Wanli Xue, Tianpei Yang, Dou An, Yongqi Pi, Rong Chen:
Advertising Impression Resource Allocation Strategy with Multi-Level Budget Constraint DQN in Real-Time Bidding. Neurocomputing 488: 647-656 (2022) - [j3]Chengwei Zhang, Yu Tian, Zhibin Zhang, Wanli Xue, Xiaofei Xie, Tianpei Yang, Xin Ge, Rong Chen:
Neighborhood Cooperative Multiagent Reinforcement Learning for Adaptive Traffic Signal Control in Epidemic Regions. IEEE Trans. Intell. Transp. Syst. 23(12): 25157-25168 (2022) - [c19]Yining Li, Tianpei Yang, Jianye Hao, Yan Zheng, Hongyao Tang:
Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator. DAI 2022: 29-44 - [c18]Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Wenyuan Tao, Zhen Wang:
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration. ICML 2022: 12979-12997 - [c17]Yushi Cao, Zhiming Li, Tianpei Yang, Hao Zhang, Yan Zheng, Yi Li, Jianye Hao, Yang Liu:
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis. NeurIPS 2022 - [c16]Heng You, Tianpei Yang, Yan Zheng, Jianye Hao, Matthew E. Taylor:
Cross-domain adaptive transfer reinforcement learning based on state-action correspondence. UAI 2022: 2299-2309 - [i14]Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Zhen Wang:
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration. CoRR abs/2203.08553 (2022) - [i13]Yushi Cao, Zhiming Li, Tianpei Yang, Hao Zhang, Yan Zheng, Yi Li, Jianye Hao, Yang Liu:
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis. CoRR abs/2205.13728 (2022) - [i12]Taher Jafferjee, Juliusz Krysztof Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang, David Mguni:
Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training. CoRR abs/2209.01054 (2022) - [i11]Amir Rasouli, Randy Goebel, Matthew E. Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, Kasra Rezaee, Animesh Garg, David Meger, Jun Luo, Liam Paull, Weinan Zhang, Xinyu Wang, Xi Chen:
NeurIPS 2022 Competition: Driving SMARTS. CoRR abs/2211.07545 (2022) - 2021
- [j2]Yan Zheng, Jianye Hao, Zongzhang Zhang, Zhaopeng Meng, Tianpei Yang, Yanran Li, Changjie Fan:
Efficient policy detecting and reusing for non-stationarity in Markov games. Auton. Agents Multi Agent Syst. 35(1): 2 (2021) - [j1]Ruotong Li, Yuqi Tong, Tianpei Yang, Jianxi Guo, Weixin Si, Yanfang Zhang, Reinhard Klein, Pheng-Ann Heng:
Towards quantitative and intuitive percutaneous tumor puncture via augmented virtual reality. Comput. Medical Imaging Graph. 90: 101905 (2021) - [c15]Amir Rasouli, Soheil Alizadeh, Iuliia Kotseruba, Yi Ma, Hebin Liang, Yuan Tian, Zhiyu Huang, Haochen Liu, Jingda Wu, Randy Goebel, Tianpei Yang, Matthew E. Taylor, Liam Paull, Xi Chen:
Driving SMARTS Competition at NeurIPS 2022: Insights and Outcome. NeurIPS (Competition and Demos) 2021: 73-84 - [c14]Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang:
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. NeurIPS 2021: 17037-17048 - [i10]Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Jianye Hao, Zhaopeng Meng, Peng Liu:
Exploration in Deep Reinforcement Learning: A Comprehensive Survey. CoRR abs/2109.06668 (2021) - [i9]Cong Wang, Tianpei Yang, Jianye Hao, Yan Zheng, Hongyao Tang, Fazl Barez, Jinyi Liu, Jiajie Peng, Haiyin Piao, Zhixiao Sun:
ED2: An Environment Dynamics Decomposition Framework for World Model Construction. CoRR abs/2112.02817 (2021) - [i8]Claire Glanois, Paul Weng, Matthieu Zimmer, Dong Li, Tianpei Yang, Jianye Hao, Wulong Liu:
A Survey on Interpretable Reinforcement Learning. CoRR abs/2112.13112 (2021) - 2020
- [c13]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning. AAAI 2020: 7293-7300 - [c12]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. AAMAS 2020: 2053-2055 - [c11]Chao Yu, Tianpei Yang, Wenxuan Zhu, Yinzhao Dong, Guangliang Li:
Interactive RL via Online Human Demonstrations. AAMAS 2020: 2065-2067 - [c10]Yang Tian, Yuming Bai, Shengdong Zhao, Chi-Wing Fu, Tianpei Yang, Pheng-Ann Heng:
Virtually-Extended Proprioception: Providing Spatial Reference in VR through an Appended Virtual Limb. CHI 2020: 1-12 - [c9]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. ICLR 2020 - [c8]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Wulong Liu, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer. IJCAI 2020: 3094-3100 - [i7]Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Wulong Liu, Yujing Hu, Yingfeng Chen:
Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework. CoRR abs/2002.08030 (2020) - [i6]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Weixun Wang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. CoRR abs/2002.08037 (2020)
2010 – 2019
- 2019
- [c7]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Yan Zheng, Chongjie Zhang, Ze Zheng:
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. AAMAS 2019: 2282-2284 - [c6]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng:
Towards Efficient Detection and Optimal Response against Sophisticated Opponents. IJCAI 2019: 623-629 - [c5]Ruotong Li, Tianpei Yang, Weixin Si, Xiangyun Liao, Qiong Wang, Reinhard Klein, Pheng-Ann Heng:
Augmented Reality Guided Respiratory Liver Tumors Punctures: A Preliminary Feasibility Study. SIGGRAPH Asia Technical Briefs 2019: 114-117 - [i5]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. CoRR abs/1907.11461 (2019) - [i4]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning. CoRR abs/1909.02790 (2019) - 2018
- [c4]Yan Zheng, Zhaopeng Meng, Jianye Hao, Zongzhang Zhang, Tianpei Yang, Changjie Fan:
A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents. NeurIPS 2018: 962-972 - [c3]Chao Yu, Dongxu Wang, Tianpei Yang, Wenxuan Zhu, Yuchen Li, Hongwei Ge, Jiankang Ren:
Adaptively Shaping Reinforcement Learning Agents via Human Reward. PRICAI (1) 2018: 85-97 - [c2]Wanshu Liu, Chengwei Zhang, Tianpei Yang, Jianye Hao, Xiaohong Li, Zhijie Bao:
Achieving Multiagent Coordination Through CALA-rFMQ Learning in Continuous Action Space. PRICAI 2018: 132-139 - [i3]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Sandip Sen, Sheng Jin:
Hierarchical Heuristic Learning towards Effcient Norm Emergence. CoRR abs/1803.03059 (2018) - [i2]Tianpei Yang, Zhaopeng Meng, Jianye Hao, Chongjie Zhang, Yan Zheng:
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. CoRR abs/1809.04240 (2018) - [i1]Chao Yu, Tianpei Yang, Wenxuan Zhu, Dongxu Wang, Guangliang Li:
Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning. CoRR abs/1811.04272 (2018) - 2016
- [c1]Tianpei Yang, Zhaopeng Meng, Jianye Hao, Sandip Sen, Chao Yu:
Accelerating Norm Emergence Through Hierarchical Heuristic Learning. ECAI 2016: 1344-1352
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint