


default search action
Long Yang 0004
Person information
- affiliation: Peking University, School of Artificial Intelligence, Institute for AI, Beijing, China
- affiliation (PhD 2021): Zhejiang University, College of Computer Science and Technology, Hangzhou, China
Other persons with the same name
- Long Yang — disambiguation page
- Long Yang 0001
— Northwest A&F University, College of Information Engineering, China (and 1 more)
- Long Yang 0002
— Xidian University, State Key Laboratory of Integrated Services Networks, Xi'an, China
- Long Yang 0003
— Shandong Agricultural University, College of Plant Protection, Tai'an, China (and 2 more)
- Long Yang 0005
— Northwestern Polytechnical University, School of Marine Science and Technology, Shaanxi Key Laboratory of Underwater Information Technology, Xi'an, China (and 2 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j5]Shangding Gu
, Long Yang, Yali Du
, Guang Chen
, Florian Walter, Jun Wang
, Alois Knoll
:
A Review of Safe Reinforcement Learning: Methods, Theories, and Applications. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11216-11235 (2024) - [c17]Fenghao Lei, Long Yang, Shiting Wen, Zhixiong Huang, Zhiwang Zhang, Chaoyi Pang:
Langevin Policy for Safe Reinforcement Learning. ICML 2024 - [c16]Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, Hui Xiong:
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation. IJCAI 2024: 2406-2414 - [c15]Shihong Ding, Long Yang, Luo Luo, Cong Fang:
Optimizing over Multiple Distributions under Generalized Quasar-Convexity Condition. NeurIPS 2024 - [i15]Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, Hui Xiong:
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation. CoRR abs/2404.12633 (2024) - [i14]Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan:
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline. CoRR abs/2405.02572 (2024) - [i13]Tianfu Wang, Long Yang, Chao Wang, Chuan Qin, Liwei Deng, Li Shen, Hui Xiong:
Towards Constraint-aware Learning for Resource Allocation in NFV-enabled Networks. CoRR abs/2410.22999 (2024) - 2023
- [j4]Shangding Gu
, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang
:
Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023) - [j3]Pengfei Li
, Hua Lu
, Rong Zhu, Bolin Ding, Long Yang, Gang Pan:
DILI: A Distribution-Driven Learned Index. Proc. VLDB Endow. 16(9): 2212-2224 (2023) - [j2]Long Yang
, Zhao Li
, Zehong Hu, Shasha Ruan, Gang Pan
:
A Thompson Sampling Algorithm With Logarithmic Regret for Unimodal Gaussian Bandit. IEEE Trans. Neural Networks Learn. Syst. 34(9): 5332-5341 (2023) - [c14]Juntao Dai, Jiaming Ji, Long Yang, Qian Zheng, Gang Pan:
Augmented Proximal Policy Optimization for Safe Reinforcement Learning. AAAI 2023: 7288-7295 - [c13]Pengyun Yue, Long Yang, Cong Fang, Zhouchen Lin:
Zeroth-order Optimization with Weak Dimension Dependency. COLT 2023: 4429-4472 - [c12]Jiayi Guan, Guang Chen, Jiaming Ji, Long Yang, Ao Zhou, Zhijun Li, Changjun Jiang:
VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning. NeurIPS 2023 - [i12]Pengfei Li, Hua Lu, Rong Zhu, Bolin Ding, Long Yang, Gang Pan:
DILI: A Distribution-Driven Learned Index. CoRR abs/2304.08817 (2023) - [i11]Long Yang, Zhixiong Huang, Fenghao Lei, Yucun Zhong, Yiming Yang, Cong Fang, Shiting Wen, Binbin Zhou
, Zhouchen Lin:
Policy Representation via Diffusion Probability Model for Reinforcement Learning. CoRR abs/2305.13122 (2023) - 2022
- [c11]Long Yang, Yu Zhang, Gang Zheng, Qian Zheng, Pengfei Li
, Jianhang Huang, Gang Pan:
Policy Optimization with Stochastic Mirror Descent. AAAI 2022: 8823-8831 - [c10]Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Xueqian Wang, Bo Yuan, Dacheng Tao:
Penalized Proximal Policy Optimization for Safe Reinforcement Learning. IJCAI 2022: 3744-3750 - [c9]Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. NeurIPS 2022 - [i10]Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan:
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning. CoRR abs/2202.07565 (2022) - [i9]Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang
, Alois C. Knoll:
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022) - [i8]Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Bo Yuan, Xueqian Wang, Dacheng Tao:
Penalized Proximal Policy Optimization for Safe Reinforcement Learning. CoRR abs/2205.11814 (2022) - [i7]Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang
, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. CoRR abs/2209.07089 (2022) - 2021
- [c8]Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li
, Gang Pan:
On Convergence of Gradient Expected Sarsa(λ). AAAI 2021: 10621-10629 - [c7]Long Yang, Qian Zheng, Gang Pan:
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points. AAAI 2021: 10630-10638 - 2020
- [j1]Wenjia Meng
, Qian Zheng, Long Yang, Pengfei Li
, Gang Pan
:
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network. IEEE Trans. Neural Networks Learn. Syst. 31(10): 4374-4380 (2020) - [c6]Longxiang Shi, Shijian Li, Qian Zheng, Longbing Cao
, Long Yang, Gang Pan:
Maximum Entropy Reinforcement Learning with Evolution Strategies. IJCNN 2020: 1-8 - [c5]Pengfei Li
, Hua Lu
, Qian Zheng, Long Yang, Gang Pan:
LISA: A Learned Index Structure for Spatial Data. SIGMOD Conference 2020: 2119-2133 - [i6]Long Yang, Qian Zheng, Gang Pan:
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points. CoRR abs/2012.01491 (2020) - [i5]Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan:
On Convergence of Gradient Expected Sarsa(λ). CoRR abs/2012.07199 (2020)
2010 – 2019
- 2019
- [c4]Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan:
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. AAMAS 2019: 1025-1032 - [c3]Pengfei Li
, Hua Lu
, Gang Zheng, Qian Zheng, Long Yang, Gang Pan:
Exploiting Ratings, Reviews and Relationships for Item Recommendations in Topic Based Social Networks. WWW 2019: 995-1005 - [i4]Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan:
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. CoRR abs/1905.07237 (2019) - [i3]Long Yang, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan:
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning. CoRR abs/1909.02877 (2019) - 2018
- [c2]Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan:
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning. IJCAI 2018: 2984-2990 - [c1]Pengfei Li, Jingtian Peng, Long Yang, Qian Zheng, Gang Pan:
Crux - A New Fast, Flexible and Decentralized Consensus Algorithm with High Fault Tolerance Rate. SmartBlock 2018: 66-76 - [i2]Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan:
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning. CoRR abs/1802.03171 (2018) - [i1]Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan:
Qualitative Measurements of Policy Discrepancy for Return-based Deep Q-Network. CoRR abs/1806.06953 (2018)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-18 02:19 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint