Long Yang 0004

Person information

affiliation: Peking University, School of Artificial Intelligence, Institute for AI, Beijing, China
affiliation (PhD 2021): Zhejiang University, College of Computer Science and Technology, Hangzhou, China

2020 – today

2024
[j5]
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Alois Knoll:
A Review of Safe Reinforcement Learning: Methods, Theories, and Applications. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11216-11235 (2024)
[c17]
Fenghao Lei, Long Yang, Shiting Wen, Zhixiong Huang, Zhiwang Zhang, Chaoyi Pang:
Langevin Policy for Safe Reinforcement Learning. ICML 2024
[c16]
Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, Hui Xiong:
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation. IJCAI 2024: 2406-2414
[c15]
Shihong Ding, Long Yang, Luo Luo, Cong Fang:
Optimizing over Multiple Distributions under Generalized Quasar-Convexity Condition. NeurIPS 2024
[i15]
Tianfu Wang, Qilin Fan, Chao Wang, Long Yang, Leilei Ding, Nicholas Jing Yuan, Hui Xiong:
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation. CoRR abs/2404.12633 (2024)
[i14]
Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan:
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline. CoRR abs/2405.02572 (2024)
[i13]
Tianfu Wang, Long Yang, Chao Wang, Chuan Qin, Liwei Deng, Li Shen, Hui Xiong:
Towards Constraint-aware Learning for Resource Allocation in NFV-enabled Networks. CoRR abs/2410.22999 (2024)
2023
[j4]
Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang:
Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023)
[j3]
Pengfei Li, Hua Lu, Rong Zhu, Bolin Ding, Long Yang, Gang Pan:
DILI: A Distribution-Driven Learned Index. Proc. VLDB Endow. 16(9): 2212-2224 (2023)
[j2]
Long Yang, Zhao Li, Zehong Hu, Shasha Ruan, Gang Pan:
A Thompson Sampling Algorithm With Logarithmic Regret for Unimodal Gaussian Bandit. IEEE Trans. Neural Networks Learn. Syst. 34(9): 5332-5341 (2023)
[c14]
Juntao Dai, Jiaming Ji, Long Yang, Qian Zheng, Gang Pan:
Augmented Proximal Policy Optimization for Safe Reinforcement Learning. AAAI 2023: 7288-7295
[c13]
Pengyun Yue, Long Yang, Cong Fang, Zhouchen Lin:
Zeroth-order Optimization with Weak Dimension Dependency. COLT 2023: 4429-4472
[c12]
Jiayi Guan, Guang Chen, Jiaming Ji, Long Yang, Ao Zhou, Zhijun Li, Changjun Jiang:
VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning. NeurIPS 2023
[i12]
Pengfei Li, Hua Lu, Rong Zhu, Bolin Ding, Long Yang, Gang Pan:
DILI: A Distribution-Driven Learned Index. CoRR abs/2304.08817 (2023)
[i11]
Long Yang, Zhixiong Huang, Fenghao Lei, Yucun Zhong, Yiming Yang, Cong Fang, Shiting Wen, Binbin Zhou, Zhouchen Lin:
Policy Representation via Diffusion Probability Model for Reinforcement Learning. CoRR abs/2305.13122 (2023)
2022
[c11]
Long Yang, Yu Zhang, Gang Zheng, Qian Zheng, Pengfei Li, Jianhang Huang, Gang Pan:
Policy Optimization with Stochastic Mirror Descent. AAAI 2022: 8823-8831
[c10]
Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Xueqian Wang, Bo Yuan, Dacheng Tao:
Penalized Proximal Policy Optimization for Safe Reinforcement Learning. IJCAI 2022: 3744-3750
[c9]
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. NeurIPS 2022
[i10]
Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan:
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning. CoRR abs/2202.07565 (2022)
[i9]
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll:
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022)
[i8]
Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Bo Yuan, Xueqian Wang, Dacheng Tao:
Penalized Proximal Policy Optimization for Safe Reinforcement Learning. CoRR abs/2205.11814 (2022)
[i7]
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. CoRR abs/2209.07089 (2022)
2021
[c8]
Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan:
On Convergence of Gradient Expected Sarsa(λ). AAAI 2021: 10621-10629
[c7]
Long Yang, Qian Zheng, Gang Pan:
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points. AAAI 2021: 10630-10638
2020
[j1]
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan:
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network. IEEE Trans. Neural Networks Learn. Syst. 31(10): 4374-4380 (2020)
[c6]
Longxiang Shi, Shijian Li, Qian Zheng, Longbing Cao, Long Yang, Gang Pan:
Maximum Entropy Reinforcement Learning with Evolution Strategies. IJCNN 2020: 1-8
[c5]
Pengfei Li, Hua Lu, Qian Zheng, Long Yang, Gang Pan:
LISA: A Learned Index Structure for Spatial Data. SIGMOD Conference 2020: 2119-2133
[i6]
Long Yang, Qian Zheng, Gang Pan:
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points. CoRR abs/2012.01491 (2020)
[i5]
Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan:
On Convergence of Gradient Expected Sarsa(λ). CoRR abs/2012.07199 (2020)

2010 – 2019

2019
[c4]
Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan:
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. AAMAS 2019: 1025-1032
[c3]
Pengfei Li, Hua Lu, Gang Zheng, Qian Zheng, Long Yang, Gang Pan:
Exploiting Ratings, Reviews and Relationships for Item Recommendations in Topic Based Social Networks. WWW 2019: 995-1005
[i4]
Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan:
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. CoRR abs/1905.07237 (2019)
[i3]
Long Yang, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan:
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning. CoRR abs/1909.02877 (2019)
2018
[c2]
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan:
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning. IJCAI 2018: 2984-2990
[c1]
Pengfei Li, Jingtian Peng, Long Yang, Qian Zheng, Gang Pan:
Crux - A New Fast, Flexible and Decentralized Consensus Algorithm with High Fault Tolerance Rate. SmartBlock 2018: 66-76
[i2]
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan:
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning. CoRR abs/1802.03171 (2018)
[i1]
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan:
Qualitative Measurements of Policy Discrepancy for Return-based Deep Q-Network. CoRR abs/1806.06953 (2018)
