default search action
Yao Mu 0001
Person information
- affiliation: University of Hong Kong, Department of Computer Science, Hong Kong
- affiliation (former): Tsinghua University, School of Vehicle and Mobility, Beijing, China
Other persons with the same name
- Yao Mu 0002 — China Agricultural University, College of Food Science and Nutritional Engineering, Beijing, China
- Yao Mu 0003 — Space Engineering University, Institute of Aerospace Information, Beijing, China (and 1 more)
- Yao Mu 0004 — Shanghai International Studies University, School of Business and Management, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j4]Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. IEEE Robotics Autom. Lett. 9(9): 7389-7396 (2024) - [j3]Junjie Wang, Qichao Zhang, Yao Mu, Dong Li, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao:
Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning. IEEE Trans. Ind. Informatics 20(9): 10717-10727 (2024) - [j2]Zeyu Gao, Yao Mu, Chen Chen, Jingliang Duan, Ping Luo, Yanfeng Lu, Shengbo Eben Li:
Enhance Sample Efficiency and Robustness of End-to-End Urban Autonomous Driving via Semantic Masked World Model. IEEE Trans. Intell. Transp. Syst. 25(10): 13067-13079 (2024) - [j1]Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun:
Model-Based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian. IEEE Trans. Neural Networks Learn. Syst. 35(1): 466-478 (2024)
Conference and Workshop Papers
- 2024
- [c24]Zhixuan Liang, Yao Mu, Hengbo Ma, Masayoshi Tomizuka, Mingyu Ding, Ping Luo:
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution. CVPR 2024: 16467-16476 - [c23]Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu:
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model. ICLR 2024 - [c22]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. ICLR 2024 - [c21]Zhiqian Lan, Yuxuan Jiang, Yao Mu, Chen Chen, Shengbo Eben Li:
SEPT: Towards Efficient Scene Representation Learning for Motion Prediction. ICLR 2024 - [c20]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. ICML 2024 - [c19]Shentao Qin, Yujie Yang, Yao Mu, Jie Li, Wenjun Zou, Jingliang Duan, Shengbo Eben Li:
Feasible Reachable Policy Iteration. ICML 2024 - [c18]Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. ICML 2024 - [c17]Yuwei Zeng, Yao Mu, Lin Shao:
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment. ICML 2024 - 2023
- [c16]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC2: Emergent Communication for Embodied Control. CVPR 2023: 6704-6714 - [c15]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao, Zhenguo Li, Ping Luo:
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. ICLR 2023 - [c14]Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan:
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model. ICLR 2023 - [c13]Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo:
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners. ICML 2023: 20725-20745 - [c12]Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang:
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL. ICML 2023: 26087-26105 - [c11]Yao Mu, Zhiqian Lan, Chen Chen, Chang Liu, Ping Luo, Shengbo Eben Li:
Neural MPC-Based Decision-Making Framework for Autonomous Driving in Multi-Lane Roundabout. ITSC 2023: 5403-5409 - [c10]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. NeurIPS 2023 - 2022
- [c9]Qiushan Guo, Yao Mu, Jianyu Chen, Tianqi Wang, Yizhou Yu, Ping Luo:
Scale-Equivalent Distillation for Semi-Supervised Object Detection. CVPR 2022: 14502-14511 - [c8]Xiaoyu Chen, Yao Mark Mu, Ping Luo, Shengbo Li, Jianyu Chen:
Flow-based Recurrent Belief State Learning for POMDPs. ICML 2022: 3444-3468 - [c7]Yao Mark Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo:
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer. ICML 2022: 16043-16061 - [c6]Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang, Ping Luo, Huazhe Xu:
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning. IJCAI 2022: 3702-3708 - [c5]Yao Lai, Yao Mu, Ping Luo:
MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning. NeurIPS 2022 - [c4]Yao Mu, Yuzheng Zhuang, Fei Ni, Bin Wang, Jianyu Chen, Jianye Hao, Ping Luo:
DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning. NeurIPS 2022 - 2021
- [c3]Baiyu Peng, Yao Mu, Yang Guan, Shengbo Eben Li, Yuming Yin, Jianyu Chen:
Model-Based Actor-Critic with Chance Constraint for Stochastic System. CDC 2021: 4694-4700 - [c2]Baiyu Peng, Yao Mu, Jingliang Duan, Yang Guan, Shengbo Eben Li, Jianyu Chen:
Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning. IV 2021: 193-199 - [c1]Yao Mu, Yuzheng Zhuang, Bin Wang, Guangxiang Zhu, Wulong Liu, Jianyu Chen, Ping Luo, Shengbo Li, Chongjie Zhang, Jianye Hao:
Model-Based Reinforcement Learning via Imagination with Derived Memory. NeurIPS 2021: 9493-9505
Informal and Other Publications
- 2024
- [i38]Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. CoRR abs/2401.02695 (2024) - [i37]Junting Chen, Yao Mu, Qiaojun Yu, Tianming Wei, Silang Wu, Zhecheng Yuan, Zhixuan Liang, Chao Yang, Kaipeng Zhang, Wenqi Shao, Yu Qiao, Huazhe Xu, Mingyu Ding, Ping Luo:
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation. CoRR abs/2402.14623 (2024) - [i36]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. CoRR abs/2402.16117 (2024) - [i35]Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. CoRR abs/2402.19007 (2024) - [i34]Qiaojun Yu, Ce Hao, Junbo Wang, Wenhai Liu, Liu Liu, Yao Mu, Yang You, Hengxu Yan, Cewu Lu:
ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics. CoRR abs/2403.13365 (2024) - [i33]Yuwei Zeng, Yao Mu, Lin Shao:
Learning Reward for Robot Skills Using Large Language Models via Self-Alignment. CoRR abs/2405.07162 (2024) - [i32]Zeyu Gao, Yao Mu, Jinye Qu, Mengkang Hu, Lingyue Guo, Ping Luo, Yanfeng Lu:
DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning. CoRR abs/2406.09953 (2024) - [i31]Pengying Wu, Yao Mu, Kangjie Zhou, Ji Ma, Junting Chen, Chang Liu:
CAMON: Cooperative Agents for Multi-Object Navigation with LLM-based Conversations. CoRR abs/2407.00632 (2024) - [i30]Mengkang Hu, Tianxing Chen, Qiguang Chen, Yao Mu, Wenqi Shao, Ping Luo:
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model. CoRR abs/2408.09559 (2024) - [i29]Zhiyuan Li, Yanfeng Lu, Yao Mu, Hong Qiao:
Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments. CoRR abs/2409.02522 (2024) - [i28]Yao Mu, Tianxing Chen, Shijia Peng, Zanxin Chen, Zeyu Gao, Yude Zou, Lunkai Lin, Zhiqiang Xie, Ping Luo:
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version). CoRR abs/2409.02920 (2024) - [i27]Xi Wang, Tianxing Chen, Qiaojun Yu, Tianling Xu, Zanxin Chen, Yiting Fu, Cewu Lu, Yao Mu, Ping Luo:
Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking. CoRR abs/2409.16287 (2024) - [i26]Guanyan Chen, Meiling Wang, Te Cui, Yao Mu, Haoyang Lu, Tianxing Zhou, Zicai Peng, Mengxiao Hu, Haizhou Li, Yuan Li, Yi Yang, Yufeng Yue:
VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions. CoRR abs/2410.20927 (2024) - [i25]Junting Chen, Checheng Yu, Xunzhe Zhou, Tianqi Xu, Yao Mu, Mengkang Hu, Wenqi Shao, Yikai Wang, Guohao Li, Lin Shao:
EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents. CoRR abs/2410.22662 (2024) - 2023
- [i24]Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo:
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners. CoRR abs/2302.01877 (2023) - [i23]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC^2: Emergent Communication for Embodied Control. CoRR abs/2304.09448 (2023) - [i22]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. CoRR abs/2305.15021 (2023) - [i21]Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang:
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL. CoRR abs/2305.19923 (2023) - [i20]Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu:
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model. CoRR abs/2310.02054 (2023) - [i19]Mingxiao Huo, Mingyu Ding, Chenfeng Xu, Thomas Tian, Xinghao Zhu, Yao Mu, Lingfeng Sun, Masayoshi Tomizuka, Wei Zhan:
Human-oriented Representation Learning for Robotic Manipulation. CoRR abs/2310.03023 (2023) - [i18]Hao Sha, Yao Mu, Yuxuan Jiang, Li Chen, Chenfeng Xu, Ping Luo, Shengbo Eben Li, Masayoshi Tomizuka, Wei Zhan, Mingyu Ding:
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving. CoRR abs/2310.03026 (2023) - [i17]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. CoRR abs/2310.08582 (2023) - [i16]Zhixuan Liang, Yao Mu, Hengbo Ma, Masayoshi Tomizuka, Mingyu Ding, Ping Luo:
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution. CoRR abs/2312.11598 (2023) - 2022
- [i15]Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang, Ping Luo, Huazhe Xu:
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning. CoRR abs/2202.09982 (2022) - [i14]Qiushan Guo, Yao Mu, Jianyu Chen, Tianqi Wang, Yizhou Yu, Ping Luo:
Scale-Equivalent Distillation for Semi-Supervised Object Detection. CoRR abs/2203.12244 (2022) - [i13]Xiaoyu Chen, Yao Mu, Ping Luo, Shengbo Li, Jianyu Chen:
Flow-based Recurrent Belief State Learning for POMDPs. CoRR abs/2205.11051 (2022) - [i12]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Zhenguo Li, Ping Luo:
CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. CoRR abs/2206.04028 (2022) - [i11]Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo:
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer. CoRR abs/2206.08883 (2022) - [i10]Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan:
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model. CoRR abs/2210.00498 (2022) - [i9]Zeyu Gao, Yao Mu, Ruoyan Shen, Chen Chen, Yangang Ren, Jianyu Chen, Shengbo Eben Li, Ping Luo, Yanfeng Lu:
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model. CoRR abs/2210.04017 (2022) - [i8]Yao Mu, Yuzheng Zhuang, Fei Ni, Bin Wang, Jianyu Chen, Jianye Hao, Ping Luo:
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning. CoRR abs/2210.04209 (2022) - [i7]Junjie Wang, Yao Mu, Dong Li, Qichao Zhang, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao:
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning. CoRR abs/2211.12774 (2022) - [i6]Yao Lai, Yao Mu, Ping Luo:
MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning. CoRR abs/2211.13382 (2022) - 2021
- [i5]Yuhang Zhang, Yao Mu, Yujie Yang, Yang Guan, Shengbo Eben Li, Qi Sun, Jianyu Chen:
Steadily Learn to Drive with Virtual Memory. CoRR abs/2102.08072 (2021) - [i4]Baiyu Peng, Yao Mu, Jingliang Duan, Yang Guan, Shengbo Eben Li, Jianyu Chen:
Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning. CoRR abs/2102.08539 (2021) - [i3]Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun:
Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian. CoRR abs/2108.11623 (2021) - 2020
- [i2]Yao Mu, Shengbo Eben Li, Chang Liu, Qi Sun, Bingbing Nie, Bo Cheng, Baiyu Peng:
Mixed Reinforcement Learning with Additive Stochastic Uncertainty. CoRR abs/2003.00848 (2020) - [i1]Baiyu Peng, Yao Mu, Yang Guan, Shengbo Eben Li, Yuming Yin, Jianyu Chen:
Model-Based Actor-Critic with Chance Constraint for Stochastic System. CoRR abs/2012.10716 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-19 23:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint