default search action

combined dblp search
author search
venue search
publication search

ask others

Siyuan Huang 0004

> Home > Persons

Person information

affiliation: Shanghai Jiao Tong University, China
affiliation: Shanghai AI Lab, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LuLXZHZYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LuLXZHZYL24
Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li:
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models. ACL (1) 2024: 6159-6172
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LinLZGQXQSCHHZHQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LinLZGQXQSCHHZHQL24
Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. ECCV (62) 2024: 36-55
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiuZQHLZGLJZSXH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuZQHLZGLJZSXH24
Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/CaiHCLGS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/CaiHCLGS024
Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong:
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill. ICRA 2024: 5228-5234
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-11289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11289
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong:
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models. CoRR abs/2403.11289 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-20271
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-20271
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-17490
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-17490
Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Dingyu Zhang, Peng Gao, Shuai Ren, Hongsheng Li:
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents. CoRR abs/2407.17490 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15278
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15278
Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xi, Yu Qiao, Peng Gao, Hongsheng Li:
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions. CoRR abs/2409.15278 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18082
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18082
Xin Li, Siyuan Huang, Qiaojun Yu, Zhengkai Jiang, Ce Hao, Yimeng Zhu, Hongsheng Li, Peng Gao, Cewu Lu:
SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation. CoRR abs/2409.18082 (2024)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-20551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-20551
Qiaojun Yu, Siyuan Huang, Xibin Yuan, Zhengkai Jiang, Ce Hao, Xin Li, Haonan Chang, Junbo Wang, Liu Liu, Hongsheng Li, Peng Gao, Cewu Lu:
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models. CoRR abs/2409.20551 (2024)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01220
Wenbo Zhang, Yang Li, Yanyuan Qiao, Siyuan Huang, Jiajun Liu, Feras Dayoub, Xiao Ma, Lingqiao Liu:
Effective Tuning Strategies for Generalist Robot Manipulation Policies. CoRR abs/2410.01220 (2024)
2023
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangHLHDQGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangHLHDQGL23
Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li:
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangZSLLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangZSLLG23
Siyuan Huang, Bo Zhang, Botian Shi, Hongsheng Li, Yikang Li, Peng Gao:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. ACM Multimedia 2023: 8644-8652
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-11176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-11176
Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09265
Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.