default search action
Yicong Hong
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi:
Augmented Commonsense Knowledge for Remote Object Grounding. AAAI 2024: 4269-4277 - [c14]Gengze Zhou, Yicong Hong, Qi Wu:
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models. AAAI 2024: 7641-7649 - [c13]Gengze Zhou, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu:
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models. ECCV (7) 2024: 260-278 - [c12]Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan:
LRM: Large Reconstruction Model for Single Image to 3D. ICLR 2024 - [c11]Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi:
Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model. ICLR 2024 - [c10]Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould:
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning. WACV 2024: 5741-5750 - [i19]Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang:
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation. CoRR abs/2402.15852 (2024) - [i18]Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi:
Augmented Commonsense Knowledge for Remote Object Grounding. CoRR abs/2406.01256 (2024) - [i17]Gengze Zhou, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu:
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models. CoRR abs/2407.12366 (2024) - [i16]Desai Xie, Zhan Xu, Yicong Hong, Hao Tan, Difan Liu, Feng Liu, Arie E. Kaufman, Yang Zhou:
Progressive Autoregressive Video Diffusion Models. CoRR abs/2410.08151 (2024) - [i15]Ziwen Chen, Hao Tan, Kai Zhang, Sai Bi, Fujun Luan, Yicong Hong, Fuxin Li, Zexiang Xu:
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats. CoRR abs/2410.12781 (2024) - 2023
- [j1]Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu:
HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8524-8537 (2023) - [c9]Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan:
Learning Navigational Visual Representations with Semantic Map Supervision. ICCV 2023: 3032-3044 - [c8]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. ICCV 2023: 11975-11986 - [i14]Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould:
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning. CoRR abs/2303.16604 (2023) - [i13]Gengze Zhou, Yicong Hong, Qi Wu:
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models. CoRR abs/2305.16986 (2023) - [i12]Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan:
Learning Navigational Visual Representations with Semantic Map Supervision. CoRR abs/2307.12335 (2023) - [i11]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. CoRR abs/2307.15644 (2023) - [i10]Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan:
LRM: Large Reconstruction Model for Single Image to 3D. CoRR abs/2311.04400 (2023) - [i9]Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi:
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model. CoRR abs/2311.06214 (2023) - 2022
- [c7]Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu:
HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation. CVPR 2022: 15397-15406 - [c6]Yicong Hong, Zun Wang, Qi Wu, Stephen Gould:
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. CVPR 2022: 15418-15428 - [i8]Yicong Hong, Zun Wang, Qi Wu, Stephen Gould:
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. CoRR abs/2203.02764 (2022) - [i7]Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu:
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation. CoRR abs/2203.11591 (2022) - [i6]Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao:
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022). CoRR abs/2206.11610 (2022) - 2021
- [c5]Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez Opazo, Stephen Gould:
VLN BERT: A Recurrent Vision-and-Language BERT for Navigation. CVPR 2021: 1643-1653 - [c4]Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. ICCV 2021: 1635-1644 - [c3]Jiawei Liu, Jing Zhang, Yicong Hong, Nick Barnes:
Learning structure-aware semantic segmentation with image-level supervision. IJCNN 2021: 1-8 - [i5]Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. CoRR abs/2104.04167 (2021) - [i4]Jiawei Liu, Jing Zhang, Yicong Hong, Nick Barnes:
Learning structure-aware semantic segmentation with image-level supervision. CoRR abs/2104.07216 (2021) - 2020
- [c2]Yicong Hong, Cristian Rodriguez Opazo, Qi Wu, Stephen Gould:
Sub-Instruction Aware Vision-and-Language Navigation. EMNLP (1) 2020: 3360-3376 - [c1]Yicong Hong, Cristian Rodriguez Opazo, Yuankai Qi, Qi Wu, Stephen Gould:
Language and Visual Entity Relationship Graph for Agent Navigation. NeurIPS 2020 - [i3]Yicong Hong, Cristian Rodriguez Opazo, Qi Wu, Stephen Gould:
Sub-Instruction Aware Vision-and-Language Navigation. CoRR abs/2004.02707 (2020) - [i2]Yicong Hong, Cristian Rodriguez Opazo, Yuankai Qi, Qi Wu, Stephen Gould:
Language and Visual Entity Relationship Graph for Agent Navigation. CoRR abs/2010.09304 (2020) - [i1]Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez Opazo, Stephen Gould:
A Recurrent Vision-and-Language BERT for Navigation. CoRR abs/2011.13922 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-29 02:26 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint