default search action
18th ECCV 2024: Milan, Italy - Part XLV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLV. Lecture Notes in Computer Science 15103, Springer 2025, ISBN 978-3-031-72994-2 - Yifan Zhan, Zhuoxiao Li, Muyao Niu, Zhihang Zhong, Shohei Nobuhara, Ko Nishino, Yinqiang Zheng:
KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter. 1-18 - Haiqian Han, Jiacheng Lyu, Jianing Li, Henglu Wei, Cheng Li, Yajing Wei, Shu Chen, Xiangyang Ji:
Physical-Based Event Camera Simulator. 19-35 - Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie:
V-IRL: Grounding Virtual Intelligence in Real Life. 36-55 - Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang:
Adversarial Prompt Tuning for Vision-Language Models. 56-72 - Jian Gao, Chun Gu, Youtian Lin, Zhihao Li, Hao Zhu, Xun Cao, Li Zhang, Yao Yao:
Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing. 73-89 - Jinfeng Liu, Lingtong Kong, Bo Li, Zerong Wang, Hong Gu, Jinwei Chen:
Mono-ViFI: A Unified Learning Framework for Self-supervised Single and Multi-frame Monocular Depth Estimation. 90-107 - Shreyank N. Gowda, David A. Clifton:
CC-SAM: SAM with Cross-Feature Attention and Context for Ultrasound Image Segmentation. 108-124 - Wei Chen, Long Chen, Yu Wu:
An Efficient and Effective Transformer Decoder-Based Framework for Multi-task Visual Grounding. 125-141 - Qifeng Li, Xiaosong Jia, Shaobo Wang, Junchi Yan:
Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-V2). 142-158 - Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu:
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion. 159-176 - Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles:
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning. 177-197 - Jingyu Lin, Jiaqi Gu, Bojian Wu, Lubin Fan, Renjie Chen, Ligang Liu, Jieping Ye:
Learning Neural Volumetric Pose Features for Camera Localization. 198-214 - Shuangrui Ding, Rui Qian, Haohang Xu, Dahua Lin, Hongkai Xiong:
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation. 215-233 - Chaojie Ji, Yufeng Li, Yiyi Liao:
REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices. 234-252 - Bolivar Solarte, Chin-Hsuan Wu, Jin-Cheng Jhang, Jonathan Lee, Yi-Hsuan Tsai, Min Sun:
Self-training Room Layout Estimation via Geometry-Aware Ray-Casting. 253-269 - Xin Jin, Bohan Li, Baao Xie, Wenyao Zhang, Jinming Liu, Ziqiang Li, Tao Yang, Wenjun Zeng:
Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback. 270-289 - Xiang Fang, Zeyu Xiong, Wanlong Fang, Xiaoye Qu, Chen Chen, Jianfeng Dong, Keke Tang, Pan Zhou, Yu Cheng, Daizong Liu:
Rethinking Weakly-Supervised Video Temporal Grounding From a Game Perspective. 290-311 - Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu, Yufeng Jane Tseng:
Every Pixel Has Its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization. 312-328 - Fu-Yun Wang, Zhaoyang Huang, Qiang Ma, Guanglu Song, Xudong Lu, Weikang Bian, Yijin Li, Yu Liu, Hongsheng Li:
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model. 329-345 - Taolin Zhang, Jiawang Bai, Zhihe Lu, Dongze Lian, Genping Wang, Xinchao Wang, Shu-Tao Xia:
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach. 346-363 - Chu-Jie Qin, Ruiqi Wu, Zikun Liu, Xin Lin, Chun-Le Guo, Hyun Hee Park, Chongyi Li:
Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration. 364-380 - Xingyu Jiang, Xiuhui Zhang, Ning Gao, Yue Deng:
When Fast Fourier Transform Meets Transformer for Image Restoration. 381-402 - Yingzi Ma, Yulong Cao, Jiachen Sun, Marco Pavone, Chaowei Xiao:
Dolphins: Multimodal Language Model for Driving. 403-420 - Chen Rao, Guangyuan Li, Zehua Lan, Jiakai Sun, Junsheng Luan, Wei Xing, Lei Zhao, Huaizhong Lin, Jianfeng Dong, Dalong Zhang:
Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model. 421-437 - Xunfa Lai, Zhiyu Yang, Jie Hu, Shengchuan Zhang, Liujuan Cao, Guannan Jiang, Zhiyu Wang, Songan Zhang, Rongrong Ji:
CamoTeacher: Dual-Rotation Consistency Learning for Semi-supervised Camouflaged Object Detection. 438-455 - Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Grégory Rogez:
Placing Objects in Context via Inpainting for Out-of-Distribution Segmentation. 456-473 - Mengjun Cheng, Chengquan Zhang, Chang Liu, Yuke Li, Bohan Li, Kun Yao, Xiawu Zheng, Rongrong Ji, Jie Chen:
Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents. 474-491
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.