


default search action
18th ECCV 2024: Milan, Italy - Part V
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part V. Lecture Notes in Computer Science 15063, Springer 2025, ISBN 978-3-031-72651-4 - Zhengdi Yu
, Shaoli Huang
, Yongkang Cheng, Tolga Birdal
:
SignAvatars: A Large-Scale 3D Sign Language Holistic Motion Dataset and Benchmark. 1-19 - Lujun Li, Zimian Wei, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo:
AttnZero: Efficient Attention Discovery for Vision Transformers. 20-37 - Lujun Li, Haosen Sun, Shiwen Li, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo:
Auto-GAS: Automated Proxy Discovery for Training-Free Generative Architecture Search. 38-55 - Haosen Sun, Lujun Li, Peijie Dong, Zimian Wei, Shitong Shao:
Auto-DAS: Automated Proxy Discovery for Training-Free Distillation-Aware Architecture Search. 56-73 - ZeXiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang, Wanli Ouyang:
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation. 74-91 - Huabin Liu
, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin
:
TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning. 92-107 - Haejoon Lee
, Aswin C. Sankaranarayanan
:
Spectral Subsurface Scattering for Material Classification. 108-124 - Benjin Zhu
, Zhe Wang, Hongsheng Li:
nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding. 125-141 - Xianrui Luo
, Huiqiang Sun
, Juewen Peng
, Zhiguo Cao
:
Dynamic Neural Radiance Field from Defocused Monocular Video. 142-159 - Yang Liu, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang:
PiTe: Pixel-Temporal Alignment for Large Video-Language Model. 160-176 - Shadi Hamdan
, Fatma Güney
:
CarFormer: Self-driving with Learned Object-Centric Representations. 177-193 - Wei Wu, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni B. Chan:
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models. 194-209 - Cheng Shi, Yuchen Zhu, Sibei Yang:
Plain-Det: A Plain Multi-dataset Object Detector. 210-226 - Zhen Zhao
, Zicheng Wang
, Longyue Wang
, Dian Yu
, Yixuan Yuan
, Luping Zhou
:
Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation. 227-243 - Wei Cong
, Yang Cong
, Yuyang Liu
, Gan Sun
:
Cs2K: Class-Specific and Class-Shared Knowledge Guidance for Incremental Semantic Segmentation. 244-261 - Dongliang Cao
, Zorah Lähner
, Florian Bernard:
Synchronous Diffusion for Unsupervised Smooth Non-rigid 3D Shape Matching. 262-281 - David Fan
, Jue Wang
, Shuai Liao, Zhikang Zhang
, Vimal Bhat, Xinyu Li
:
Text-Guided Video Masked Autoencoder. 282-298 - Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht:
Diffusion Models for Open-Vocabulary Segmentation. 299-317 - Peixi Xiong
, Michael Kozuch, Nilesh Jain
:
Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation. 318-334 - Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia:
EvSign: Sign Language Recognition and Translation with Streaming Events. 335-351 - Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang:
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots. 352-367 - Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He:
Zero-Shot Object Counting with Good Exemplars. 368-385 - Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei:
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. 386-402 - Yanbo Wang
, Wentao Zhao
, Chuan Cao, Tianchen Deng
, Jingchuan Wang
, Weidong Chen:
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds. 403-421 - Hyunjin Kim
, Minhyuk Sung
:
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation. 422-439 - Rajeev Yasarla
, Manish Kumar Singh, Hong Cai
, Yunxiao Shi, Jisoo Jeong
, Yinhao Zhu, Shizhong Han
, Risheek Garrepalli, Fatih Porikli
:
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation. 440-458 - Yanyuan Qiao
, Qianyi Liu
, Jiajun Liu
, Jing Liu
, Qi Wu
:
LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. 459-476

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.