18th ECCV 2024: Milan, Italy - Part XII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol: Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XII. Lecture Notes in Computer Science 15070, Springer 2025, ISBN 978-3-031-73253-9
- Ziyue Huang, Yongchao Feng, Qingjie Liu, Yunhong Wang: MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection. 1-17
- Minlong Lu, Yichen Lu, Siwei Nie, Xudong Yang, Xiaobo Zhang: Self-supervised Video Copy Localization with Regional Token Representation. 18-35
- Claudio Rota, Marco Buzzelli, Joost van de Weijer: Enhancing Perceptual Quality in Video Super-Resolution Through Temporally-Consistent Detail Synthesis Using Diffusion Models. 36-53
- Sibi Catley-Chandar, Richard Shaw, Gregory G. Slabaugh, Eduardo Pérez-Pellitero: RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF. 54-71
- ShahRukh Athar, Shunsuke Saito, Zhengyu Yang, Stanislav Pidhorskyi, Chen Cao: Bridging the Gap: Studio-Like Avatar Creation from a Monocular Phone Capture. 72-88
- Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang: ControlLLM: Augment Language Models with Tools by Searching on Graphs. 89-105
- Lan Feng, Mohammadhossein Bahari, Kaouther Messaoud Ben Amor, Éloi Zablocki, Matthieu Cord, Alexandre Alahi: UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction. 106-123
- Zizheng Yan, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu, Zisheng Ye, Shuguang Cui, Guanying Chen, Xiaoguang Han: DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors. 124-141
- Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun: Vamos: Versatile Action Models for Video Understanding. 142-160
- Xinyu Sun, Lizhao Liu, Hongyan Zhi, Ronghe Qiu, Junwei Liang: Prioritized Semantic Learning for Zero-Shot Instance Navigation. 161-178
- Zhongxing Ma, Shuang Liang, Yongkun Wen, Weixin Lu, Guowei Wan: RoadPainter: Points Are Ideal Navigators for Topology TransformER. 179-195
- Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li: FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis. 196-212
- Jiahui Liu, Xin Wen, Shizhen Zhao, Yingxian Chen, Xiaojuan Qi: Can OOD Object Detectors Learn from Foundation Models? 213-231
- Xiang Fan, Anand Bhattad, Ranjay Krishna: VIDEOSHOP: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion. 232-250
- Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman: MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo. 251-269
- Qiangqiang Wu, Yan Xia, Jia Wan, Antoni B. Chan: Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training. 270-288
- Junsung Lee, Minsoo Kang, Bohyung Han: Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation. 289-304
- Siqi Yang, Zhaojun Huang, Yakun Chang, Bin Fan, Zhaofei Yu, Boxin Shi: Real-Data-Driven 2000 FPS Color Video from Mosaicked Chromatic Spikes. 305-321
- Peirong Liu, Oula Puonti, Xiaoling Hu, Daniel C. Alexander, Juan Eugenio Iglesias: Brain-ID: Learning Contrast-Agnostic Anatomical Representations for Brain Imaging. 322-340
- Youssef Mansour, Xuyang Zhong, Serdar I. Caglar, Reinhard Heckel: TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts. 341-357
- Fernando Pérez-García, Sam Bond-Taylor, Pedro P. Sanchez, Boris van Breugel, Daniel C. Castro, Harshita Sharma, Valentina Salvatelli, Maria T. A. Wetscherek, Hannah Richardson, Matthew P. Lungren, Aditya V. Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse: RadEdit: Stress-Testing Biomedical Vision Models via Diffusion Image Editing. 358-376
- Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé: SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow. 377-395
- Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian: AdaDiffSR: Adaptive Region-Aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution. 396-413
- Hang Xu, Chen Long, Wenxiao Zhang, Yuan Liu, Zhen Cao, Zhen Dong, Bisheng Yang: Explicitly Guided Information Interaction Network for Cross-Modal Point Cloud Completion. 414-432
- Taewoo Kim, Jaeseok Jeong, Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon: Towards Real-World Event-Guided Low-Light Video Enhancement and Deblurring. 433-451
- Zixin Zhu, Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua: Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation. 452-469
- Jinjie Mai, Wenxuan Zhu, Sara Rojas, Jesus Zarzar, Abdullah Hamdi, Guocheng Qian, Bing Li, Silvio Giancola, Bernard Ghanem: TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks. 470-489