18th ECCV 2024: Milan, Italy - Part XLIV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
  Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLIV. Lecture Notes in Computer Science 15102, Springer 2025, ISBN 978-3-031-72783-2
- Yan Hong, Yuxuan Duan, Bo Zhang, Haoxing Chen, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang:
  ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion. 1-18
- Shaocheng Yan, Pengcheng Shi, Jiayuan Li:
  ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency. 19-37
- Yuchen Yang, Yu Qiao, Xiao Sun:
  Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation. 38-55
- Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan:
  MoVideo: Motion-Aware Video Generation with Diffusion Model. 56-74
- Haiwen Diao, Bo Wan, Xu Jia, Yunzhi Zhuge, Ying Zhang, Huchuan Lu, Long Chen:
  SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning. 75-95
- Hongbin Lin, Yifan Zhang, Shuaicheng Niu, Shuguang Cui, Zhen Li:
  MonoTTA: Fully Test-Time Adaptation for Monocular 3D Object Detection. 96-114
- Qianjiang Hu, Zhimin Zhang, Wei Hu:
  RangeLDM: Fast Realistic LiDAR Point Cloud Generation. 115-135
- Xiaofeng Yang, Yiwen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin:
  Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation. 136-152
- Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li:
  Be-Your-Outpainter: Mastering Video Outpainting Through Input-Specific Adaptation. 153-168
- Qi Zhang, Ying Feng, Hongdong Li:
  Physically Plausible Color Correction for Neural Radiance Fields. 169-187
- Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Zhidong Deng, Siyuan Huang, Qing Li:
  Unifying 3D Vision-Language Understanding via Promptable Queries. 188-206
- Dong-Hwan Jang, Sangdoo Yun, Dongyoon Han:
  Model Stock: All We Need Is Just a Few Fine-Tuned Models. 207-223
- Xi Yang, Chenhang He, Jianqi Ma, Lei Zhang:
  Motion-Guided Latent Diffusion for Temporally Consistent Real-World Video Super-Resolution. 224-242
- Yong Zhong, Min Zhao, Zebin You, Xiaofeng Yu, Changwang Zhang, Chongxuan Li:
  PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control. 243-260
- Qiang Wang:
  MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction. 261-278
- Shweta Singh, Aayan Yadav, Jitesh Jain, Humphrey Shi, Justin Johnson, Karan Desai:
  Benchmarking Object Detectors with COCO: A New Path Forward. 279-295
- Chenyue Li, Shuoyi Chen, Mang Ye:
  Adaptive High-Frequency Transformer for Diverse Wildlife Re-identification. 296-313
- Xin-Jian Wu, Ruisong Zhang, Jie Qin, Shijie Ma, Chenglin Liu:
  WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models. 314-333
- Bencheng Liao, Shaoyu Chen, Bo Jiang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang:
  Lane Graph as Path: Continuity-Preserving Path-Wise Modeling for Online Lane Graph Construction. 334-351
- Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang, Guosheng Lin, Qingyao Wu:
  DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency. 352-370
- Zizheng Yang, Hu Yu, Bing Li, Jinghao Zhang, Jie Huang, Feng Zhao:
  Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing. 371-389
- Xuan Wu, Hongxiang Li, Yuanjiang Luo, Xuxin Cheng, Xianwei Zhuang, Meng Cao, Keren Fu:
  Uncertainty-Aware Sign Language Video Retrieval with Probability Distribution Modeling. 390-408
- Dong Wei, Huaijiang Sun, Xiaoning Sun, Shengxiang Hu:
  NeRMo: Learning Implicit Neural Representations for 3D Human Motion Prediction. 409-427
- Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang:
  Bridging Synthetic and Real Worlds for Pre-Training Scene Text Detectors. 428-446
- Ahmad Khaliq, Ming Xu, Stephen Hausler, Michael Milford, Sourav Garg:
  VLAD-BuFF: Burst-Aware Fast Feature Aggregation for Visual Place Recognition. 447-466
- Lujian Yao, Haitao Zhao, Jingchao Peng, Zhongze Wang, Kaijie Zhao:
  DSA: Discriminative Scatter Analysis for Early Smoke Segmentation. 467-484
- Sayan Nag, Koustava Goswami, Srikrishna Karanam:
  SafaRi: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation. 485-503