default search action
18th ECCV 2024: Milan, Italy - Part XV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XV. Lecture Notes in Computer Science 15073, Springer 2025, ISBN 978-3-031-72632-3 - Yinghao Xu, Zifan Shi, Yifan Wang, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein:
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation. 1-20 - Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Jingdong Wang, Baining Guo:
IRGen: Generative Modeling for Image Retrieval. 21-41 - Kyu Ri Park, Hong Joo Lee, Jung Uk Kim:
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality. 42-59 - Florian Langer, Jihong Ju, Georgi Dikov, Gerhard Reitmayr, Mohsen Ghafoorian:
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos. 60-77 - Wouter Van Gansbeke, Bert De Brabandere:
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting. 78-97 - Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves:
VISA: Reasoning Video Object Segmentation via Large Language Models. 98-115 - Saman Motamed, Danda Pani Paudel, Luc Van Gool:
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models. 116-133 - Yuanhao Zhai, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David S. Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang:
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. 134-152 - Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada, Yuki M. Asano, Iro Laina, Christian Rupprecht, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka:
Scaling Backwards: Minimal Synthetic Pre-Training? 153-171 - Ekkasit Pinyoanuntapong, Muhammad Usama Saleem, Pu Wang, Minwoo Lee, Srijan Das, Chen Chen:
BAMM: Bidirectional Autoregressive Motion Model. 172-190 - Jiahui Yuan, Hebei Li, Yansong Peng, Jin Wang, Yuheng Jiang, Yueyi Zhang, Xiaoyan Sun:
Event-Based Head Pose Estimation: Benchmark and Method. 191-208 - Ekta Prashnani, Koki Nagano, Shalini De Mello, David Luebke, Orazio Gallo:
Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos. 209-228 - Guangyu Sun, Matías Mendieta, Aritra Dutta, Xin Li, Chen Chen:
Towards Multi-modal Transformers in Federated Learning. 229-246 - Wenke Huang, Mang Ye, Zekun Shi, Bo Du, Dacheng Tao:
Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning. 247-265 - Pengbo Guo, Chengxu Liu, Xingsong Hou, Xueming Qian:
QueryCDR: Query-Based Controllable Distortion Rectification Network for Fisheye Images. 266-284 - Shishira R. Maiya, Anubhav Gupta, Matthew Gwilliam, Max Ehrlich, Abhinav Shrivastava:
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics. 285-302 - Shrey Singh, Prateek Keserwani, Masakazu Iwamura, Partha Pratim Roy:
DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution. 303-320 - Jeongmin Bae, Seoha Kim, Youngsik Yun, Hahyun Lee, Gun Bang, Youngjung Uh:
Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting. 321-335 - Liao Shen, Tianqi Liu, Huiqiang Sun, Xinyi Ye, Baopu Li, Jianming Zhang, Zhiguo Cao:
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion. 336-353 - Shuang Hao, Chunlin Zhong, He Tang:
CoLA: Conditional Dropout and Language-Driven Robust Dual-Modal Salient Object Detection. 354-371 - Zhiyu Wu, Jinshi Cui:
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-supervised Learning. 372-388 - Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng:
RPBG: Towards Robust Neural Point-Based Graphics in the Wild. 389-406 - Jiahao Chang, Yinglin Xu, Yihao Li, Yuantao Chen, Wensen Feng, Xiaoguang Han:
GaussReg: Fast 3D Registration with Gaussian Splatting. 407-423 - Yifan Pu, Zhuofan Xia, Jiayi Guo, Dongchen Han, Qixiu Li, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang, Li Xiu:
Efficient Diffusion Transformer with Step-Wise Dynamic Attention Mediators. 424-441 - Pengfei Wang, Yuxi Wang, Shuai Li, Zhaoxiang Zhang, Zhen Lei, Lei Zhang:
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation. 442-460 - Kihwan Yoon, Yong Han Kim, Sungjei Kim, Jinwoo Jeong:
IAM-VFI : Interpolate Any Motion for Video Frame Interpolation with Motion Complexity Map. 461-477 - Siyi Du, Shaoming Zheng, Yinsong Wang, Wenjia Bai, Declan P. O'Regan, Chen Qin:
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data. 478-496
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.