18th ECCV 2024: Milan, Italy - Part XLVIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLVIII. Lecture Notes in Computer Science 15106, Springer 2025, ISBN 978-3-031-73194-5
- Xiaoyu Liu, Yuxiang Wei, Ming Liu, Xianhui Lin, Peiran Ren, Xuansong Xie, Wangmeng Zuo:
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions. 1-17
- Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu:
InterFusion: Text-Driven Generation of 3D Human-Object Interaction. 18-35
- Han Zhou, Wei Dong, Xiaohong Liu, Shuaicheng Liu, Xiongkuo Min, Guangtao Zhai, Jun Chen:
GLARE: Low Light Image Enhancement via Generative Latent Feature Based Codebook Retrieval. 36-54
- Xiaofeng Wang, Zheng Zhu, Guan Huang, Xinze Chen, Jiagang Zhu, Jiwen Lu:
DriveDreamer: Towards Real-World-Drive World Models for Autonomous Driving. 55-72
- Muhammad Adi Nugroho, Sangmin Woo, Sumin Lee, Jinyoung Park, Yooseung Wang, Donguk Kim, Changick Kim:
Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition. 73-91
- Ruilong Li, Sanja Fidler, Angjoo Kanazawa, Francis Williams:
NeRF-XL: Scaling NeRFs with Multiple GPUs. 92-107
- Jiankun Zhao, Bowen Song, Liyue Shen:
CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems. 108-126
- Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould:
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? 127-142
- Chuanhao Li, Zhen Li, Chenchen Jing, Yuwei Wu, Mingliang Zhai, Yunde Jia:
Compositional Substitutivity of Visual Reasoning for Visual Question Answering. 143-160
- Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu:
LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models. 161-179
- Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo:
DNI: Dilutional Noise Initialization for Diffusion Video Editing. 180-195
- Xin Duan, Yu Cao, Lei Zhu, Gang Fu, Xin Wang, Renjie Zhang, Ping Li:
Two-Stage Video Shadow Detection via Temporal-Spatial Adaption. 196-214
- Qichen Zheng, Yi Yu, Siyuan Yang, Jun Liu, Kwok-Yan Lam, Alex ChiChung Kot:
Towards Physical World Backdoor Attacks Against Skeleton Action Recognition. 215-233
- Haoyu Guo, He Zhu, Sida Peng, Yuang Wang, Yujun Shen, Ruizhen Hu, Xiaowei Zhou:
SAM-Guided Graph Cut for 3D Instance Segmentation. 234-251
- Chongyan Chen, Mengchen Liu, Noel Codella, Yunsheng Li, Lu Yuan, Danna Gurari:
Fully Authentic Visual Question Answering Dataset from Online Communities. 252-269
- Tao Huang, Jiaqi Liu, Shan You, Chang Xu:
Active Generation for Image Classification. 270-286
- Chen-Wei Xie, Siyang Sun, Liming Zhao, Pandeng Li, Shuailei Ma, Yun Zheng:
FuseTeacher: Modality-Fused Encoders are Strong Vision Supervisors. 287-304
- Chao Chen, Yu-Shen Liu, Zhizhong Han:
Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes. 305-323
- Sotirios Panagiotis Chytas, Hyunwoo J. Kim, Vikas Singh:
Understanding Multi-compositional Learning in Vision and Language Models via Category Theory. 324-341
- Shangchao Su, Bin Li, Xiangyang Xue:
FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients. 342-358
- Youngjin Oh, Keuntek Lee, Jooyoung Lee, Dae-Hyun Lee, Nam Ik Cho:
Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration. 359-375
- Pengkun Jiao, Na Zhao, Jingjing Chen, Yu-Gang Jiang:
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image. 376-392
- Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong, Daehee Park, Kuk-Jin Yoon:
Diffusion-Guided Weakly Supervised Semantic Segmentation. 393-411
- Yang Jin, Yadong Mu:
Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment. 412-429
- Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu:
When Pedestrian Detection Meets Multi-modal Learning: Generalist Model and Benchmark Dataset. 430-448
- Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim, Minsu Cho, Doyup Lee:
NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image. 449-466
- Feng Li, Hao Zhang, Peize Sun, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianwei Yang, Lei Zhang, Jianfeng Gao:
Segment and Recognize Anything at Any Granularity. 467-484