default search action
18th ECCV 2024: Milan, Italy - Part LXIV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXIV. Lecture Notes in Computer Science 15122, Springer 2025, ISBN 978-3-031-73038-2 - Anita Rau, Josiah Aklilu, F. Christopher Holsinger, Serena Yeung-Levy:
Depth-Guided NeRF Training via Earth Mover's Distance. 1-17 - Ji Ha Jang, Hoigi Seo, Se Young Chun:
INTRA: Interaction Relationship-Aware Weakly Supervised Affordance Grounding. 18-34 - Sarah Jabbour, Gregory Kondas, Ella Kazerooni, Michael W. Sjoding, David Fouhey, Jenna Wiens:
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks. 35-51 - Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha:
MEERKAT: Audio-Visual Large Language Model for Grounding in Space and Time. 52-70 - Yake Wei, Siwei Li, Ruoxuan Feng, Di Hu:
Diagnosing and Re-learning for Balanced Multimodal Learning. 71-86 - Dongwon Park, Hayeon Kim, Se Young Chun:
Contribution-Based Low-Rank Adaptation with Pre-training Model for Real Image Restoration. 87-105 - Lucas Stoffl, Andy Bonnetto, Stéphane d'Ascoli, Alexander Mathis:
Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders. 106-125 - Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun:
BeyondScene: Higher-Resolution Human-Centric Scene Generation with Pretrained Diffusion. 126-142 - Chao Xu, Ang Li, Linghao Chen, Yulin Liu, Ruoxi Shi, Hao Su, Minghua Liu:
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views. 143-163 - Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke, Serge J. Belongie, Christian Igel, Nico Lang:
MMEarth: Exploring Multi-modal Pretext Tasks for Geospatial Representation Learning. 164-182 - Mia Chiquier, Utkarsh Mall, Carl Vondrick:
Evolving Interpretable Visual Classifiers with Large Language Models. 183-201 - De-An Huang, Shijia Liao, Subhashree Radhakrishnan, Hongxu Yin, Pavlo Molchanov, Zhiding Yu, Jan Kautz:
LITA: Language Instructed Temporal-Localization Assistant. 202-218 - Timothy Chase, Karthik Dantu:
MARs: Multi-view Attention Regularizations for Patch-Based Feature Recognition of Space Terrain. 219-239 - Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeffrey Nichols, Yinfei Yang, Zhe Gan:
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs. 240-255 - Zhengfeng Lai, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah:
Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data. 256-273 - Yangchao Wu, Tian Yu Liu, Hyoungseob Park, Stefano Soatto, Dong Lao, Alex Wong:
AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation. 274-293 - Wei-Yu Lee, Martin D. Dimitrievski, David Van Hamme, Jan Aelterman, Ljubomir Jovanov, Wilfried Philips:
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection. 294-310 - Haijin Zeng, Yuxi Liu, Yongyong Chen, Youfa Liu, Chong Peng, Jingyong Su:
SAH-SCI: Self-supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging. 311-328 - Jeremy Klotz, Shree K. Nayar:
Minimalist Vision with Freeform Pixels. 329-346 - Seongho Kim, Byung Cheol Song:
All You Need Is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation. 347-363 - Umar Khalid, Hasan Iqbal, Nazmul Karim, Muhammad Tayyab, Jing Hua, Chen Chen:
LatentEditor: Text Driven Local Editing of 3D Scenes. 364-380 - Kaustubh Sadekar, David Maier, Atul Ingle:
Single-Photon 3D Imaging with Equi-Depth Photon Histograms. 381-398 - Sanket Kachole, Hussain Sajwani, Fariborz Baghaei Naeini, Dimitrios Makris, Yahya H. Zweiri:
Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision. 399-415 - James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy:
Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models. 416-435 - Prachi Garg, K. J. Joseph, Vineeth N. Balasubramanian, Necati Cihan Camgöz, Chengde Wan, Kenrick Kin, Weiguang Si, Shugao Ma, Fernando De la Torre:
POET: Prompt Offset Tuning for Continual Human Action Adaptation. 436-455 - Shuangzhi Li, Lei Ma, Xingyu Li:
Domain Generalization of 3D Object Detection by Density-Resampling. 456-473 - Chenglin Yang, Siyuan Qiao, Yuan Cao, Yu Zhang, Tao Zhu, Alan L. Yuille, Jiahui Yu:
IG Captioner: Information Gain Captioners Are Strong Zero-Shot Classifiers. 474-490
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.