default search action
18th ECCV 2024: Milan, Italy - Part LXIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXIII. Lecture Notes in Computer Science 15121, Springer 2025, ISBN 978-3-031-73035-1 - Yinan Zhang, Eric Tzeng, Yilun Du, Dmitry Kislyuk:
Large-Scale Reinforcement Learning for Diffusion Models. 1-17 - Jiarui Sun, Girish Chowdhary:
CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion. 18-36 - Anestis Kastellos, Athanasios Psaltis, Charalampos Z. Patrikakis, Petros Daras:
FedHARM: Harmonizing Model Architectural Diversity in Federated Learning. 37-53 - Sharath Girish, Kamal Gupta, Abhinav Shrivastava:
EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS. 54-71 - Bartlomiej Sobieski, Przemyslaw Biecek:
Global Counterfactual Directions. 72-90 - Cheng Zhao, Su Sun, Ruoyu Wang, Yuliang Guo, Jun-Jun Wan, Zhou Huang, Xinyu Huang, Yingjie Victor Chen, Liu Ren:
TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving: Supplementary Materials. 91-106 - Yuan-Hao Ho, Jen-Hao Cheng, Sheng-Yao Kuan, Zhongyu Jiang, Wenhao Chai, Hsiang-Wei Huang, Chih-Lung Lin, Jenq-Neng Hwang:
RT-Pose: A 4D Radar Tensor-Based 3D Human Pose Estimation and Localization Benchmark. 107-125 - Ruoxi Chen, Haibo Jin, Yixin Liu, Jinyin Chen, Haohan Wang, Lichao Sun:
EditShield: Protecting Unauthorized Image Editing by Instruction-Guided Diffusion Models. 126-142 - Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV, Yin Li:
RICA2: Rubric-Informed, Calibrated Assessment of Actions. 143-161 - Dahun Kim, Anelia Angelova, Weicheng Kuo:
Region-Centric Image-Language Pretraining for Open-Vocabulary Detection. 162-179 - Fitim Abdullahu, Helmut Grabner:
Commonly Interesting Images. 180-198 - Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara:
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities. 199-216 - Samia Shafique, Shu Kong, Charless C. Fowlkes:
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching. 217-235 - Connor Lee, Matthew Anderson, Nikhil Ranganathan, Xingxing Zuo, Kevin Do, Georgia Gkioxari, Soon-Jo Chung:
Caltech Aerial RGB-Thermal Dataset in the Wild. 236-256 - Benjamin Biggs, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto:
Diffusion Soup: Model Merging for Text-to-Image Diffusion Models. 257-274 - Gopal Sharma, Daniel Rebain, Kwang Moo Yi, Andrea Tagliasacchi:
Volumetric Rendering with Baked Quadrature Fields. 275-292 - Parth Parag Kulkarni, Gaurav Kumar Nayak, Mubarak Shah:
CityGuessr: City-Level Video Geo-Localization on a Global Scale. 293-311 - Changrui Chen, Kurt Debattista, Jungong Han:
Pseudo-labelling Should Be Aware of Disguising Channel Activations. 312-328 - Zhi Qin Tan, Olga Isupova, Gustavo Carneiro, Xiatian Zhu, Yunpeng Li:
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations. 329-346 - Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder:
Revising Densification in Gaussian Splatting. 347-362 - Gwanhyeong Koo, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo:
FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-rigid Editing. 363-379 - Alexander Rich, Noah Stier, Pradeep Sen, Tobias Höllerer:
Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-view Stereo with DIV Loss. 380-397 - Yijun Qian, Jack Urbanek, Alex Hauptmann, Jungdam Won:
Text Motion Translator: A Bi-directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions. 398-414 - Jinho Park, Se Young Chun, Mingoo Seok:
UL-VIO: Ultra-Lightweight Visual-Inertial Odometry with Noise Robust Test-Time Adaptation. 415-432 - Jason J. Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker:
PolyOculus: Simultaneous Multi-view Image-Based Novel View Synthesis. 433-451 - Qirui Wu, Sonia Raychaudhuri, Daniel Ritchie, Manolis Savva, Angel X. Chang:
R3DS: Reality-Linked 3D Scenes for Panoramic Scene Understanding. 452-468 - Or Hirschorn, Shai Avidan:
A Graph-Based Approach for Category-Agnostic Pose Estimation. 469-485
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.