default search action
18th ECCV 2024: Milan, Italy - Part XXXIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXXIII. Lecture Notes in Computer Science 15091, Springer 2025, ISBN 978-3-031-73413-7 - Jingyang Xiang, Zuohui Chen, Siqi Li, Qing Wu, Yong Liu:
OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks. 1-18 - Guillaume Jaume, Anurag Vaidya, Andrew Zhang, Andrew H. Song, Richard J. Chen, Sharifa Sahai, Dandan Mo, Emilio Madrigal, Long Phi Le, Faisal Mahmood:
Multistain Pretraining for Slide Representation Learning in Pathology. 19-37 - Qing Jiang, Feng Li, Zhaoyang Zeng, Tianhe Ren, Shilong Liu, Lei Zhang:
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy. 38-57 - Yaomin Huang, Zaomin Yan, Chaomin Shen, Faming Fang, Guixu Zhang:
Harmonizing Knowledge Transfer in Neural Network with Unified Distillation. 58-74 - Shufan Li, Harkanwar Singh, Aditya Grover:
Mamba-ND: Selective State Space Modeling for Multi-dimensional Data. 75-92 - Jie Liu, Haochen Wang, Wenzhe Yin, Jan-Jakob Sonke, Efstratios Gavves:
Click Prompt Learning with Optimal Transport for Interactive Segmentation. 93-110 - Kaili Zheng, Feixiang Lu, Yihao Lv, Liangjun Zhang, Chenyi Guo, Ji Wu:
3D Human Pose Estimation via Non-causal Retentive Networks. 111-128 - Dongkwon Jin, Chang-Su Kim:
OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection. 129-145 - Sungho Chun, Ju Yong Chang:
6DoF Head Pose Estimation Through Explicit Bidirectional Interaction with Face Geometry. 146-163 - Zongliang Wu, Ruiying Lu, Ying Fu, Xin Yuan:
Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging. 164-181 - Masashi Hatano, Ryo Hachiuma, Ryo Fujii, Hideo Saito:
Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition. 182-199 - Zhongxi Chen, Shen Chen, Taiping Yao, Ke Sun, Shouhong Ding, Xianming Lin, Liujuan Cao, Rongrong Ji:
Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition. 200-217 - Zhaomin Chen, Quan Cui, Ruoxi Deng, Jie Hu, Guodao Zhang:
Modeling Label Correlations with Latent Context for Multi-label Recognition. 218-234 - Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang:
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model. 235-252 - Minzhou Pan, Zhenting Wang, Xin Dong, Vikash Sehwag, Lingjuan Lyu, Xue Lin:
Finding Needles in a Haystack: A Black-Box Approach to Invisible Watermark Detection. 253-270 - Yuxin Yao, Siyu Ren, Junhui Hou, Zhi Deng, Juyong Zhang, Wenping Wang:
DynoSurf: Neural Deformation-Based Temporally Consistent Dynamic Surface Reconstruction. 271-288 - Yihong Sun, Bharath Hariharan:
MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos. 289-307 - Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei, Nasser M. Nasrabadi:
ARoFace: Alignment Robustness to Improve Low-Quality Face Recognition. 308-327 - Chieh Liu, Yu-Min Chu, Ting-I Hsieh, Hwann-Tzong Chen, Tyng-Luh Liu:
Learning Diffusion Models for Multi-view Anomaly Detection. 328-345 - Zhihang Zhong, Gurunandan Krishnan, Xiao Sun, Yu Qiao, Sizhuo Ma, Jian Wang:
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation. 346-363 - Huiqun Wang, Yiping Bao, Panwang Pan, Zeming Li, Xiao Liu, Ruijie Yang, Di Huang:
Multi-modal Relation Distillation for Unified 3D Representation Learning. 364-381 - Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang:
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization. 382-398 - Siyu Jiao, Hongguang Zhu, Jiannan Huang, Yao Zhao, Yunchao Wei, Humphrey Shi:
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation. 399-416 - Dekun Lin, Tailai Peng, Rui Chen, Xinran Xie, Xiaolin Qin, Zhe Cui:
Distributionally Robust Loss for Long-Tailed Multi-label Image Classification. 417-433 - Shuzhao Xie, Weixiang Zhang, Chen Tang, Yunpeng Bai, Rongwei Lu, Shijia Ge, Zhi Wang:
MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation. 434-452 - Yuetian Weng, Mingfei Han, Haoyu He, Xiaojun Chang, Bohan Zhuang:
LongVLM: Efficient Long Video Understanding via Large Language Models. 453-470 - Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai:
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World. 471-490
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.