default search action
ACCV 2024: Hanoi, Vietnam - Part VI
- Minsu Cho, Ivan Laptev, Du Tran, Angela Yao, Hongbin Zha:
Computer Vision - ACCV 2024 - 17th Asian Conference on Computer Vision, Hanoi, Vietnam, December 8-12, 2024, Proceedings, Part VI. Lecture Notes in Computer Science 15477, Springer 2025, ISBN 978-981-96-0959-8
Applications of Computer Vision
- Lingya Li, Zhixing Hou, Ming Ma, Jing Xiang, Chuangxin Yuan, Guihua Xia:
Spotlight on Small-Scale Ship Detection: Empowering YOLO with Advanced Techniques and a Novel Dataset. 3-17 - Minse Ha, Wan-Gi Bae, Geunyoung Bae, Jong Taek Lee:
ELLAR: An Action Recognition Dataset for Extremely Low-Light Conditions with Dual Gamma Adaptive Modulation. 18-35 - Qi Chen, Yutong Xie, Biao Wu, Xiaomin Chen, James Ang, Minh-Son To, Xiaojun Chang, Qi Wu:
Act Like a Radiologist: Radiology Report Generation Across Anatomical Regions. 36-52 - Jiahao Ma, Zicheng Duan, Liang Zheng, Chuong Nguyen:
Multiview Detection with Cardboard Human Modeling. 53-70 - Trong-Thang Pham, Ngoc-Vuong Ho, Nhat-Tan Bui, Thinh Phan, Patel Brijesh Patel, Donald A. Adjeroh, Gianfranco Doretto, Anh Nguyen, Carol C. Wu, Hien Nguyen, Ngan Le:
FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation. 71-88 - Shuhong Chen, Matthias Zwicker:
Match-Free Inbetweening Assistant (MIBA): A Practical Animation Tool Without User Stroke Correspondence. 89-103 - Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu:
High-Quality Visually-Guided Sound Separation from Diverse Categories. 104-122 - Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu:
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation. 123-139 - Dongliang Zhang, Yunfei Li, Jiaran Zhou, Yuezun Li:
DPL: Cross-Quality DeepFake Detection via Dual Progressive Learning. 140-156 - Le Wang, Shigang Li:
Learning Neural Radiance Field from Quasi-uniformly Sampled Spherical Image for Immersive Virtual Reality. 157-171 - Devank, Jayateja Kalla, Soma Biswas:
CoVLM: Leveraging Consensus from Vision-Language Models for Semi-supervised Multi-modal Fake News Detection. 172-189 - Bach-Hoang Ngo, Si-Tri Ngo, Phu-Duc Le, Quang-Minh Phan, Minh-Triet Tran, Trung-Nghia Le:
CrossPAR: Enhancing Pedestrian Attribute Recognition with Vision-Language Fusion and Human-Centric Pre-training. 190-205 - Qianqian Zhang, Linwei Qiu, Li Zhou, Junshe An:
ESM-YOLO: Enhanced Small Target Detection Based on Visible and Infrared Multi-modal Fusion. 206-221 - Zhanyi Lu, Yue Zhou, Ao Chen:
Enhancing Photo Animation: Augmented Stylistic Modules and Prior Knowledge Integration. 222-238 - Cairong Yan, Meng Ma, Yanting Zhang, Yongquan Wan:
Dual-Path Multimodal Optimal Transport for Composed Image Retrieval. 239-254 - Alvaro Budria, Adrián López Rodríguez, Òscar Lorente, Francesc Moreno-Noguer:
InstantGeoAvatar: Effective Geometry and Appearance Modeling of Animatable Avatars from Monocular Video. 255-277 - Jaekyeong Lee, Geonung Kim, Sunghyun Cho:
RNA: Video Editing with ROI-Based Neural Atlas. 278-293 - Hongda Liu, Longguang Wang, Weijun Guan, Ye Zhang, Yulan Guo:
Pluggable Style Representation Learning for Multi-style Transfer. 294-312 - Bingzhi Duan, Xiaoyue Wan, Xu Zhao:
FSGait: Fine-Grained Self-supervised Gait Abnormality Detection. 313-329 - Chiheng Zhou, Yongxia Zhou, Chen Pan:
FocusNet: Cascaded Lightweight Networks and Ascending Feature Enhancement for Efficient Salient Object Detection. 330-345 - Jingchong Weng, Boyang Li, Kai Huang:
Event-Based Image Enhancement Under High Dynamic Range Scenarios. 346-360 - Chi Dai Tran, Long Hoang Pham, Duong Nguyen-Ngoc Tran, Quoc Pham-Nam Ho, Jae Wook Jeon:
Dual Memory Networks Guided Reverse Distillation for Unsupervised Anomaly Detection. 361-378 - Wenbin Tian, Qingmiao Jiang, Lu Chen, Haolin Li, Jinyao Yan:
Enhanced Asymmetric Invertible Network for Neural Video Delivery. 379-394 - Tharsan Senthivel, Ngoc-Son Vu:
QR-DETR: Query Routing for Detection Transformer. 395-412 - Tsung-Han Chou, Brian Wang, Wei-Chen Chiu, Jun-Cheng Chen:
A Recipe for CAC: Mosaic-Based Generalized Loss for Improved Class-Agnostic Counting. 413-428 - Xu Guo, Yujin Zheng, Dingwen Wang:
PMTrack: Multi-object Tracking with Motion-Aware. 429-444 - Mohammadreza Salehi, Nikolaos Apostolikas, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano:
Redefining Normal: A Novel Object-Level Approach for Multi-object Novelty Detection. 445-461 - Jiwon Kim, Byeongho Heo, Sangdoo Yun, Seungryong Kim, Dongyoon Han:
Match Me If You Can: Semi-supervised Semantic Correspondence Learning with Unpaired Images. 462-479 - Marina Khoroshiltseva, Luca Palmieri, Sinem Aslan, Sebastiano Vascon, Marcello Pelillo:
Nash Meets Wertheimer: Using Good Continuation in Jigsaw Puzzles. 480-495
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.