"Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding."

Zezhong Lv, Bing Su (2023)

Details and statistics

DOI: 10.1109/ICME55011.2023.00257

access: closed

type: Conference or Workshop Paper

metadata version: 2023-09-30