"Cross-modal Token Selection for Video Understanding."

Liyong Pan et al. (2022)

Details and statistics

DOI: 10.1145/3552458.3556444

access: closed

type: Conference or Workshop Paper

metadata version: 2024-07-22