"CLIP-SP: Vision-language model with adaptive prompting for scene parsing."

Jiaao Li et al. (2024)

Details and statistics

DOI: 10.1007/S41095-024-0430-4

access: open

type: Journal Article

metadata version: 2024-09-28