- Yiqiao Tan, Haizhong Liu:
How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel. Int. J. Multim. Inf. Retr. 13(1): 8 (2024) - Qingsong Tang, Yingli Chen, Minghui Zhao, Shitong Min, Wuming Jiang:
DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation. Int. J. Multim. Inf. Retr. 13(1): 12 (2024) - Neelu Verma, Anik De, Anand Mishra:
Bridging language to visuals: towards natural language query-to-chart image retrieval. Int. J. Multim. Inf. Retr. 13(3): 32 (2024) - Zhiwen Wang, Donglin Zhang, Zhikai Hu:
LSECA: local semantic enhancement and cross aggregation for video-text retrieval. Int. J. Multim. Inf. Retr. 13(3): 30 (2024) - Rui Wang, Jiawei Zhu, Shoujin Wang, Tao Wang, Jingze Huang, Xianxun Zhu:
Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking. Int. J. Multim. Inf. Retr. 13(4): 39 (2024) - Yuxin Wei, Ligang Zheng, Guoping Qiu, Guocan Cai:
Cross-modal retrieval based on shared proxies. Int. J. Multim. Inf. Retr. 13(1): 5 (2024) - Ashima Yadav, Anika Gupta:
An emotion-driven, transformer-based network for multimodal fake news detection. Int. J. Multim. Inf. Retr. 13(1): 7 (2024) - Xiang Yuan, Shihao Shan, Yuwen Huo, Junkai Jiang, Song Wu:
Text-assisted attention-based cross-modal hashing. Int. J. Multim. Inf. Retr. 13(1): 3 (2024) - Zhongyi Zhai, Jie Liang, Bo Cheng, Lingzhong Zhao, Junyan Qian:
Strengthening attention: knowledge distillation via cross-layer feature fusion for image classification. Int. J. Multim. Inf. Retr. 13(2): 23 (2024) - Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Parameter-efficient tuning of cross-modal retrieval for a specific database via trainable textual and visual prompts. Int. J. Multim. Inf. Retr. 13(1): 14 (2024) - Peng Zhao, Qiangchang Wang, Yilong Yin:
DSPformer: discovering semantic parts with token growth and clustering for zero-shot learning. Int. J. Multim. Inf. Retr. 13(3): 27 (2024) - Shuren Zhou, Zhixiong Li, Jie Liu, Jiarui Zhou, Jianming Zhang:
Progressive spatial-temporal transfer model for unsupervised person re-identification. Int. J. Multim. Inf. Retr. 13(2): 17 (2024) - 2023
- Ahmad AlZu'bi, Lojin Bani Younis, Alia Madain:
An interactive attribute-preserving fashion recommendation with 3D image-based virtual try-on. Int. J. Multim. Inf. Retr. 12(2): 24 (2023) - María Alfaro-Contreras, José M. Iñesta, Jorge Calvo-Zaragoza:
Optical music recognition for homophonic scores with neural networks and synthetic music generation. Int. J. Multim. Inf. Retr. 12(1): 12 (2023) - Mohd. Aquib Ansari, Dushyant Kumar Singh, Vibhav Prakash Singh:
Detecting abnormal behavior in megastore for crime prevention using a deep neural architecture. Int. J. Multim. Inf. Retr. 12(2): 25 (2023) - Shehu Ayuba, Wan Mohd Nazmee Wan Zainon:
Medical image watermarking: a survey on applications, approach and performance requirement compliance. Int. J. Multim. Inf. Retr. 12(2): 33 (2023) - Yuchun Fang, Liangjun Wang, Shiquan Lin, Lan Ni:
Visual feature segmentation with reinforcement learning for continuous sign language recognition. Int. J. Multim. Inf. Retr. 12(2): 39 (2023) - Fatma Gouizi, Ahmed Chaouki Megherbi:
Nested-Net: a deep nested network for background subtraction. Int. J. Multim. Inf. Retr. 12(1): 5 (2023) - Kai He, Nan Pu, Mingrui Lao, Michael S. Lew:
Few-shot and meta-learning methods for image understanding: a survey. Int. J. Multim. Inf. Retr. 12(2): 14 (2023) - Sk Maidul Islam, Subhankar Joardar, Arif Ahmed Sekh:
Ornament image retrieval using few-shot learning. Int. J. Multim. Inf. Retr. 12(2): 30 (2023) - Zetao Jiang, Xiuxian Wang, Zhongyi Zhai, Bo Cheng:
LG-MLFormer: local and global MLP for image captioning. Int. J. Multim. Inf. Retr. 12(1): 4 (2023) - Faten Khemakhem, Hela Ltifi:
Neural style transfer generative adversarial network (NST-GAN) for facial expression recognition. Int. J. Multim. Inf. Retr. 12(2): 26 (2023) - Jeong-Hun Kim, Fei Hao, Carson Kai-Sang Leung, Aziz Nasridinov:
Cluster-guided temporal modeling for action recognition. Int. J. Multim. Inf. Retr. 12(2): 15 (2023) - Qiuyu Kong, Jie Jiang, Junyan Yang, Qi Wang:
Hierarchical bidirectional aggregation with prior guided transformer for few-shot segmentation. Int. J. Multim. Inf. Retr. 12(2): 17 (2023) - Christos Koutlis, Manos Schinas, Symeon Papadopoulos:
MemeTector: enforcing deep focus for meme detection. Int. J. Multim. Inf. Retr. 12(1): 11 (2023) - Mingyong Li, Yewen Li, Mingyuan Ge, Longfei Ma:
CLIP-based fusion-modal reconstructing hashing for large-scale unsupervised cross-modal retrieval. Int. J. Multim. Inf. Retr. 12(1): 2 (2023) - Ruochen Li, Nannan Li, Wenmin Wang:
Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval. Int. J. Multim. Inf. Retr. 12(1): 10 (2023) - Yu Liu, Yanming Guo, Yusuke Matsui:
Special Issue on Open-Domain Image Retrieval in the Wild. Int. J. Multim. Inf. Retr. 12(2): 36 (2023) - Mingyue Liu, Honggang Zhao, Longfei Ma, Mingyong Li:
Modal interaction-enhanced prompt learning by transformer decoder for vision-language models. Int. J. Multim. Inf. Retr. 12(2): 19 (2023) - Shilpa Mahajan, Rajneesh Rani, Karan Trehan:
DELIGHT-Net: DEep and LIGHTweight network to segment Indian text at word level from wild scenic images. Int. J. Multim. Inf. Retr. 12(2): 29 (2023)