


default search action
ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 20
Volume 20, Number 1, January 2024
- Zhenbo Xu
, Hai-Miao Hu
, Liu Liu
, Dongping Zhang, Shifeng Zhang, Wenming Tan
:
Instance-Based Continual Learning: A Real-World Dataset and Baseline for Fresh Recognition. 1:1-1:23 - Xiaoping Liang
, Zhenjun Tang
, Zhixin Li
, Mengzhu Yu
, Hanyun Zhang
, Xianquan Zhang
:
Robust Hashing via Global and Local Invariant Features for Image Copy Detection. 2:1-2:22 - Sandipan Sarma
, Arijit Sur
:
DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning. 3:1-3:23 - Chengyu Zheng
, Ning Song
, Ruoyu Zhang
, Lei Huang
, Zhiqiang Wei
, Jie Nie
:
Scale-Semantic Joint Decoupling Network for Image-Text Retrieval in Remote Sensing. 4:1-4:20 - Jiankai Li
, Yunhong Wang
, Weixin Li
:
Zero-shot Scene Graph Generation via Triplet Calibration and Reduction. 5:1-5:21 - Abid Yaqoob
, Gabriel-Miro Muntean
:
Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR Streaming. 6:1-6:28 - Jia Wang
, Hong-Han Shuai
, Yung-Hui Li
, Wen-Huang Cheng
:
Language-guided Residual Graph Attention Network and Data Augmentation for Visual Grounding. 7:1-7:23 - Haoran Wang
, Yajie Wang
, Baosheng Yu
, Yibing Zhan
, Chunfeng Yuan
, Wankou Yang
:
Attentional Composition Networks for Long-Tailed Human Action Recognition. 8:1-8:18 - Zi-Chao Zhang
, Zhen-Duo Chen
, Zhen-Yu Xie
, Xin Luo
, Xin-Shun Xu
:
S3Mix: Same Category Same Semantics Mixing for Augmenting Fine-grained Images. 9:1-9:16 - Mingkui Tan
, Zhiquan Wen
, Leyuan Fang
, Qi Wu
:
Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. 10:1-10:23 - Yiming Yang
, Weipeng Hu
, Haifeng Hu
:
Syncretic Space Learning Network for NIR-VIS Face Recognition. 11:1-11:25 - Chenghua Li
, Zongze Li
, Jing Sun
, Yun Zhang
, Xiaoping Jiang
, Fan Zhang
:
Dynamic Weighted Gradient Reversal Network for Visible-infrared Person Re-identification. 12:1-12:23 - Jiajun Song
, Zhuo Li
, Weiqing Min
, Shuqiang Jiang
:
Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design. 13:1-13:19 - Yiting Jin
, Jie Wu
, Wanliang Wang
, Yidong Yan
, Jiawei Jiang
, Jianwei Zheng
:
Cascading Blend Network for Image Inpainting. 14:1-14:21 - Kehua Guo
, Liang Chen
, Xiangyuan Zhu
, Xiaoyan Kui
, Jian Zhang
, Heyuan Shi
:
Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution. 15:1-15:23 - Jing Zhao
, Bin Li
, Jiahao Li
, Ruiqin Xiong
, Yan Lu
:
A Universal Optimization Framework for Learning-based Image Codec. 16:1-16:19 - Liping Zhang
, Shukai Chen
, Fei Lin
, Wei Ren
, Kim-Kwang Raymond Choo
, Geyong Min
:
1DIEN: Cross-session Electrocardiogram Authentication Using 1D Integrated EfficientNet. 17:1-17:17 - Baian Chen
, Zhilei Chen
, Xiaowei Hu
, Jun Xu
, Haoran Xie
, Jing Qin
, Mingqiang Wei
:
Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection. 18:1-18:21 - Xiang Gao
, Wei Hu
, Guo-Jun Qi
:
Self-supervised Multi-view Learning via Auto-encoding 3D Transformations. 19:1-19:23 - Dewang Wang
, Gaobo Yang
, Zhiqing Guo
, Jiyou Chen
:
Enhancing Adversarial Embedding based Image Steganography via Clustering Modification Directions. 20:1-20:20 - Xiaojia Zhao
, Tingting Xu
, Qiangqiang Shen
, Youfa Liu
, Yongyong Chen
, Jingyong Su
:
Double High-Order Correlation Preserved Robust Multi-View Ensemble Clustering. 21:1-21:21 - Shuji Tasaka
:
Usefulness of QoS in Multidimensional QoE Prediction for Haptic-Audiovisual Communications. 22:1-22:24 - Ching-Nung Yang
, Xiaotian Wu
, Min-Jung Chung
:
Enhancement of Information Carrying and Decoding for Visual Cryptography with Error Correction. 23:1-23:24 - Yuqing Zhang
, Yong Zhang
, Shaofan Wang
, Yun Liang
, Baocai Yin
:
Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network. 24:1-24:23 - Wenying Wen
, Minghui Huang
, Yushu Zhang
, Yuming Fang
, Yifan Zuo
:
Visual Security Index Combining CNN and Filter for Perceptually Encrypted Light Field Images. 25:1-25:15 - Linlin Liu
, Haijun Zhang
, Qun Li
, Jianghong Ma
, Zhao Zhang
:
Collocated Clothing Synthesis with GANs Aided by Textual Information: A Multi-Modal Framework. 26:1-26:25 - Xulei Lou
, Tinghui Wu
, Haifeng Hu
, Dihu Chen
:
Self-Supervised Consistency Based on Joint Learning for Unsupervised Person Re-identification. 27:1-27:20 - Yichi Zhang
, Gongchun Ding
, Dandan Ding
, Zhan Ma
, Zhu Li
:
On Content-Aware Post-Processing: Adapting Statistically Learned Models to Dynamic Content. 28:1-28:23 - Jing Xu
, Bing Liu
, Yong Zhou
, Mingming Liu
, Rui Yao
, Zhiwen Shao
:
Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning. 29:1-29:16 - Cong Zou
, Rui Wang
, Cheng Jin
, Sanyi Zhang
, Xin Wang
:
S2CL-Leaf Net: Recognizing Leaf Images Like Human Botanists. 30:1-30:20
Volume 20, Number 2, February 2024
- Suyel Namasudra
, Pascal Lorenz
, Seifedine Kadry
, Syed Ahmad Chan Bukhari
:
Introduction to the Special Issue on DNA-centric Modeling and Practice for Next-generation Computing and Communication Systems. 31:1-31:2
- Shaohua Wan
, Yi Jin
, Guangdong Xu
, Michele Nappi
:
Editorial to Special Issue on Multimedia Cognitive Computing for Intelligent Transportation System. 32:1-32:2 - Ruonan Zhao
, Laurence T. Yang
, Debin Liu
, Wanli Lu
, Chenlu Zhu
, Yiheng Ruan
:
Tensor-Empowered LSTM for Communication-Efficient and Privacy-Enhanced Cognitive Federated Learning in Intelligent Transportation Systems. 33:1-33:21 - Hongjian Shi
, Hao Wang
, Ruhui Ma
, Yang Hua
, Tao Song
, Honghao Gao
, Haibing Guan
:
Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation System. 34:1-34:23 - Zejia Weng
, Zuxuan Wu
, Hengduo Li
, Jingjing Chen
, Yu-Gang Jiang
:
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. 35:1-35:18 - Shixiong Zhang
, Wenmin Wang
, Honglei Li
, Shenyong Zhang
:
E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System. 36:1-36:20 - Ram Prasad Padhy
, Pankaj Kumar Sa
, Fabio Narducci
, Carmen Bisogni
, Sambit Bakshi
:
Monocular Vision-aided Depth Measurement from RGB Images for Autonomous UAV Navigation. 37:1-37:22
- Zhihan Lv
, Fabio Poiesi
, Qi Dong
, Jaime Lloret
, Houbing Song
:
Special Issue on Deep Learning for Intelligent Human Computer Interaction. 38:1-38:5 - Wenjuan Gong
, Yue Zhang
, Wei Wang
, Peng Cheng
, Jordi Gonzàlez
:
Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition. 39:1-39:20 - Youcef Djenouri, Asma Belhadi, Gautam Srivastava, Jerry Chun-Wei Lin
:
An Efficient and Accurate GPU-based Deep Learning Model for Multimedia Recommendation. 40:1-40:18 - Loveleen Gaur
, Mohan Bhandari
, Bhadwal Singh Shikhar, NZ Jhanjhi
, Mohammad Shorfuzzaman, Mehedi Masud
:
Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer's Disease. 41:1-41:16 - Mi Li
, Wei Zhang
, Bin Hu
, Jiaming Kang
, Yuqi Wang
, Shengfu Lu
:
Automatic Assessment of Depression and Anxiety through Encoding Pupil-wave from HCI in VR Scenes. 42:1-42:22 - Abdul Qayyum
, Imran Razzak
, Muhammad Tanveer
, Moona Mazher
:
Spontaneous Facial Behavior Analysis Using Deep Transformer-based Framework for Child-computer Interaction. 43:1-43:17 - Xiaowei Chen
, Xiao Jiang
, Lishuang Zhan
, Shihui Guo
, Qunsheng Ruan
, Guoliang Luo
, Minghong Liao
, Yipeng Qin
:
Full-body Human Motion Reconstruction with Sparse Joint Tracking Using Flexible Sensors. 44:1-44:19 - Shanbao Qiao
, Neal N. Xiong
, Yongbin Gao
, Zhijun Fang
, Wenjun Yu
, Juan Zhang
, Xiaoyan Jiang
:
Self-Supervised Learning of Depth and Ego-Motion for 3D Perception in Human Computer Interaction. 45:1-45:21 - Yan Kang
, Bin Pu
, Yongqi Kou
, Yun Yang
, Jianguo Chen
, Khan Muhammad
, Po Yang
, Lida Xu
, Mohammad Hijji
:
A Deep Graph Network with Multiple Similarity for User Clustering in Human-Computer Interaction. 46:1-46:20 - Bahar Uddin Mahmud
, Guan Y. Hong
, Bernard Fong
:
A Study of Human-AI Symbiosis for Creative Work: Recent Developments and Future Directions in Deep Learning. 47:1-47:21 - Xiaoling Gu, Jie Huang
, Yongkang Wong
, Jun Yu
, Jianping Fan
, Pai Peng
, Mohan S. Kankanhalli
:
PAINT: Photo-realistic Fashion Design Synthesis. 48:1-48:23 - Qingfeng Dai
, Yongkang Wong
, Guofei Sun
, Yanwei Wang
, Zhou Zhou
, Mohan S. Kankanhalli
, Xiangdong Li
, Weidong Geng
:
Unsupervised Domain Adaptation by Causal Learning for Biometric Signal-based HCI. 49:1-49:18 - Yi Xiao
, Tong Liu
, Yu Han
, Yue Liu
, Yongtian Wang
:
Realtime Recognition of Dynamic Hand Gestures in Practical Applications. 50:1-50:17 - Jianping Gou
, Liyuan Sun
, Baosheng Yu
, Shaohua Wan
, Dacheng Tao
:
Hierarchical Multi-Attention Transfer for Knowledge Distillation. 51:1-51:20
- Subhrajyoti Deb
, Abhilash Kumar Das
, Nirmalya Kar
:
An Applied Image Cryptosystem on Moore's Automaton Operating on δ (qk)/𝔽2. 52:1-52:20 - Sisi You
, Yukun Zuo
, Hantao Yao
, Changsheng Xu
:
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene. 53:1-53:19 - Shiqi Sun
, Danlan Huang
, Xiaoming Tao
, Chengkang Pan
, Guangyi Liu
, Changwen Chen
:
Boosting Scene Graph Generation with Contextual Information. 54:1-54:24 - Jianwei Zheng
, Yu Liu
, Yuchao Feng
, Honghui Xu
, Meiyu Zhang
:
Contrastive Attention-guided Multi-level Feature Registration for Reference-based Super-resolution. 55:1-55:21 - Shangxi Wu
, Jitao Sang
, Kaiyan Xu
, Guanhua Zheng
, Changsheng Xu
:
Adaptive Adversarial Logits Pairing. 56:1-56:16 - Ying Chen
, Rui Yao
, Yong Zhou
, Jiaqi Zhao
, Bing Liu
, Abdulmotaleb El-Saddik
:
Black-box Attack against Self-supervised Video Object Segmentation Models with Contrastive Loss. 57:1-57:21 - Shuang Liang
, Wentao Ma
, Chi Xie
:
Relation with Free Objects for Action Recognition. 58:1-58:19 - Qiaolin He
, Zhijie Zheng
, Haifeng Hu
:
A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identification. 59:1-59:20 - Wuliang Huang
, Yiqiang Chen
, Xinlong Jiang
, Teng Zhang
, Qian Chen
:
GJFusion: A Channel-Level Correlation Construction Method for Multimodal Physiological Signal Fusion. 60:1-60:23
Volume 20, Number 3, March 2024
- Chengji Shen
, Zhenjiang Liu
, Xin Gao
, Zunlei Feng
, Mingli Song
:
Self-Adaptive Clothing Mapping Based Virtual Try-on. 61:1-61:26 - Alberto Baldrati
, Marco Bertini
, Tiberio Uricchio
, Alberto Del Bimbo
:
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features. 62:1-62:24 - Yan Wang
, Peize Li
, Qingyi Si
, Hanwen Zhang
, Wenyu Zang
, Zheng Lin
, Peng Fu
:
Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering. 63:1-63:22 - Qiang Guo
, Zhi Zhang
, Mingliang Zhou
, Hong Yue
, Huayan Pu
, Jun Luo
:
Image Defogging Based on Regional Gradient Constrained Prior. 64:1-64:17 - Jintao Guo
, Lei Qi
, Yinghuan Shi
, Yang Gao
:
PLACE Dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization. 65:1-65:23 - Yuan Xiong
, Jingru Wang
, Zhong Zhou
:
VirtualLoc: Large-scale Visual Localization Using Virtual Images. 66:1-66:19 - Yiheng Zhang
, Ting Yao
, Zhaofan Qiu
, Tao Mei
:
Explaining Cross-domain Recognition with Interpretable Deep Classifier. 67:1-67:21 - Ruimin Wang
, Fasheng Wang
, Yiming Su
, Jing Sun
, Fuming Sun
, Haojie Li
:
Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection. 68:1-68:22 - Jemily Rime
, Alan Archer-Boyd
, Tom Collins
:
How Will You Pod? Implications of Creators' Perspectives for Designing Innovative Podcasting Tools. 69:1-69:25 - Ming Cheung
:
Learning from the Past: Fast NAS for Tasks and Datasets. 70:1-70:18 - Xinyue Li
, Haiyong Xu
, Gangyi Jiang
, Mei Yu
, Ting Luo
, Xuebo Zhang
, Hongwei Ying
:
Underwater Image Quality Assessment from Synthetic to Real-world: Dataset and Objective Method. 71:1-71:23 - Sujuan Hou
, Jiacheng Li
, Weiqing Min
, Qiang Hou
, Yanna Zhao
, Yuanjie Zheng
, Shuqiang Jiang
:
Deep Learning for Logo Detection: A Survey. 72:1-72:23 - Yunjie Peng
, Jinlin Wu
, Boqiang Xu
, Chunshui Cao
, Xu Liu
, Zhenan Sun
, Zhiqiang He
:
Deep Learning Based Occluded Person Re-Identification: A Survey. 73:1-73:27 - Muhammad Arslan Manzoor
, Sarah Albarri
, Ziting Xian
, Zaiqiao Meng
, Preslav Nakov
, Shangsong Liang
:
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications. 74:1-74:34 - Yanyan Shi
, Shaowu Yang
, Wenjing Yang
, Dianxi Shi
, Xuehui Li
:
Boosting Few-shot Object Detection with Discriminative Representation and Class Margin. 75:1-75:19 - Harry Cheng
, Yangyang Guo
, Tianyi Wang
, Qi Li
, Xiaojun Chang
, Liqiang Nie
:
Voice-Face Homogeneity Tells Deepfake. 76:1-76:22 - Jin Ye
, Meng Dan
, Wenchao Jiang
:
A Visual Sensitivity Aware ABR Algorithm for DASH via Deep Reinforcement Learning. 77:1-77:22 - Jian Wang
, Xiao Wang
, Guosheng Zhao
:
Task Recommendation via Heterogeneous Multi-modal Features and Decision Fusion in Mobile Crowdsensing. 78:1-78:20 - Si-chao Lei
, Yue-Jiao Gong
, Xiaolin Xiao
, Yicong Zhou
, Jun Zhang
:
Boosting Diversity in Visual Search with Pareto Non-Dominated Re-Ranking. 79:1-79:23 - Huijie Zhang
, Pu Li
, Xiaobai Liu
, Xianfeng Terry Yang
, Li An
:
An Iterative Semi-supervised Approach with Pixel-wise Contrastive Loss for Road Extraction in Aerial Images. 80:1-80:21 - Jing Fang
, Yinbo Yu
, Zhongyuan Wang
, Xin Ding
, Ruimin Hu
:
An Image Arbitrary-Scale Super-Resolution Network Using Frequency-domain Information. 81:1-81:23 - Xiao Luo
, Wei Ju
, Yiyang Gu
, Yifang Qin
, Siyu Yi
, Daqing Wu
, Luchen Liu
, Ming Zhang
:
Toward Effective Semi-supervised Node Classification with Hybrid Curriculum Pseudo-labeling. 82:1-82:19 - Wen Guo
, Wuzhou Quan
, Junyu Gao
, Tianzhu Zhang
, Changsheng Xu
:
Feature Disentanglement Network: Multi-Object Tracking Needs More Differentiated Features. 83:1-83:22 - Mohammed Khaleel
, Azeez Idris
, Wallapak Tavanapong
, Jacob Pratt
, Jung-Hwan Oh
, Piet C. de Groen
:
VisActive: Visual-concept-based Active Learning for Image Classification under Class Imbalance. 84:1-84:21 - Honghua Chen
, Zhiqi Li
, Mingqiang Wei
, Jun Wang
:
Geometric and Learning-Based Mesh Denoising: A Comprehensive Survey. 85:1-85:28 - Ning Han
, Yawen Zeng
, Chuhao Shi
, Guangyi Xiao
, Hao Chen
, Jingjing Chen
:
BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval. 86:1-86:21 - Yuan Feng
, Yaojun Hu, Pengfei Fang, Sheng Liu, Yanhong Yang, Shengyong Chen:
Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal. 87:1-87:23 - Yurui Xie
, Ling Guan
:
Sparsity-guided Discriminative Feature Encoding for Robust Keypoint Detection. 88:1-88:22 - Nicolas Beuve
, Wassim Hamidouche
, Olivier Déforges
:
Hierarchical Learning and Dummy Triplet Loss for Efficient Deepfake Detection. 89:1-89:18 - Suncheng Xiang
, Dahong Qian
, Jingsheng Gao
, Zirui Zhang
, Ting Liu
, Yuzhuo Fu
:
Rethinking Person Re-Identification via Semantic-based Pretraining. 90:1-90:17
Volume 20, Number 4, April 2024
- Min Peng
, Xiaohu Shao
, Yu Shi
, Xiangdong Zhou
:
Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering. 91:1-91:22 - Bin Ren
, Hao Tang
, Fanyang Meng
, Runwei Ding
, Philip Torr
, Nicu Sebe
:
Cloth Interactive Transformer for Virtual Try-On. 92:1-92:20 - Xiushan Nie
, Yang Shi
, Ziyu Meng
, Jin Huang
, Weili Guan
, Yilong Yin
:
Complex Scenario Image Retrieval via Deep Similarity-aware Hashing. 93:1-93:24 - Jiawei Tan
, Hongxing Wang
, Junsong Yuan
:
Characters Link Shots: Character Attention Network for Movie Scene Segmentation. 94:1-94:23 - Mingliang Zhou
, Xinwen Zhao
, Futing Luo
, Jun Luo
, Huayan Pu
, Tao Xiang
:
Robust RGB-T Tracking via Adaptive Modality Weight Correlation Filters and Cross-modality Learning. 95:1-95:20 - Zicheng Zhang
, Wei Sun
, Yingjie Zhou
, Jun Jia
, Zhichao Zhang
, Jing Liu
, Xiongkuo Min
, Guangtao Zhai
:
Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images. 96:1-96:22 - Shuvendu Roy
, Ali Etemad
:
Contrastive Learning of View-invariant Representations for Facial Expressions Recognition. 97:1-97:22 - Jun Liu
, Jiantao Zhou
, Haiwei Wu
, Weiwei Sun
, Jinyu Tian
:
Generating Robust Adversarial Examples against Online Social Networks (OSNs). 98:1-98:26 - Tao Yao
, Yiru Li
, Ying Li
, Yingying Zhu
, Gang Wang
, Jun Yue
:
Cross-modal Semantically Augmented Network for Image-text Matching. 99:1-99:18 - Ahmed Telili
, Sid Ahmed Fezza
, Wassim Hamidouche
, Hanene Brachemi Meftah
:
2BiVQA: Double Bi-LSTM-based Video Quality Assessment of UGC Videos. 100:1-100:22 - Hongzhou Chen
, Haihan Duan
, Maha Abdallah
, Yufeng Zhu
, Yonggang Wen
, Abdulmotaleb El-Saddik
, Wei Cai
:
Web3 Metaverse: State-of-the-Art and Vision. 101:1-101:42 - Lilong Wang
, Yunhui Shi
, Jin Wang
, Shujun Chen
, Baocai Yin
, Nam Ling
:
Graph Based Cross-Channel Transform for Color Image Compression. 102:1-102:25 - Kai Han
, Yu Liu
, Rukai Wei
, Ke Zhou
, Jinhui Xu
, Kun Long
:
Supervised Hierarchical Online Hashing for Cross-modal Retrieval. 103:1-103:23 - Fengyi Fu
, Shancheng Fang
, Weidong Chen
, Zhendong Mao
:
Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting. 104:1-104:24 - Yuxiang Peng
, Chong Fu
, Guixing Cao
, Wei Song
, Junxin Chen
, Chiu-Wing Sham
:
JPEG-compatible Joint Image Compression and Encryption Algorithm with File Size Preservation. 105:1-105:20 - Daizong Liu
, Xiaoye Qu
, Jianfeng Dong
, Pan Zhou
, Zichuan Xu
, Haozhao Wang
, Xing Di
, Weining Lu
, Yu Cheng:
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding. 106:1-106:19 - Yijie Hu
, Bin Dong
, Kaizhu Huang
, Lei Ding
, Wei Wang
, Xiaowei Huang
, Qiu-Feng Wang
:
Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment. 107:1-107:20 - Rongjiao Liang
, Shichao Zhang
, Wenzhen Zhang
, Guixian Zhang
, Jinyun Tang
:
Nonlocal Hybrid Network for Long-tailed Image Classification. 108:1-108:22 - Piao Shi
, Min Hu
, Xuefeng Shi
, Fuji Ren
:
Deep Modular Co-Attention Shifting Network for Multimodal Sentiment Analysis. 109:1-109:23 - Jing Zhang
, Dan Guo
, Xun Yang
, Peipei Song
, Meng Wang
:
Visual-linguistic-stylistic Triple Reward for Cross-lingual Image Captioning. 110:1-110:23 - Zhaoyang Jia
, Yan Lu
, Houqiang Li
:
Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis. 111:1-111:20 - Sheng Zhou
, Dan Guo
, Xun Yang
, Jianfeng Dong
, Meng Wang
:
Graph Pooling Inference Network for Text-based VQA. 112:1-112:21 - Hengtong Hu
, Lingxi Xie
, Xinyue Huo
, Richang Hong
, Qi Tian
:
One-Bit Supervision for Image Classification: Problem, Solution, and Beyond. 113:1-113:22 - Hang Yuan
, Wei Gao
, Siwei Ma
, Yiqiang Yan
:
Divide-and-conquer-based RDO-free CU Partitioning for 8K Video Compression. 114:1-114:20 - Mingyu Li
, Tao Zhou
, Zhuo Huang
, Jian Yang
, Jie Yang
, Chen Gong
:
Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class Mismatch. 115:1-115:24 - Hui Huang
, Di Xiao
, Jia Liang
:
Secure Low-complexity Compressive Sensing with Preconditioning Prior Regularization Reconstruction. 116:1-116:22 - Nathan Clement
, Alan Schoen
, Arnold P. Boedihardjo
, Andrew Jenkins
:
Synthetic Data and Hierarchical Object Detection in Overhead Imagery. 117:1-117:20 - Jiang Bian
, Xuhong Li
, Tao Wang
, Qingzhong Wang
, Jun Huang
, Chen Liu
, Jun Zhao
, Feixiang Lu
, Dejing Dou
, Haoyi Xiong
:
P2ANet: A Large-Scale Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos. 118:1-118:23 - Jifan Yang
, Zhongyuan Wang
, Guangcheng Wang
, Baojin Huang
, Yuhong Yang
, Weiping Tu
:
Auxiliary Information Guided Self-attention for Image Quality Assessment. 119:1-119:23 - Zhanzhou Feng
, Jiaming Xu
, Lei Ma
, Shiliang Zhang
:
Efficient Video Transformers via Spatial-temporal Token Merging for Action Recognition. 120:1-120:21
Volume 20, Number 5, May 2024
- Shupei Zhang
, Chenqiu Zhao
, Anup Basu
:
Principal Component Approximation Network for Image Compression. 121:1-121:20 - Tianyu Zhang
, Weiqing Min
, Tao Liu
, Shuqiang Jiang
, Yong Rui
:
Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing. 122:1-122:21 - Yu Liu
, Mingbo Zhao
, Zhao Zhang
, Yuping Liu
, Shuicheng Yan
:
Arbitrary Virtual Try-on Network: Characteristics Preservation and Tradeoff between Body and Clothing. 123:1-123:23 - Shih-Wei Yang
, Li-Hsiang Shen
, Hong-Han Shuai
, Kai-Ten Feng
:
CMAF: Cross-Modal Augmentation via Fusion for Underwater Acoustic Image Recognition. 124:1-124:25 - Yazhou Zhang
, Yang Yu
, Mengyao Wang
, Min Huang
, M. Shamim Hossain
:
Self-Adaptive Representation Learning Model for Multi-Modal Sentiment and Sarcasm Joint Analysis. 125:1-125:17 - Lei Qi
, Peng Dong
, Tan Xiong
, Hui Xue
, Xin Geng
:
DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory. 126:1-126:20 - Dan Shi
, Lei Zhu
, Jingjing Li
, Guohua Dong
, Huaxiang Zhang
:
Incomplete Cross-Modal Retrieval with Deep Correlation Transfer. 127:1-127:21 - Xianhua Zeng
, Xinyu Wang
, Yicai Xie
:
Multiple Pseudo-Siamese Network with Supervised Contrast Learning for Medical Multi-modal Retrieval. 128:1-128:23 - Sisi You
, Hantao Yao
, Bing-Kun Bao
, Changsheng Xu
:
Multi-object Tracking with Spatial-Temporal Tracklet Association. 129:1-129:21 - Gülnaziye Bingöl
, Simone Porcu
, Alessandro Floris
, Luigi Atzori
:
QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features. 130:1-130:23 - Heqian Qiu
, Hongliang Li
, Qingbo Wu
, Hengcan Shi
, Lanxiao Wang
, Fanman Meng
, Linfeng Xu
:
Learning Offset Probability Distribution for Accurate Object Detection. 131:1-131:24 - Alessandro Floris
, Simone Porcu
, Luigi Atzori
:
Controlling Media Player with Hands: A Transformer Approach and a Quality of Experience Assessment. 132:1-132:22 - Jingyu Li
, Zhendong Mao
, Hao Li
, Weidong Chen
, Yongdong Zhang
:
Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning. 133:1-133:23 - Zeyu Ma
, Siwei Wang
, Xiao Luo
, Zhonghui Gu
, Chong Chen
, Jinxing Li
, Xian-Sheng Hua
, Guangming Lu
:
HARR: Learning Discriminative and High-Quality Hash Codes for Image Retrieval. 134:1-134:23 - Chengyang Zhang
, Yong Zhang, Bo Li
, Xinglin Piao, Baocai Yin:
CrowdGraph: Weakly supervised Crowd Counting via Pure Graph Neural Network. 135:1-135:23 - Jie Wang
, Guoqiang Li
, Jie Shi
, Jinwen Xi
:
Weighted Guided Optional Fusion Network for RGB-T Salient Object Detection. 136:1-136:20 - Yibo Zhang, Weiguo Lin
, Junfeng Xu
:
Joint Audio-Visual Attention with Contrastive Learning for More General Deepfake Detection. 137:1-137:23 - Depei Wang
, Ruifeng Xu
, Lianglun Cheng
, Zhuowei Wang
:
Knowledge-integrated Multi-modal Movie Turning Point Identification. 138:1-138:19 - Chunpu Liu
, Guanglei Yang
, Wangmeng Zuo
, Tianyi Zang
:
DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation. 139:1-139:21 - Yunyao Yan
, Guoqing Xiang
, Huizhu Jia
, Jie Chen
, Xiaofeng Huang
, Xiaodong Xie
:
Two-Stage Perceptual Quality Oriented Rate Control Algorithm for HEVC. 140:1-140:20 - Zongyi Li
, Yuxuan Shi
, Hefei Ling
, Jiazhong Chen
, Boyuan Liu
, Runsheng Wang
, Chengxin Zhao
:
Viewpoint Disentangling and Generation for Unsupervised Object Re-ID. 141:1-141:23 - Kuai Dai
, Xutao Li
, Huiwei Lin
, Yin Jiang
, Xunlai Chen
, Yunming Ye
, Di Xian
:
TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction. 142:1-142:24 - Yingnan Ma
, Chenqiu Zhao
, Bingran Huang
, Xudong Li
, Anup Basu
:
RAST: Restorable Arbitrary Style Transfer. 143:1-143:21 - Wei-Yen Hsu
, Hsien-Wen Lin
:
Context-detail-aware United Network for Single Image Deraining. 144:1-144:18 - Yao Liu
, Gangfeng Cui
, Jiahui Luo
, Xiaojun Chang
, Lina Yao
:
Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition. 145:1-145:22 - Chengxin Chen
, Pengyuan Zhang
:
Modality-collaborative Transformer with Hybrid Feature Reconstruction for Robust Emotion Recognition. 146:1-146:23 - Jiafeng Huang
, Tianjun Zhang
, Shengjie Zhao
, Lin Zhang
, Yicong Zhou
:
An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks. 147:1-147:23 - Jing Liu
, Litao Shang
, Yuting Su
, Weizhi Nie
, Xin Wen
, Anan Liu
:
Privacy-preserving Multi-source Cross-domain Recommendation Based on Knowledge Graph. 148:1-148:18 - Xingyu Liu
, Zhongyun Hua
, Shuang Yi
, Yushu Zhang
, Yicong Zhou
:
Bi-directional Block Encoding for Reversible Data Hiding over Encrypted Images. 149:1-149:23 - Peng Yi
, Zhongyuan Wang
, Laigan Luo
, Kui Jiang
, Zheng He
, Junjun Jiang
, Tao Lu
, Jiayi Ma:
Omniscient Video Super-Resolution with Explicit-Implicit Alignment. 150:1-150:23
Volume 20, Number 6, June 2024
- Amit Kumar Singh
, Deepa Kundur
, Mauro Conti
:
Introduction to the Special Issue on Integrity of Multimedia and Multimodal Data in Internet of Things. 151:1-151:4 - Wenyuan Yang
, Shaocong Wu
, Jianwei Fei
, Xianwang Zeng
, Yuemin Ding
, Zhihua Xia
:
A Bitcoin-based Secure Outsourcing Scheme for Optimization Problem in Multimedia Internet of Things. 152:1-152:23 - Qingzhi Liu
, Yuchen Huang
, Chenglu Jin
, Xiaohan Zhou
, Ying Mao
, Cagatay Catal
, Long Cheng
:
Privacy and Integrity Protection for IoT Multimodal Data Using Machine Learning and Blockchain. 153:1-153:18 - Simon Lucas Jonker
, Malthe Jelstrup
, Weizhi Meng
, Brooke Lampe
:
Detecting Post Editing of Multimedia Images using Transfer Learning and Fine Tuning. 154:1-154:22 - Carmen Bisogni
, Lucia Cascone
, Michele Nappi
, Chiara Pero
:
IoT-enabled Biometric Security: Enhancing Smart Car Safety with Depth-based Head Pose Estimation. 155:1-155:24 - Saif E. Nouma
, Attila A. Yavuz
:
Trustworthy and Efficient Digital Twins in Post-Quantum Era with Hybrid Hardware-Assisted Signatures. 156:1-156:30 - Fan Li
, Yanxiang Chen
, Haiyang Liu
, Zuxing Zhao
, Yuanzhi Yao
, Xin Liao
:
Vocoder Detection of Spoofing Speech Based on GAN Fingerprints and Domain Generalization. 157:1-157:20 - Jing Gao
, Peng Li
, Asif Ali Laghari
, Gautam Srivastava
, Thippa Reddy Gadekallu
, Sidra Abbas
, Jianing Zhang
:
Incomplete Multiview Clustering via Semidiscrete Optimal Transport for Multimedia Data Mining in IoT. 158:1-158:20 - Zhenyu Liu
, Da Li
, Xinyu Zhang
, Zhang Zhang
, Peng Zhang
, Caifeng Shan
, Jungong Han
:
Pedestrian Attribute Recognition via Spatio-temporal Relationship Learning for Visual Surveillance. 159:1-159:15
- Manvi Jha
, Ashish Kumar Bhandari
:
NSDIE: Noise Suppressing Dark Image Enhancement Using Multiscale Retinex and Low-Rank Minimization. 160:1-160:22 - Wenhao Fang
, Jiayuan Xie
, Hongfei Liu
, Jiali Chen
, Yi Cai
:
Diverse Visual Question Generation Based on Multiple Objects Selection. 161:1-161:22 - Yichi Zhang
, Dandan Ding
, Zhan Ma
, Zhu Li
:
A Reconfigurable Framework for Neural Network Based Video In-Loop Filtering. 162:1-162:20 - Ronglai Zuo
, Brian Mak
:
Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal. 163:1-163:25 - Qinglin Liu
, Quanling Meng
, Xiaoqian Lv
, Zonglin Li
, Wei Yu
, Shengping Zhang
:
Human Selective Matting. 164:1-164:23 - Shenshen Li
, Xing Xu
, Xun Jiang
, Fumin Shen
, Zhe Sun
, Andrzej Cichocki
:
Cross-Modal Attention Preservation with Self-Contrastive Learning for Composed Query-Based Image Retrieval. 165:1-165:22 - Xizhong Wang
, Rui Liu
, Xin Yang, Qiang Zhang
, Dongsheng Zhou
:
MCFNet: Multi-Attentional Class Feature Augmentation Network for Real-Time Scene Parsing. 166:1-166:17 - Yanzhe Chen
, Jiahuan Zhou
, Yuxin Peng
:
SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback. 167:1-167:17 - Huan Liu
, Xiaolong Liu, Zichang Tan, Xiaolong Li, Yao Zhao
:
PADVG: A Simple Baseline of Active Protection for Audio-Driven Video Generation. 168:1-168:19 - Yadong Huo
, Qibing Qin
, Jiangyan Dai
, Wenfeng Zhang
, Lei Huang
, Chengduan Wang
:
Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval. 169:1-169:23 - Yunyi Li
, Fu Xiao
, Wei Liang
, Linqing Gui
:
Multiply Complementary Priors for Image Compressive Sensing Reconstruction in Impulsive Noise. 170:1-170:22 - Weichao Zhao
, Hezhen Hu
, Wengang Zhou
, Li Li
, Houqiang Li
:
Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video. 171:1-171:18 - Aashania Antil
, Chhavi Dhiman
:
MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofing. 172:1-172:21 - M. Shamim Hossain
, Yixue Hao
, Long Hu
, Jia Liu
, Gang Wei
, Min Chen
:
Immersive Multimedia Service Caching in Edge Cloud with Renewable Energy. 173:1-173:23 - Ying Ying Zhang
, Shuo Zhang
, Ming Hui
:
Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection. 174:1-174:23 - Xuelin Liu
, Jiebin Yan
, Liping Huang
, Yuming Fang
, Zheng Wan
, Yang Liu
:
Perceptual Quality Assessment of Omnidirectional Images: A Benchmark and Computational Model. 175:1-175:24 - Yuhao Cheng
, Yichao Yan
, Wenhan Zhu
, Ye Pan
, Bowen Pan
, Xiaokang Yang
:
Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation. 176:1-176:20 - Hao Chen
, Yunlong Yu
, Yonghan Dong
, Zheming Lu
, Yingming Li
, Zhongfei Zhang
:
Multi-Content Interaction Network for Few-Shot Segmentation. 177:1-177:20 - Zicheng Zhang
, Wei Sun
, Haoning Wu
, Yingjie Zhou
, Chunyi Li
, Zijian Chen
, Xiongkuo Min
, Guangtao Zhai
, Weisi Lin
:
GMS-3DQA: Projection-Based Grid Mini-patch Sampling for 3D Model Quality Assessment. 178:1-178:19 - Jun Lyu
, Guangming Wang
, M. Shamim Hossain
:
Iterative Temporal-spatial Transformer-based Cardiac T1 Mapping MRI Reconstruction. 179:1-179:18 - Yuanjie Dang
, Chunxia Huang
, Peng Chen
, Dongdong Zhao
, Nan Gao
, Ronghua Liang
, Ruohong Huan
:
Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization. 180:1-180:21 - Qiong Chen
, Tianlin Huang
, Qingfa Liu
:
SWRM: Similarity Window Reweighting and Margin for Long-Tailed Recognition. 181:1-181:18 - Peiguang Jing, Xianyi Liu, Lijuan Zhang, Yun Li, Yu Liu, Yuting Su:
Multimodal Attentive Representation Learning for Micro-video Multi-label Classification. 182:1-182:23 - Qingbao Huang
, Pijian Li
, Youji Huang
, Feng Shuang
, Yi Cai
:
Region-Focused Network for Dense Captioning. 183:1-183:20 - Lei Qi
, Hongpeng Yang
, Yinghuan Shi
, Xin Geng
:
MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization. 184:1-184:21 - Yucheng Suo
, Zhedong Zheng
, Xiaohan Wang
, Bang Zhang
, Yi Yang:
Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation. 185:1-185:18
Volume 20, Number 7, July 2024
- Roberto García
, Ana Cediel
, Mercè Teixidó
, Rosa Gil
:
Semantics and Non-fungible Tokens for Copyright Management on the Metaverse and Beyond. 186:1-186:20 - Tianxiu Xie
, Keke Gai
, Liehuang Zhu
, Shuo Wang
, Zijian Zhang
:
RAC-Chain: An Asynchronous Consensus-based Cross-chain Approach to Scalable Blockchain for Metaverse. 187:1-187:24 - Yongjun Ren
, Zhiying Lv
, Neal N. Xiong
, Jin Wang
:
HCNCT: A Cross-chain Interaction Scheme for the Blockchain-based Metaverse. 188:1-188:23 - Shuang-Min Chen
, Rui Xu
, Jian Xu
, Shiqing Xin
, Changhe Tu
, Chenglei Yang
, Lin Lu
:
QuickCSGModeling: Quick CSG Operations Based on Fusing Signed Distance Fields for VR Modeling. 189:1-189:18 - Qinnan Zhang
, Zehui Xiong
, Jianming Zhu
, Sheng Gao
, Wanting Yang
:
A Privacy-preserving Auction Mechanism for Learning Model as an NFT in Blockchain-driven Metaverse. 190:1-190:24 - Han Wang
, Hui Li
, Abla Smahi
, Feng Zhao
, Yao Yao
, Ching Chuen Chan
, Shiyu Wang
, Wenyuan Yang
, Shuo-Yen Robert Li
:
MIS: A Multi-Identifier Management and Resolution System in the Metaverse. 191:1-191:25
- Fei Peng
, Le Qin
, Min Long
, Jin Li
:
Detection of Adversarial Facial Accessory Presentation Attacks Using Local Face Differential. 192:1-192:28 - Xinjian Gao
, Ye Pang
, Yuyu Liu
, Maokun Han
, Jun Yu
, Wei Wang
, Yuanxu Chen
:
Multimodal Visual-Semantic Representations Learning for Scene Text Recognition. 193:1-193:18 - Si-chao Lei
, Yue-Jiao Gong
, Xiaolin Xiao
, Yicong Zhou
, Jun Zhang
:
Tensorial Evolutionary Optimization for Natural Image Matting. 194:1-194:23 - Jifan Yang
, Zhongyuan Wang
, Baojin Huang
, Jiaxin Ai
, Yuhong Yang
, Zixiang Xiong
:
Joint Distortion Restoration and Quality Feature Learning for No-reference Image Quality Assessment. 195:1-195:20 - Weiyao Lin
, Yufeng Zhang
, Wenrui Dai
, Huabin Liu
, John See
, Hongkai Xiong
:
Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations. 196:1-196:23 - Xiaofeng Qu
, Li Liu
, Lei Zhu
, Liqiang Nie
, Huaxiang Zhang
:
Instance-level Adversarial Source-free Domain Adaptive Person Re-identification. 197:1-197:22 - Runyu Yang
, Dong Liu
, Siwei Ma
, Feng Wu
, Wen Gao
:
Perceptual Quality-Oriented Rate Allocation via Distillation from End-to-End Image Compression. 198:1-198:22 - Liangzhe Chen
, Wei Li
, Xiaohui Cui
, Zhenyu Wang
, Stefano Berretti
, Shaohua Wan
:
MS-GDA: Improving Heterogeneous Recipe Representation via Multinomial Sampling Graph Data Augmentation. 199:1-199:23 - Lei Gao
, Zheng Guo
, Ling Guan
:
An Optimal Edge-weighted Graph Semantic Correlation Framework for Multi-view Feature Representation Learning. 200:1-200:23 - Xiaoping Liang
, Wanting Liu
, Xianquan Zhang
, Zhenjun Tang
:
Robust Image Hashing via CP Decomposition and DCT for Copy Detection. 201:1-201:22 - Feng Li
, Yixuan Wu
, Anqi Li
, Huihui Bai
, Runmin Cong
, Yao Zhao
:
Enhanced Video Super-Resolution Network towards Compressed Data. 202:1-202:21 - Penglei Gao
, Xi Yang
, Rui Zhang
, Kaizhu Huang
:
Continuous Image Outpainting with Neural ODE. 203:1-203:16 - Jaime Ruiz-Serra
, Jack White
, Stephen M. Petrie
, Tatiana Kameneva
, Chris McCarthy
:
Learning Scene Representations for Human-assistive Displays Using Self-attention Networks. 204:1-204:26 - Jinjia Peng
, Song Pengpeng
, Hui Li
, Huibing Wang
:
ReFID: Reciprocal Frequency-aware Generalizable Person Re-identification via Decomposition and Filtering. 205:1-205:20 - Carlos Cortés
, Irene Viola
, Jesús Gutiérrez
, Jack Jansen
, Shishir Subramanyam
, Evangelos Alexiou
, Pablo Pérez, Narciso García
, Pablo César
:
Delay Threshold for Social Interaction in Volumetric eXtended Reality Communication. 206:1-206:22 - JongBeom Jeong
, Soonbin Lee
, Eun-Seok Ryu
:
DATRA-MIV: Decoder-Adaptive Tiling and Rate Allocation for MPEG Immersive Video. 207:1-207:22 - Zheng Chen
, Jian Zhao
, Mingyu Yang
, Wengang Zhou
, Houqiang Li
:
Optimizing Camera Motion with MCTS and Target Motion Modeling in Multi-Target Active Object Tracking. 208:1-208:19 - Xiangming Gu
, Longshen Ou
, Wei Zeng
, Jianan Zhang
, Nicholas Wong
, Ye Wang
:
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing. 209:1-209:29 - Mingyu Deng
, Wanyi Zhang
, Jie Zhao
, Zhu Wang
, Mingliang Zhou
, Jun Luo
, Chao Chen
:
A Novel Framework for Joint Learning of City Region Partition and Representation. 210:1-210:23 - Xueqiang Han
, Biao Han
, Jinrong Li
, Congxi Song
:
Multi-agent DRL-based Multipath Scheduling for Video Streaming with QUIC. 211:1-211:23 - Wenxue Cui
, Xingtao Wang
, Xiaopeng Fan
, Shaohui Liu
, Xinwei Gao
, Debin Zhao
:
Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling. 212:1-212:22 - Wenxi Liu
, Jiaxin Cai
, Qi Li
, Chenyang Liao
, Jingjing Cao
, Shengfeng He
, Yuanlong Yu
:
Learning Nighttime Semantic Segmentation the Hard Way. 213:1-213:23 - Xiaoya Yu
, Kejun Wu
, You Yang
, Qiong Liu
:
WaRENet: A Novel Urban Waterlogging Risk Evaluation Network. 214:1-214:28 - Jiawei Tan
, Pingan Yang
, Lu Chen
, Hongxing Wang
:
Temporal Scene Montage for Self-Supervised Video Scene Boundary Detection. 215:1-215:19 - Jun Liu
, Jiantao Zhou
, Jinyu Tian
, Weiwei Sun
:
Recoverable Privacy-Preserving Image Classification through Noise-like Adversarial Examples. 216:1-216:27 - Xiaobo Hu
, Youfang Lin
, Hehe Fan
, Shuo Wang
, Zhihao Wu
, Kai Lv
:
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation. 217:1-217:22 - Baoli Sun
, Xinchen Ye
, Tiantian Yan
, Zhihui Wang
, Haojie Li
, Zhiyong Wang
:
Discriminative Segment Focus Network for Fine-grained Video Action Recognition. 218:1-218:20 - Tingting Han
, Quan Zhou
, Jun Yu
, Zhou Yu
, Jianhui Zhang
, Sicheng Zhao
:
Effective Video Summarization by Extracting Parameter-Free Motion Attention. 219:1-219:20 - Huisi Wu
, Zhaoze Wang
, Yifan Li
, Xueting Liu
, Tong-Yee Lee
:
Suitable and Style-Consistent Multi-Texture Recommendation for Cartoon Illustrations. 220:1-220:26 - Shizhan Liu
, Weiyao Lin
, Yihang Chen
, Yufeng Zhang
, Wenrui Dai
, John See
, Hongkai Xiong
:
A Unified Framework for Jointly Compressing Visual and Semantic Data. 221:1-221:24 - Yefei Sheng
, Ming Tao
, Jie Wang
, Bing-Kun Bao
:
ISF-GAN: Imagine, Select, and Fuse with GPT-Based Text Enrichment for Text-to-Image Synthesis. 222:1-222:17 - Haorao Gao
, Yiming Su
, Fasheng Wang
, Haojie Li
:
Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection. 223:1-223:24 - Xiruo Jiang
, Yazhou Yao
, Sheng Liu
, Fumin Shen
, Liqiang Nie
, Xian-Sheng Hua
:
Dual Dynamic Threshold Adjustment Strategy. 224:1-224:18 - Panpan Zhang
, Meng Liu
, Xuemeng Song
, Da Cao
, Zan Gao
, Liqiang Nie
:
Universal Relocalizer for Weakly Supervised Referring Expression Grounding. 225:1-225:23 - Xiaolong Shen
, Zhedong Zheng
, Yi Yang:
StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition. 226:1-226:19 - Kankana Roy
:
Multimodal Score Fusion with Sparse Low-rank Bilinear Pooling for Egocentric Hand Action Recognition. 227:1-227:22 - Huiyuan Fu
, Jin Liu
, Ting Yu
, Xin Wang
, Huadong Ma
:
Multi-Domain Image-to-Image Translation with Cross-Granularity Contrastive Learning. 228:1-228:21 - Hao Zhang
, Meng Liu
, Yuan Qi
, Ning Yang
, Shunbo Hu
, Liqiang Nie
, Wenyin Zhang
:
Efficient Brain Tumor Segmentation with Lightweight Separable Spatial Convolutional Network. 229:1-229:19
Volume 20, Number 8, August 2024
- Jinliang Liu
, Zhedong Zheng
, Zongxin Yang
, Yi Yang:
High Fidelity Makeup via 2D and 3D Identity Preservation Net. 230:1-230:24 - Junjian Huang
, Hao Ren
, Shulin Liu
, Yong Liu
, Chuanlu Lv
, Jiawen Lu
, Changyong Xie
, Hong Lu
:
Real-Time Attentive Dilated U-Net for Extremely Dark Image Enhancement. 231:1-231:19 - Mingfu Xiong
, Kaikang Hu
, Zhihan Lyu, Fei Fang
, Zhongyuan Wang
, Ruimin Hu, Khan Muhammad
:
Inter-camera Identity Discrimination for Unsupervised Person Re-identification. 232:1-232:18 - Jiaqi Yu
, Jinhai Yang
, Hua Yang
, Renjie Pan
, Pingrui Lai
, Guangtao Zhai
:
Psychology-Guided Environment Aware Network for Discovering Social Interaction Groups from Videos. 233:1-233:23 - Qi Liu
, Xinchen Liu
, Kun Liu
, Xiaoyan Gu
, Wu Liu
:
SigFormer: Sparse Signal-guided Transformer for Multi-modal Action Segmentation. 234:1-234:22 - Jun Lyu
, Shouang Yan
, M. Shamim Hossain
:
DBGAN: Dual Branch Generative Adversarial Network for Multi-Modal MRI Translation. 235:1-235:22 - Dejun Zhang
, Mian Zhang
, Xuefeng Tan
, Jun Liu
:
Bridging the Domain Gap in Scene Flow Estimation via Hierarchical Smoothness Refinement. 236:1-236:21 - Ning Chen
, Zhipeng Cheng
, Xuwei Fan
, Zhang Liu
, Bangzhen Huang
, Yifeng Zhao
, Lianfen Huang
, Xiaojiang Du
, Mohsen Guizani
:
Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception. 237:1-237:28 - Jiayu Yang
, Chunhui Yang
, Fei Xiong
, Yongqi Zhai
, Ronggang Wang
:
Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement. 238:1-238:21 - Xiaoling Gu
, Junkai Zhu
, Yongkang Wong
, Zizhao Wu
, Jun Yu
, Jianping Fan
, Mohan S. Kankanhalli
:
Recurrent Appearance Flow for Occlusion-Free Virtual Try-On. 239:1-239:17 - Yuanjie Lyu
, Penggang Qin
, Tong Xu
, Chen Zhu
, Enhong Chen
:
InteractNet: Social Interaction Recognition for Semantic-rich Videos. 240:1-240:21 - Mrinmoy Bhattacharjee
, S. R. Mahadeva Prasanna
, Prithwijit Guha
:
Exploration of Speech and Music Information for Movie Genre Classification. 241:1-241:19 - Sara Sarto
, Marcella Cornia
, Lorenzo Baraldi
, Alessandro Nicolosi
, Rita Cucchiara
:
Towards Retrieval-Augmented Architectures for Image Captioning. 242:1-242:22 - Kaihui Yang
, Junwei Han
, Guangyu Guo
, Chaowei Fang
, Yingzi Fan
, Lechao Cheng
, Dingwen Zhang
:
Progressive Adapting and Pruning: Domain-Incremental Learning for Saliency Prediction. 243:1-243:19 - Lv Tang
, Xinfeng Zhang
:
High Efficiency Deep-learning Based Video Compression. 244:1-244:23 - Pedro Gomes
, Silvia Rossi
, Laura Toni
:
AGAR - Attention Graph-RNN for Adaptative Motion Prediction of Point Clouds of Deformable Objects. 245:1-245:25 - Jiabo Ye
, Junfeng Tian
, Ming Yan
, Haiyang Xu
, Qinghao Ye
, Yaya Shi
, Xiaoshan Yang
, Xuwu Wang
, Ji Zhang
, Liang He
, Xin Lin
:
UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet. 246:1-246:28 - Wei Zhou
, Qi Yang
, Wu Chen
, Qiuping Jiang
, Guangtao Zhai
, Weisi Lin
:
Blind Quality Assessment of Dense 3D Point Clouds with Structure Guided Resampling. 247:1-247:21 - Yuli Zhao
, Yin Zhang, Francis C. M. Lau
, Hai Yu
, Zhiliang Zhu, Bin Zhang:
Expanding-Window Zigzag Decodable Fountain Codes for Scalable Multimedia Transmission. 248:1-248:24 - Xuanyu Jin
, Ni Li
, Wanzeng Kong
, Jiajia Tang
, Bing Yang
:
Unbiased Semantic Representation Learning Based on Causal Disentanglement for Domain Generalization. 249:1-249:20 - Bo Peng
, Lin Sun
, Jianjun Lei
, Bingzheng Liu
, Haifeng Shen
, Wanqing Li
, Qingming Huang
:
Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning. 250:1-250:19 - Yang Yang
, Shuailong Qiu
, Lanling Zeng
, Zhigeng Pan
:
Detail-preserving Joint Image Upsampling. 251:1-251:23 - Xiao Kang
, Xingbo Liu
, Wen Xue
, Xiushan Nie
, Yilong Yin
:
Online Cross-modal Hashing With Dynamic Prototype. 252:1-252:18 - Yuqing Yang
, Boris Joukovsky
, José Oramas Mogrovejo
, Tinne Tuytelaars
, Nikos Deligiannis
:
SNIPPET: A Framework for Subjective Evaluation of Visual Explanations Applied to DeepFake Detection. 253:1-253:29 - Jinwang Pan
, Xianming Liu
, Yuanchao Bai
, Deming Zhai
, Junjun Jiang
, Debin Zhao
:
Illumination-Aware Low-Light Image Enhancement with Transformer and Auto-Knee Curve. 254:1-254:23 - Lohic Fotio Tiotsop
, Antonio Servetti
, Peter Pocta
, Glenn Van Wallendael
, Marcus Barkowsky
, Enrico Masala
:
Multiple Image Distortion DNN Modeling Individual Subject Quality Assessment. 255:1-255:27 - Yunhui Xu
, Youru Li
, Muhao Xu
, Zhenfeng Zhu
, Yao Zhao
:
HKA: A Hierarchical Knowledge Alignment Framework for Multimodal Knowledge Graph Completion. 256:1-256:19 - Li Zhou
, Zhenyu Liu
, Yutong Li
, Yuchi Duan
, Huimin Yu
, Bin Hu
:
Multi Fine-Grained Fusion Network for Depression Detection. 257:1-257:23 - Chenlei Lv
, Dan Zhang
, Shengling Geng
, Zhongke Wu
, Hui Huang
:
Color Transfer for Images: A Survey. 258:1-258:29 - Zhihao Zhang
, Jun Wang
, Shengjie Li
, Lei Jin
, Hao Wu
, Jian Zhao
, Bo Zhang
:
Review and Analysis of RGBT Single Object Tracking Methods: A Fusion Perspective. 259:1-259:27 - Muhammad Bilal Shaikh
, Douglas Chai
, Syed Mohammed Shamsul Islam
, Naveed Akhtar
:
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey. 260:1-260:24 - Yuankun Liu
, Xiang Yuan
, Haochen Li
, Zhijie Tan
, Jinsong Huang
, Jingjie Xiao
, Weiping Li
, Tong Mo
:
SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval. 261:1-261:28
Volume 20, Number 9, September 2024
- Bo Chen
, Zhisheng Yan
, Klara Nahrstedt
:
Context-aware Optimization for Bandwidth-Efficient Image Analytics Offloading. 262:1-262:22 - Quentin Guimard
, Lucile Sassatelli
, Francesco Marchetti
, Federico Becattini
, Lorenzo Seidenari
, Alberto Del Bimbo
:
Deep Variational Learning for 360° Adaptive Streaming. 263:1-263:25 - Yu Zheng
, Wenchao Zhang
, Wei Song
, Xiuhua Wang
, Chong Fu
:
Encrypted Video Search with Single/Multiple Writers. 264:1-264:23 - Haihan Duan
, Junhua Liao
, Lehao Lin
, Abdulmotaleb El-Saddik
, Wei Cai
:
Meetor: A Human-Centered Automatic Video Editing System for Meeting Recordings. 265:1-265:23 - Na Li
, Yao Liu
:
VertexShuffle-Based Spherical Super-Resolution for 360-Degree Videos. 266:1-266:17 - Guilherme de A. P. Marques, José Matheus Carvalho Boaro, Antonio José G. Busson, Álan L. V. Guedes, Julio Cesar Duarte, Sérgio Colcher:
Action Segmentation through Self-Supervised Video Features and Positional-Encoded Embeddings. 267:1-267:23 - Sara Vlahovic
, Ivan Slivar
, Matko Silic
, Lea Skorin-Kapov
, Mirko Suznjevic
:
Exploring the Facets of the Multiplayer VR Gaming Experience. 268:1-268:24 - Bekir Oguzhan Turkkan
, Ting Dai
, Adithya Raman
, Tevfik Kosar
, Changyou Chen
, Muhammed Fatih Bulut
, Jaroslav Zola
, Daby Sow
:
GreenABR+: Generalized Energy-Aware Adaptive Bitrate Streaming. 269:1-269:24 - Zhiming Hu
, Mete Kemertas
, Lan Xiao
, Caleb Phillips
, Iqbal Mohomed
, Afsaneh Fazly
:
Realizing Efficient On-Device Language-based Image Retrieval. 270:1-270:18 - Amit Hirway
, Yuansong Qiao
, Niall Murray
:
A Quality of Experience and Visual Attention Evaluation for 360° Videos with Non-spatial and Spatial Audio. 271:1-271:20 - Cheonjin Park
, Chinmaey Shende
, Subhabrata Sen
, Bing Wang
:
C2: ABR Streaming in Cognizant of Consumption Context for Improved QoE and Resource Usage Tradeoffs. 272:1-272:27
Volume 20, Number 10, October 2024
- Walayat Hussain
, Honghao Gao
, Rafiul Karim
, Abdulmotaleb El-Saddik
:
Seventeen Years of the ACM Transactions on Multimedia Computing, Communications and Applications: A Bibliometric Overview. 297:1-297:22
- Bowen Yuan
, Jiahao Lu
, Sisi You
, Bing-Kun Bao
:
Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-Identification. 298:1-298:20 - Sixian Chan
, Xianpeng Zeng
, Xinhua Wang
, Jie Hu
, Cong Bai
:
Auxiliary Feature Fusion and Noise Suppression for HOI Detection. 299:1-299:18 - Yefan Li
, Fuqing Duan
, Ke Lu
:
Gated Multi-Modal Edge Refinement Network for Light Field Salient Object Detection. 300:1-300:20 - Dongze Hao
, Qunbo Wang
, Xinxin Zhu
, Jing Liu
:
HCCL: Hierarchical Counterfactual Contrastive Learning for Robust Visual Question Answering. 301:1-301:21 - Jun Jia
, Zhongpai Gao
, Yiwei Yang
, Wei Sun
, Dandan Zhu
, Xiaohong Liu
, Xiongkuo Min
, Guangtao Zhai
:
Hidden Barcode in Sub-Images with Invisible Locating Marker. 302:1-302:24 - Junxin Lu
, Yongbin Gao
, Jieyu Chen
, Jeng-Neng Hwang
, Hamido Fujita
, Zhijun Fang
:
Monocular Depth and Ego-motion Estimation with Scale Based on Superpixel and Normal Constraints. 303:1-303:26 - Zhenjiang Guo
, Xiaohai He
, Yu Yang
, Linbo Qing
, Honggang Chen
:
DAG-YOLO: A Context-Feature Adaptive fusion Rotating Detection Network in Remote Sensing Images. 304:1-304:24 - Yong Zhou
, Zeming Xie
, Jiaqi Zhao
, Wen-Liang Du
, Rui Yao
, Abdulmotaleb El-Saddik
:
Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception. 305:1-305:20 - Yuanyuan Wang
, Meng Liu
, Xuemeng Song
, Liqiang Nie
:
Harnessing Representative Spatial-Temporal Information for Video Question Answering. 306:1-306:20 - Guibiao Liao
, Wei Gao
:
Rethinking Feature Mining for Light Field Salient Object Detection. 307:1-307:24 - Chao Liang
, Linchao Zhu
, Zongxin Yang
, Wei Chen
, Yi Yang:
Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data. 308:1-308:19 - Yitao Peng
, Lianghua He
, Die Hu
, Yihang Liu
, Longzhen Yang
, Shaohua Shang
:
Decoupling Deep Learning for Enhanced Image Recognition Interpretability. 309:1-309:24 - Baoli Sun
, Yanjun Guo
, Tiantian Yan
, Xinchen Ye
, Zhihui Wang
, Haojie Li
, Zhiyong Wang
:
Digging into Depth and Color Spaces: A Mapping Constraint Network for Depth Super-Resolution. 310:1-310:20 - Michael Seufert
, Marius Spangenberger
, Fabian Poignée
, Florian Wamser
, Werner Robitza
, Christian Timmerer
, Tobias Hoßfeld
:
COBIRAS: Offering a Continuous Bit Rate Slide to Maximize DASH Streaming Bandwidth Utilization. 311:1-311:24 - Zhangyong Tang
, Tianyang Xu
, Xiao-Jun Wu
, Josef Kittler
:
Multi-Level Fusion for Robust RGBT Tracking via Enhanced Thermal Representation. 312:1-312:24 - Hanyue Tu
, Li Li
, Wengang Zhou
, Houqiang Li
:
Reconstruction-Free Image Compression for Machine Vision via Knowledge Transfer. 313:1-313:19 - Gai Zhang
, Xinfeng Zhang
, Lv Tang
:
Unified and Scalable Deep Image Compression Framework for Human and Machine. 314:1-314:22 - Fengyong Li
, Huajun Zhai
, Teng Liu
, Xinpeng Zhang
, Chuan Qin
:
Learning Compressed Artifact for JPEG Manipulation Localization Using Wide-Receptive-Field Network. 315:1-315:23 - Shukang Yin
, Sirui Zhao
, Hao Wang
, Tong Xu
, Enhong Chen
:
Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval. 316:1-316:21 - Kayhan Latifzadeh
, Nima Gozalpour
, V. Javier Traver
, Tuukka Ruotsalo
, Aleksandra Kawala-Sterniuk
, Luis A. Leiva
:
Efficient Decoding of Affective States from Video-elicited EEG Signals: An Empirical Investigation. 317:1-317:24 - Ziyue Wu
, Junyu Gao
, Shucheng Huang
, Changsheng Xu
:
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding. 318:1-318:22 - Daniele Lorenzi
, Farzad Tashtarian
, Hermann Hellwagner
, Christian Timmerer
:
MEDUSA: A Dynamic Codec Switching Approach in HTTP Adaptive Streaming. 319:1-319:23 - Ruoyan Pi
, Peng Wu
, Xiangteng He
, Yuxin Peng
:
EOGT: Video Anomaly Detection with Enhanced Object Information and Global Temporal Dependency. 320:1-320:21 - Shengbin Yue
, Yunbin Tu
, Liang Li
, Shengxiang Gao
, Zhengtao Yu
:
Multi-Grained Representation Aggregating Transformer with Gating Cycle for Change Captioning. 321:1-321:23 - Jingjing Wu
, Xi Zhou
, Xiaohong Li
, Hao Liu
, Meibin Qi
, Richang Hong
:
Asymmetric Deformable Spatio-temporal Framework for Infrared Object Tracking. 322:1-322:24 - Zhenyu Li
, Shanshan Gao
, Deqian Mao
, Shouwen Song
, Lei Li
, Yuanfeng Zhou
:
Deep Plug-and-Play Non-Iterative Cluster for 3D Global Feature Extraction. 323:1-323:18 - Mingfu Xue
, Yinghao Wu
, Leo Yu Zhang
, Dujuan Gu
, Yushu Zhang
, Weiqiang Liu
:
SSAT: Active Authorization Control and User's Fingerprint Tracking Framework for DNN IP Protection. 324:1-324:24 - Yongkang Li
, Qifan Liang
, Zhen Han
, Wenjun Mai
, Zhongyuan Wang
:
Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation. 325:1-325:24 - Shuqin Chen
, Xian Zhong
, Yi Zhang
, Lei Zhu
, Ping Li
, Xiaokang Yang
, Bin Sheng
:
Action-aware Linguistic Skeleton Optimization Network for Non-autoregressive Video Captioning. 326:1-326:24 - Yancun Yang
, Weiqing Min
, Jingru Song
, Guorui Sheng
, Lili Wang
, Shuqiang Jiang
:
Lightweight Food Recognition via Aggregation Block and Feature Encoding. 327:1-327:25 - Huaijin Liu
, Jixiang Du
, Yong Zhang
, Hongbo Zhang
, Jiandian Zeng
:
MSSA: Multi-Representation Semantics-Augmented Set Abstraction for 3D Object Detection. 328:1-328:23 - Vinicius Atsushi Sato Kawai
, Lucas Pascotti Valem
, Alexandro Baldassin
, Edson Borin
, Daniel Carlos Guimarães Pedronette
, Longin Jan Latecki
:
Rank-based Hashing for Effective and Efficient Nearest Neighbor Search for Image Retrieval. 329:1-329:19
Volume 20, Number 11, November 2024
- Ritesh Vyas
, Michele Nappi
, Alberto Del Bimbo
, Sambit Bakshi
:
Introduction to Special Issue on "Recent Trends in Multimedia Forensics". 330:1-330:7 - Vincenzo Carletti
, Pasquale Foggia
, Antonio Greco
, Alessia Saggese
, Mario Vento
:
Facial Soft-biometrics Obfuscation through Adversarial Attacks. 331:1-331:21 - Hanrui Wang
, Shuo Wang
, Cunjian Chen
, Massimo Tistarelli
, Zhe Jin
:
A Multi-Task Adversarial Attack against Face Authentication. 332:1-332:24 - Tian Wu
, Rongbo Zhu
, Shaohua Wan
:
Semantic Map Guided Identity Transfer GAN for Person Re-identification. 333:1-333:20 - Dhiran Kumar Mahto
, Amit Kumar Singh
, Kedar Nath Singh
, Om Prakash Singh
, Amrit Kumar Agrawal
:
Robust Copyright Protection Technique with High-embedding Capacity for Color Images. 334:1-334:12 - S. Shitharth
, Hariprasath Manoharan
, Alaa O. Khadidos
, Achyut Shankar
, Carsten Maple
, Adil Omar Khadidos
, Shahid Mumtaz
:
Improved Security for Multimedia Data Visualization using Hierarchical Clustering Algorithm. 335:1-335:21 - Youqiang Sun
, Jianyi Liu
, Ru Zhang
:
Generative Image Steganography Based on Guidance Feature Distribution. 336:1-336:18 - Paarth Neekhara
, Shehzeen Hussain
, Xinqiao Zhang
, Ke Huang
, Julian J. McAuley
, Farinaz Koushanfar
:
FaceSigns: Semi-fragile Watermarks for Media Authentication. 337:1-337:21 - Jing Zhao
, Hongwei Yang
, Hui He
, Jie Peng
, Weizhe Zhang
, Jiangqun Ni
, Arun Kumar Sangaiah
, Aniello Castiglione
:
Backdoor Two-Stream Video Models on Federated Learning. 338:1-338:20 - Farkhund Iqbal
, Ahmed Abbasi
, Abdul Rehman Javed
, Ahmad S. Almadhor
, Zunera Jalil
, Sajid Anwar
, Imad Rida
:
Data Augmentation-based Novel Deep Learning Method for Deepfaked Images Detection. 339:1-339:15 - Kaihan Lin
, Weihong Han
, Shudong Li
, Zhaoquan Gu
, Huimin Zhao
, Yangyang Mei
:
Detecting Deepfake Videos using Spatiotemporal Trident Network. 340:1-340:20 - Ijaz Ul Haq
, Khalid Mahmood Malik
, Khan Muhammad
:
Multimodal Neurosymbolic Approach for Explainable Deepfake Detection. 341:1-341:16 - Federico Becattini
, Carmen Bisogni
, Vincenzo Loia
, Chiara Pero
, Fei Hao
:
Head Pose Estimation Patterns as Deepfake Detectors. 342:1-342:24 - Luca Guarnera
, Oliver Giudice
, Sebastiano Battiato
:
Mastering Deepfake Detection: A Cutting-edge Approach to Distinguish GAN and Diffusion-model Images. 343:1-343:24 - Aakash Varma Nadimpalli
, Ajita Rattani
:
ProActive DeepFake Detection using GAN-based Visible Watermarking. 344:1-344:27 - Bachir Kaddar
, Sid Ahmed Fezza
, Zahid Akhtar
, Wassim Hamidouche
, Abdenour Hadid
, Joan Serra-Sagristà
:
Deepfake Detection Using Spatiotemporal Transformer. 345:1-345:21 - Shuai Xiao
, Zhuo Zhang
, Jiachen Yang
, Jiabao Wen
, Yang Li
:
Forgery Detection by Weighted Complementarity between Significant Invariance and Detail Enhancement. 346:1-346:20 - Paola Capasso
, Giuseppe Cattaneo
, Maria De Marsico
:
A Comprehensive Survey on Methods for Image Integrity. 347:1-347:34
- Yunfang Niu
, Lingxiang Wu
, Yufeng Zhang
, Yousong Zhu
, Guibo Zhu
, Jinqiao Wang
:
Multi-Model Style-Aware Diffusion Learning for Semantic Image Synthesis. 348:1-348:21 - Jingzheng Li
, Hailong Sun
, Lei Chai
, Jiyi Li
:
Target Structure Learning Framework for Unsupervised Multi-Class Domain Adaptation. 349:1-349:23 - Chih-Fan Hsu
, Yi-Chen Li
, Chung-Chi Tsai
, Jian-Kai Wang
, Cheng-Hsin Hsu
:
Federated Learning Using Multi-Modal Sensors with Heterogeneous Privacy Sensitivity Levels. 350:1-350:27 - Hengwei Li
, Wei Wang
, Xiao Wang
, Xin Yuan
, Xin Xu
:
Blind 3D Video Stabilization with Spatio-Temporally Varying Motion Blur. 351:1-351:23 - Shunan Mao
, Hao Chen
, Yaowei Wang
, Wei Zeng
, Shiliang Zhang
:
TPTE: Text-Guided Patch Token Exploitation for Unsupervised Fine-Grained Representation Learning. 352:1-352:18 - Aditya Panda
, Dipti Prasad Mukherjee
:
Knowledge Guided Transformer Network for Compositional Zero-Shot Learning. 353:1-353:25 - Wei-Yen Hsu
, Yu-Yu Hsu
:
Multi-Scale and Multi-Layer Lattice Transformer for Underwater Image Enhancement. 354:1-354:24 - Tengfei Shi
, Chenglizhao Chen
, Zhenyu Wu
, Aimin Hao
, Yuming Fang
:
Improving Image Aesthetic Assessment via Multiple Image Joint Learning. 355:1-355:24 - Gaurang Bansal
, Aditya Nawal
, Vinay Chamola
, Norbert Herencsar
:
Revolutionizing Visuals: The Role of Generative AI in Modern Image Generation. 356:1-356:22 - Yujie Li
, Xuekai Wei
, Xiaofeng Liao
, You Zhao
, Fan Jia
, Xu Zhuang
, Mingliang Zhou
:
A Deep Retinex-Based Low-Light Enhancement Network Fusing Rich Intrinsic Prior Information. 357:1-357:23 - Weimin Shi
, Dehong Gao
, Yuan Xiong
, Zhong Zhou
:
QR-CLIP: Introducing Explicit Knowledge for Location and Time Reasoning. 358:1-358:22 - Feiyang Liu
, Kun Li
, Zhun Zhong
, Wei Jia
, Bin Hu
, Xun Yang
, Meng Wang
, Dan Guo
:
Depth Matters: Spatial Proximity-Based Gaze Cone Generation for Gaze Following in Wild. 359:1-359:24 - Yonghui Wang
, Shaokai Liu
, Li Li
, Wengang Zhou
, Houqiang Li
:
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection. 360:1-360:20 - Xin Liu
, Chao Hao
, Zitong Yu
, Huanjing Yue
, Jingyu Yang
:
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation. 361:1-361:19 - Jiaxing Wen
, Aohong Shen
, Zhen Han
, Zhongyuan Wang, Liang Chen
:
Cross-Modal Face Super-Resolution Based on Quasi-Siamese Domain Transfer Fusion Network. 362:1-362:23
Volume 20, Number 12, December 2024
- Hongbin Wang
, Rui Tang
, Fan Li
:
Hypercube Pooling for Visual Semantic Embedding. 363:1-363:17 - Fei Wang
, Liang Ding
, Jun Rao
, Ye Liu
, Li Shen
, Changxing Ding
:
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? 364:1-364:22 - Caixia Liu
, Yali Chen
, Minhong Zhu
, Chenhui Hao
, Hai-Sheng Li
, Xiaochuan Wang
:
DEGAN: Detail-Enhanced Generative Adversarial Network for Monocular Depth-Based 3D Reconstruction. 365:1-365:17 - Dan Song
, Shumeng Huo
, Xinwei Fu
, Chu-Meng Zhang
, Wenhui Li
, An-An Liu
:
Cross-Modal Contrastive Learning with a Style-Mixed Bridge for Single Image 3D Shape Retrieval. 366:1-366:24 - Ting-Lan Lin
, Bing-Wei Su
, Po-Cheng Shen
, Ding-Yuan Chen
, Chi-Fu Liang
, Yan-Cheng Chen
, Yangming Wen
, Mohammad Shahid
:
Upsampling Algorithm for V-PCC-Coded 3D Point Clouds. 367:1-367:23 - Yuanzhi Wang
, Yong Li
, Xiaoya Zhang
, Xin Liu
, Anbo Dai
, Antoni B. Chan
, Zhen Cui
:
Edit Temporal-Consistent Videos with Image Diffusion Model. 368:1-368:16 - Luis Álvarez
, Agustín Trujillo, Nelson Monzón, Jean-Michel Morel
:
Generation and Editing of 2D Shapes Using a Branched Representation. 369:1-369:25 - Xingbo Liu
, Jiamin Li
, Xiushan Nie
, Xuening Zhang
, Yilong Yin
:
Fast Unsupervised Cross-Modal Hashing with Robust Factorization and Dual Projection. 370:1-370:21 - Yongheng Zhang
, Yuanqiang Cai
, Danfeng Yan
, Rongheng Lin
:
Real-World Scene Image Enhancement with Contrastive Domain Adaptation Learning. 371:1-371:23 - Chunqiang Yu
, Shichao Cheng
, Xianquan Zhang
, Xinpeng Zhang
, Zhenjun Tang
:
Reversible Data Hiding in Shared JPEG Images. 372:1-372:24 - Boqian Liu
, Haojie Li
, Zhihui Wang
, Tianfan Xue
:
Transparent Depth Completion Using Segmentation Features. 373:1-373:19 - Yongtang Bao
, Chunjian Su
, Yutong Qi
, Yanbing Geng
, Haojie Li
:
Category-Level Pose Estimation and Iterative Refinement for Monocular RGB-D Image. 374:1-374:20 - Kuiyuan Sun, Xiaolong Liu, Xiaolong Li, Yao Zhao, Wei Wang:
Multi-Modal Driven Pose-Controllable Talking Head Generation. 375:1-375:23 - Bing Liu
, Jinfu Lu
, Mingming Liu
, Hao Liu
, Yong Zhou
, Dongping Yang
:
Diverse Image Captioning via Panoptic Segmentation and Sequential Conditional Variational Transformer. 376:1-376:17 - Veronika Stephanie
, Ibrahim Khalil
, Mohammed Atiquzzaman
:
Weight-Based Privacy-Preserving Asynchronous SplitFed for Multimedia Healthcare Data. 377:1-377:24 - Chuanhao Li
, Chenchen Jing
, Zhen Li
, Yuwei Wu
, Yunde Jia
:
Adversarial Sample Synthesis for Visual Question Answering. 378:1-378:24 - Shipeng Zhu
, Jun Fang
, Pengfei Fang
, Hui Xue
:
Improving Scene Text Retrieval via Stylized Middle Modality. 379:1-379:18 - Xiao Liang
, Erkun Yang
, Cheng Deng
, Yanhua Yang
:
CrossFormer: Cross-Modal Representation Learning via Heterogeneous Graph Transformer. 380:1-380:21 - Jiayu Lin
, Yuan-Gen Wang
:
TSFormer: Tracking Structure Transformer for Image Inpainting. 381:1-381:23 - Yixuan Li
, Peilin Chen
, Hanwei Zhu
, Keyan Ding
, Leida Li
, Shiqi Wang
:
Deep Shape-Texture Statistics for Completely Blind Image Quality Evaluation. 382:1-382:21 - Zhenyu Zhou
, Qing Liao
, Lei Luo
, Xinwang Liu
, En Zhu
:
ProtoRefine: Enhancing Prototypes with Similar Structure in Few-Shot Learning. 383:1-383:24 - Mengzhu Yu
, Zhenjun Tang
, Xiaoping Liang
, Xianquan Zhang
, Zhixin Li
, Xinpeng Zhang
:
Robust Hashing with Deep Features and Meixner Moments for Image Copy Detection. 384:1-384:23 - Jiabei Liu
, Weiming Zhuang
, Yuanyuan Liu
, Yonggang Wen
, Jun Huang
, Wei Lin
:
Personalized Federated Mutual Learning for Unsupervised Camera-Aware Person Re-Identification. 385:1-385:19 - Yiyang Ma
, Haowei Kuang
, Huan Yang
, Jianlong Fu
, Jiaying Liu
:
Prompt-Based Modality Bridging for Unified Text-to-Face Generation and Manipulation. 386:1-386:23 - Peilin Chen
, Shiqi Wang
, Zhu Li
:
Occupancy Map Guided Attributes Artifacts Removal for Video-Based Point Cloud Compression. 387:1-387:20 - Yunda Sun
, Lin Zhang
, Zhong Wang
, Yang Chen
, Shengjie Zhao
, Yicong Zhou
:
I2P Registration by Learning the Underlying Alignment Feature Space from Pixel-to-Point Similarities. 388:1-388:21 - Daniel Gebre
, Siem Hadish
, Aron Sbhatu
, Moayad Aloqaily
, Mohsen Guizani
:
Establishing Trust and Security in Decentralized Metaverse: A Web 3.0 Approach. 389:1-389:17 - Yangjun Mao
, Jun Xiao
, Dong Zhang
, Meng Cao
, Jian Shao
, Yueting Zhuang
, Long Chen
:
Improving Reference-Based Distinctive Image Captioning with Contrastive Rewards. 390:1-390:24 - Shenglan Li
, Rui Yao
, Yong Zhou
, Hancheng Zhu
, Jiaqi Zhao
, Zhiwen Shao
, Abdulmotaleb El-Saddik
:
Motion-Aware Self-Supervised RGBT Tracking with Multi-Modality Hierarchical Transformers. 391:1-391:23 - Jun Ling
, Han Xue
, Anni Tang
, Rong Xie
, Li Song
:
ViCoFace: Learning Disentangled Latent Motion Representations for Visual-Consistent Face Reenactment. 392:1-392:24 - Jiachen Li
, Qing Xie
, Xiaojun Chang
, Jinyu Xu
, Yongjian Liu
:
Mutually-Guided Hierarchical Multi-Modal Feature Learning for Referring Image Segmentation. 393:1-393:18 - Fatima Alshehri
, Ghulam Muhammad
:
Ischemic Stroke Segmentation by Transformer and Convolutional Neural Network Using Few-Shot Learning. 394:1-394:21
- Kamran Gholizadeh HamlAbadi
, Fedwa Laamarti
, Abdulmotaleb El-Saddik
:
Meta-Review on Brain-Computer Interface (BCI) in the Metaverse. 395:1-395:42

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.