ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 20
Volume 20, Number 1, January 2024
- Zhenbo Xu, Hai-Miao Hu, Liu Liu, Dongping Zhang, Shifeng Zhang, Wenming Tan:
Instance-Based Continual Learning: A Real-World Dataset and Baseline for Fresh Recognition. 1:1-1:23
- Xiaoping Liang, Zhenjun Tang, Zhixin Li, Mengzhu Yu, Hanyun Zhang, Xianquan Zhang:
Robust Hashing via Global and Local Invariant Features for Image Copy Detection. 2:1-2:22
- Sandipan Sarma, Arijit Sur:
DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning. 3:1-3:23
- Chengyu Zheng, Ning Song, Ruoyu Zhang, Lei Huang, Zhiqiang Wei, Jie Nie:
Scale-Semantic Joint Decoupling Network for Image-Text Retrieval in Remote Sensing. 4:1-4:20
- Jiankai Li, Yunhong Wang, Weixin Li:
Zero-shot Scene Graph Generation via Triplet Calibration and Reduction. 5:1-5:21
- Abid Yaqoob, Gabriel-Miro Muntean:
Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR Streaming. 6:1-6:28
- Jia Wang, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Language-guided Residual Graph Attention Network and Data Augmentation for Visual Grounding. 7:1-7:23
- Haoran Wang, Yajie Wang, Baosheng Yu, Yibing Zhan, Chunfeng Yuan, Wankou Yang:
Attentional Composition Networks for Long-Tailed Human Action Recognition. 8:1-8:18
- Zi-Chao Zhang, Zhen-Duo Chen, Zhen-Yu Xie, Xin Luo, Xin-Shun Xu:
S3Mix: Same Category Same Semantics Mixing for Augmenting Fine-grained Images. 9:1-9:16
- Mingkui Tan, Zhiquan Wen, Leyuan Fang, Qi Wu:
Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. 10:1-10:23
- Yiming Yang, Weipeng Hu, Haifeng Hu:
Syncretic Space Learning Network for NIR-VIS Face Recognition. 11:1-11:25
- Chenghua Li, Zongze Li, Jing Sun, Yun Zhang, Xiaoping Jiang, Fan Zhang:
Dynamic Weighted Gradient Reversal Network for Visible-infrared Person Re-identification. 12:1-12:23
- Jiajun Song, Zhuo Li, Weiqing Min, Shuqiang Jiang:
Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design. 13:1-13:19
- Yiting Jin, Jie Wu, Wanliang Wang, Yidong Yan, Jiawei Jiang, Jianwei Zheng:
Cascading Blend Network for Image Inpainting. 14:1-14:21
- Kehua Guo, Liang Chen, Xiangyuan Zhu, Xiaoyan Kui, Jian Zhang, Heyuan Shi:
Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution. 15:1-15:23
- Jing Zhao, Bin Li, Jiahao Li, Ruiqin Xiong, Yan Lu:
A Universal Optimization Framework for Learning-based Image Codec. 16:1-16:19
- Liping Zhang, Shukai Chen, Fei Lin, Wei Ren, Kim-Kwang Raymond Choo, Geyong Min:
1DIEN: Cross-session Electrocardiogram Authentication Using 1D Integrated EfficientNet. 17:1-17:17
- Baian Chen, Zhilei Chen, Xiaowei Hu, Jun Xu, Haoran Xie, Jing Qin, Mingqiang Wei:
Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection. 18:1-18:21
- Xiang Gao, Wei Hu, Guo-Jun Qi:
Self-supervised Multi-view Learning via Auto-encoding 3D Transformations. 19:1-19:23
- Dewang Wang, Gaobo Yang, Zhiqing Guo, Jiyou Chen:
Enhancing Adversarial Embedding based Image Steganography via Clustering Modification Directions. 20:1-20:20
- Xiaojia Zhao, Tingting Xu, Qiangqiang Shen, Youfa Liu, Yongyong Chen, Jingyong Su:
Double High-Order Correlation Preserved Robust Multi-View Ensemble Clustering. 21:1-21:21
- Shuji Tasaka:
Usefulness of QoS in Multidimensional QoE Prediction for Haptic-Audiovisual Communications. 22:1-22:24
- Ching-Nung Yang, Xiaotian Wu, Min-Jung Chung:
Enhancement of Information Carrying and Decoding for Visual Cryptography with Error Correction. 23:1-23:24
- Yuqing Zhang, Yong Zhang, Shaofan Wang, Yun Liang, Baocai Yin:
Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network. 24:1-24:23
- Wenying Wen, Minghui Huang, Yushu Zhang, Yuming Fang, Yifan Zuo:
Visual Security Index Combining CNN and Filter for Perceptually Encrypted Light Field Images. 25:1-25:15
- Linlin Liu, Haijun Zhang, Qun Li, Jianghong Ma, Zhao Zhang:
Collocated Clothing Synthesis with GANs Aided by Textual Information: A Multi-Modal Framework. 26:1-26:25
- Xulei Lou, Tinghui Wu, Haifeng Hu, Dihu Chen:
Self-Supervised Consistency Based on Joint Learning for Unsupervised Person Re-identification. 27:1-27:20
- Yichi Zhang, Gongchun Ding, Dandan Ding, Zhan Ma, Zhu Li:
On Content-Aware Post-Processing: Adapting Statistically Learned Models to Dynamic Content. 28:1-28:23
- Jing Xu, Bing Liu, Yong Zhou, Mingming Liu, Rui Yao, Zhiwen Shao:
Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning. 29:1-29:16
- Cong Zou, Rui Wang, Cheng Jin, Sanyi Zhang, Xin Wang:
S2CL-Leaf Net: Recognizing Leaf Images Like Human Botanists. 30:1-30:20
Volume 20, Number 2, February 2024
- Suyel Namasudra, Pascal Lorenz, Seifedine Kadry, Syed Ahmad Chan Bukhari:
Introduction to the Special Issue on DNA-centric Modeling and Practice for Next-generation Computing and Communication Systems. 31:1-31:2
- Shaohua Wan, Yi Jin, Guangdong Xu, Michele Nappi:
Editorial to Special Issue on Multimedia Cognitive Computing for Intelligent Transportation System. 32:1-32:2
- Ruonan Zhao, Laurence T. Yang, Debin Liu, Wanli Lu, Chenlu Zhu, Yiheng Ruan:
Tensor-Empowered LSTM for Communication-Efficient and Privacy-Enhanced Cognitive Federated Learning in Intelligent Transportation Systems. 33:1-33:21
- Hongjian Shi, Hao Wang, Ruhui Ma, Yang Hua, Tao Song, Honghao Gao, Haibing Guan:
Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation System. 34:1-34:23
- Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang:
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. 35:1-35:18
- Shixiong Zhang, Wenmin Wang, Honglei Li, Shenyong Zhang:
E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System. 36:1-36:20
- Ram Prasad Padhy, Pankaj Kumar Sa, Fabio Narducci, Carmen Bisogni, Sambit Bakshi:
Monocular Vision-aided Depth Measurement from RGB Images for Autonomous UAV Navigation. 37:1-37:22
- Zhihan Lv, Fabio Poiesi, Qi Dong, Jaime Lloret, Houbing Song:
Special Issue on Deep Learning for Intelligent Human Computer Interaction. 38:1-38:5
- Wenjuan Gong, Yue Zhang, Wei Wang, Peng Cheng, Jordi Gonzàlez:
Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition. 39:1-39:20
- Youcef Djenouri, Asma Belhadi, Gautam Srivastava, Jerry Chun-Wei Lin:
An Efficient and Accurate GPU-based Deep Learning Model for Multimedia Recommendation. 40:1-40:18
- Loveleen Gaur, Mohan Bhandari, Bhadwal Singh Shikhar, NZ Jhanjhi, Mohammad Shorfuzzaman, Mehedi Masud:
Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer's Disease. 41:1-41:16
- Mi Li, Wei Zhang, Bin Hu, Jiaming Kang, Yuqi Wang, Shengfu Lu:
Automatic Assessment of Depression and Anxiety through Encoding Pupil-wave from HCI in VR Scenes. 42:1-42:22
- Abdul Qayyum, Imran Razzak, Muhammad Tanveer, Moona Mazher:
Spontaneous Facial Behavior Analysis Using Deep Transformer-based Framework for Child-computer Interaction. 43:1-43:17
- Xiaowei Chen, Xiao Jiang, Lishuang Zhan, Shihui Guo, Qunsheng Ruan, Guoliang Luo, Minghong Liao, Yipeng Qin:
Full-body Human Motion Reconstruction with Sparse Joint Tracking Using Flexible Sensors. 44:1-44:19
- Shanbao Qiao, Neal N. Xiong, Yongbin Gao, Zhijun Fang, Wenjun Yu, Juan Zhang, Xiaoyan Jiang:
Self-Supervised Learning of Depth and Ego-Motion for 3D Perception in Human Computer Interaction. 45:1-45:21
- Yan Kang, Bin Pu, Yongqi Kou, Yun Yang, Jianguo Chen, Khan Muhammad, Po Yang, Lida Xu, Mohammad Hijji:
A Deep Graph Network with Multiple Similarity for User Clustering in Human-Computer Interaction. 46:1-46:20
- Bahar Uddin Mahmud, Guan Y. Hong, Bernard Fong:
A Study of Human-AI Symbiosis for Creative Work: Recent Developments and Future Directions in Deep Learning. 47:1-47:21
- Xiaoling Gu, Jie Huang, Yongkang Wong, Jun Yu, Jianping Fan, Pai Peng, Mohan S. Kankanhalli:
PAINT: Photo-realistic Fashion Design Synthesis. 48:1-48:23
- Qingfeng Dai, Yongkang Wong, Guofei Sun, Yanwei Wang, Zhou Zhou, Mohan S. Kankanhalli, Xiangdong Li, Weidong Geng:
Unsupervised Domain Adaptation by Causal Learning for Biometric Signal-based HCI. 49:1-49:18
- Yi Xiao, Tong Liu, Yu Han, Yue Liu, Yongtian Wang:
Realtime Recognition of Dynamic Hand Gestures in Practical Applications. 50:1-50:17
- Jianping Gou, Liyuan Sun, Baosheng Yu, Shaohua Wan, Dacheng Tao:
Hierarchical Multi-Attention Transfer for Knowledge Distillation. 51:1-51:20
- Subhrajyoti Deb, Abhilash Kumar Das, Nirmalya Kar:
An Applied Image Cryptosystem on Moore's Automaton Operating on δ (qk)/𝔽2. 52:1-52:20
- Sisi You, Yukun Zuo, Hantao Yao, Changsheng Xu:
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene. 53:1-53:19
- Shiqi Sun, Danlan Huang, Xiaoming Tao, Chengkang Pan, Guangyi Liu, Changwen Chen:
Boosting Scene Graph Generation with Contextual Information. 54:1-54:24
- Jianwei Zheng, Yu Liu, Yuchao Feng, Honghui Xu, Meiyu Zhang:
Contrastive Attention-guided Multi-level Feature Registration for Reference-based Super-resolution. 55:1-55:21
- Shangxi Wu, Jitao Sang, Kaiyan Xu, Guanhua Zheng, Changsheng Xu:
Adaptive Adversarial Logits Pairing. 56:1-56:16
- Ying Chen, Rui Yao, Yong Zhou, Jiaqi Zhao, Bing Liu, Abdulmotaleb El-Saddik:
Black-box Attack against Self-supervised Video Object Segmentation Models with Contrastive Loss. 57:1-57:21
- Shuang Liang, Wentao Ma, Chi Xie:
Relation with Free Objects for Action Recognition. 58:1-58:19
- Qiaolin He, Zhijie Zheng, Haifeng Hu:
A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identification. 59:1-59:20
- Wuliang Huang, Yiqiang Chen, Xinlong Jiang, Teng Zhang, Qian Chen:
GJFusion: A Channel-Level Correlation Construction Method for Multimodal Physiological Signal Fusion. 60:1-60:23
Volume 20, Number 3, March 2024
- Chengji Shen, Zhenjiang Liu, Xin Gao, Zunlei Feng, Mingli Song:
Self-Adaptive Clothing Mapping Based Virtual Try-on. 61:1-61:26
- Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features. 62:1-62:24
- Yan Wang, Peize Li, Qingyi Si, Hanwen Zhang, Wenyu Zang, Zheng Lin, Peng Fu:
Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering. 63:1-63:22
- Qiang Guo, Zhi Zhang, Mingliang Zhou, Hong Yue, Huayan Pu, Jun Luo:
Image Defogging Based on Regional Gradient Constrained Prior. 64:1-64:17
- Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao:
PLACE Dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization. 65:1-65:23
- Yuan Xiong, Jingru Wang, Zhong Zhou:
VirtualLoc: Large-scale Visual Localization Using Virtual Images. 66:1-66:19
- Yiheng Zhang, Ting Yao, Zhaofan Qiu, Tao Mei:
Explaining Cross-domain Recognition with Interpretable Deep Classifier. 67:1-67:21
- Ruimin Wang, Fasheng Wang, Yiming Su, Jing Sun, Fuming Sun, Haojie Li:
Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection. 68:1-68:22
- Jemily Rime, Alan Archer-Boyd, Tom Collins:
How Will You Pod? Implications of Creators' Perspectives for Designing Innovative Podcasting Tools. 69:1-69:25
- Ming Cheung:
Learning from the Past: Fast NAS for Tasks and Datasets. 70:1-70:18
- Xinyue Li, Haiyong Xu, Gangyi Jiang, Mei Yu, Ting Luo, Xuebo Zhang, Hongwei Ying:
Underwater Image Quality Assessment from Synthetic to Real-world: Dataset and Objective Method. 71:1-71:23
- Sujuan Hou, Jiacheng Li, Weiqing Min, Qiang Hou, Yanna Zhao, Yuanjie Zheng, Shuqiang Jiang:
Deep Learning for Logo Detection: A Survey. 72:1-72:23
- Yunjie Peng, Jinlin Wu, Boqiang Xu, Chunshui Cao, Xu Liu, Zhenan Sun, Zhiqiang He:
Deep Learning Based Occluded Person Re-Identification: A Survey. 73:1-73:27
- Muhammad Arslan Manzoor, Sarah Albarri, Ziting Xian, Zaiqiao Meng, Preslav Nakov, Shangsong Liang:
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications. 74:1-74:34
- Yanyan Shi, Shaowu Yang, Wenjing Yang, Dianxi Shi, Xuehui Li:
Boosting Few-shot Object Detection with Discriminative Representation and Class Margin. 75:1-75:19
- Harry Cheng, Yangyang Guo, Tianyi Wang, Qi Li, Xiaojun Chang, Liqiang Nie:
Voice-Face Homogeneity Tells Deepfake. 76:1-76:22
- Jin Ye, Meng Dan, Wenchao Jiang:
A Visual Sensitivity Aware ABR Algorithm for DASH via Deep Reinforcement Learning. 77:1-77:22
- Jian Wang, Xiao Wang, Guosheng Zhao:
Task Recommendation via Heterogeneous Multi-modal Features and Decision Fusion in Mobile Crowdsensing. 78:1-78:20
- Si-chao Lei, Yue-Jiao Gong, Xiaolin Xiao, Yicong Zhou, Jun Zhang:
Boosting Diversity in Visual Search with Pareto Non-Dominated Re-Ranking. 79:1-79:23
- Huijie Zhang, Pu Li, Xiaobai Liu, Xianfeng Terry Yang, Li An:
An Iterative Semi-supervised Approach with Pixel-wise Contrastive Loss for Road Extraction in Aerial Images. 80:1-80:21
- Jing Fang, Yinbo Yu, Zhongyuan Wang, Xin Ding, Ruimin Hu:
An Image Arbitrary-Scale Super-Resolution Network Using Frequency-domain Information. 81:1-81:23
- Xiao Luo, Wei Ju, Yiyang Gu, Yifang Qin, Siyu Yi, Daqing Wu, Luchen Liu, Ming Zhang:
Toward Effective Semi-supervised Node Classification with Hybrid Curriculum Pseudo-labeling. 82:1-82:19
- Wen Guo, Wuzhou Quan, Junyu Gao, Tianzhu Zhang, Changsheng Xu:
Feature Disentanglement Network: Multi-Object Tracking Needs More Differentiated Features. 83:1-83:22
- Mohammed Khaleel, Azeez Idris, Wallapak Tavanapong, Jacob Pratt, Jung-Hwan Oh, Piet C. de Groen:
VisActive: Visual-concept-based Active Learning for Image Classification under Class Imbalance. 84:1-84:21
- Honghua Chen, Zhiqi Li, Mingqiang Wei, Jun Wang:
Geometric and Learning-Based Mesh Denoising: A Comprehensive Survey. 85:1-85:28
- Ning Han, Yawen Zeng, Chuhao Shi, Guangyi Xiao, Hao Chen, Jingjing Chen:
BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval. 86:1-86:21
- Yuan Feng, Yaojun Hu, Pengfei Fang, Sheng Liu, Yanhong Yang, Shengyong Chen:
Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal. 87:1-87:23
- Yurui Xie, Ling Guan:
Sparsity-guided Discriminative Feature Encoding for Robust Keypoint Detection. 88:1-88:22
- Nicolas Beuve, Wassim Hamidouche, Olivier Déforges:
Hierarchical Learning and Dummy Triplet Loss for Efficient Deepfake Detection. 89:1-89:18
- Suncheng Xiang, Dahong Qian, Jingsheng Gao, Zirui Zhang, Ting Liu, Yuzhuo Fu:
Rethinking Person Re-Identification via Semantic-based Pretraining. 90:1-90:17
Volume 20, Number 4, April 2024
- Min Peng, Xiaohu Shao, Yu Shi, Xiangdong Zhou:
Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering. 91:1-91:22
- Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Philip Torr, Nicu Sebe:
Cloth Interactive Transformer for Virtual Try-On. 92:1-92:20
- Xiushan Nie, Yang Shi, Ziyu Meng, Jin Huang, Weili Guan, Yilong Yin:
Complex Scenario Image Retrieval via Deep Similarity-aware Hashing. 93:1-93:24
- Jiawei Tan, Hongxing Wang, Junsong Yuan:
Characters Link Shots: Character Attention Network for Movie Scene Segmentation. 94:1-94:23
- Mingliang Zhou, Xinwen Zhao, Futing Luo, Jun Luo, Huayan Pu, Tao Xiang:
Robust RGB-T Tracking via Adaptive Modality Weight Correlation Filters and Cross-modality Learning. 95:1-95:20
- Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai:
Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images. 96:1-96:22
- Shuvendu Roy, Ali Etemad:
Contrastive Learning of View-invariant Representations for Facial Expressions Recognition. 97:1-97:22
- Jun Liu, Jiantao Zhou, Haiwei Wu, Weiwei Sun, Jinyu Tian:
Generating Robust Adversarial Examples against Online Social Networks (OSNs). 98:1-98:26
- Tao Yao, Yiru Li, Ying Li, Yingying Zhu, Gang Wang, Jun Yue:
Cross-modal Semantically Augmented Network for Image-text Matching. 99:1-99:18
- Ahmed Telili, Sid Ahmed Fezza, Wassim Hamidouche, Hanene Brachemi Meftah:
2BiVQA: Double Bi-LSTM-based Video Quality Assessment of UGC Videos. 100:1-100:22
- Hongzhou Chen, Haihan Duan, Maha Abdallah, Yufeng Zhu, Yonggang Wen, Abdulmotaleb El-Saddik, Wei Cai:
Web3 Metaverse: State-of-the-Art and Vision. 101:1-101:42
- Lilong Wang, Yunhui Shi, Jin Wang, Shujun Chen, Baocai Yin, Nam Ling:
Graph Based Cross-Channel Transform for Color Image Compression. 102:1-102:25
- Kai Han, Yu Liu, Rukai Wei, Ke Zhou, Jinhui Xu, Kun Long:
Supervised Hierarchical Online Hashing for Cross-modal Retrieval. 103:1-103:23
- Fengyi Fu, Shancheng Fang, Weidong Chen, Zhendong Mao:
Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting. 104:1-104:24
- Yuxiang Peng, Chong Fu, Guixing Cao, Wei Song, Junxin Chen, Chiu-Wing Sham:
JPEG-compatible Joint Image Compression and Encryption Algorithm with File Size Preservation. 105:1-105:20
- Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Zichuan Xu, Haozhao Wang, Xing Di, Weining Lu, Yu Cheng:
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding. 106:1-106:19
- Yijie Hu, Bin Dong, Kaizhu Huang, Lei Ding, Wei Wang, Xiaowei Huang, Qiu-Feng Wang:
Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment. 107:1-107:20
- Rongjiao Liang, Shichao Zhang, Wenzhen Zhang, Guixian Zhang, Jinyun Tang:
Nonlocal Hybrid Network for Long-tailed Image Classification. 108:1-108:22
- Piao Shi, Min Hu, Xuefeng Shi, Fuji Ren:
Deep Modular Co-Attention Shifting Network for Multimodal Sentiment Analysis. 109:1-109:23
- Jing Zhang, Dan Guo, Xun Yang, Peipei Song, Meng Wang:
Visual-linguistic-stylistic Triple Reward for Cross-lingual Image Captioning. 110:1-110:23
- Zhaoyang Jia, Yan Lu, Houqiang Li:
Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis. 111:1-111:20
- Sheng Zhou, Dan Guo, Xun Yang, Jianfeng Dong, Meng Wang:
Graph Pooling Inference Network for Text-based VQA. 112:1-112:21
- Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian:
One-Bit Supervision for Image Classification: Problem, Solution, and Beyond. 113:1-113:22
- Hang Yuan, Wei Gao, Siwei Ma, Yiqiang Yan:
Divide-and-conquer-based RDO-free CU Partitioning for 8K Video Compression. 114:1-114:20
- Mingyu Li, Tao Zhou, Zhuo Huang, Jian Yang, Jie Yang, Chen Gong:
Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class Mismatch. 115:1-115:24
- Hui Huang, Di Xiao, Jia Liang:
Secure Low-complexity Compressive Sensing with Preconditioning Prior Regularization Reconstruction. 116:1-116:22
- Nathan Clement, Alan Schoen, Arnold P. Boedihardjo, Andrew Jenkins:
Synthetic Data and Hierarchical Object Detection in Overhead Imagery. 117:1-117:20
- Jiang Bian, Xuhong Li, Tao Wang, Qingzhong Wang, Jun Huang, Chen Liu, Jun Zhao, Feixiang Lu, Dejing Dou, Haoyi Xiong:
P2ANet: A Large-Scale Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos. 118:1-118:23
- Jifan Yang, Zhongyuan Wang, Guangcheng Wang, Baojin Huang, Yuhong Yang, Weiping Tu:
Auxiliary Information Guided Self-attention for Image Quality Assessment. 119:1-119:23
- Zhanzhou Feng, Jiaming Xu, Lei Ma, Shiliang Zhang:
Efficient Video Transformers via Spatial-temporal Token Merging for Action Recognition. 120:1-120:21
Volume 20, Number 5, May 2024
- Shupei Zhang, Chenqiu Zhao, Anup Basu:
Principal Component Approximation Network for Image Compression. 121:1-121:20
- Tianyu Zhang, Weiqing Min, Tao Liu, Shuqiang Jiang, Yong Rui:
Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing. 122:1-122:21
- Yu Liu, Mingbo Zhao, Zhao Zhang, Yuping Liu, Shuicheng Yan:
Arbitrary Virtual Try-on Network: Characteristics Preservation and Tradeoff between Body and Clothing. 123:1-123:23
- Shih-Wei Yang, Li-Hsiang Shen, Hong-Han Shuai, Kai-Ten Feng:
CMAF: Cross-Modal Augmentation via Fusion for Underwater Acoustic Image Recognition. 124:1-124:25
- Yazhou Zhang, Yang Yu, Mengyao Wang, Min Huang, M. Shamim Hossain:
Self-Adaptive Representation Learning Model for Multi-Modal Sentiment and Sarcasm Joint Analysis. 125:1-125:17
- Lei Qi, Peng Dong, Tan Xiong, Hui Xue, Xin Geng:
DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory. 126:1-126:20
- Dan Shi, Lei Zhu, Jingjing Li, Guohua Dong, Huaxiang Zhang:
Incomplete Cross-Modal Retrieval with Deep Correlation Transfer. 127:1-127:21
- Xianhua Zeng, Xinyu Wang, Yicai Xie:
Multiple Pseudo-Siamese Network with Supervised Contrast Learning for Medical Multi-modal Retrieval. 128:1-128:23
- Sisi You, Hantao Yao, Bing-Kun Bao, Changsheng Xu:
Multi-object Tracking with Spatial-Temporal Tracklet Association. 129:1-129:21
- Gülnaziye Bingöl, Simone Porcu, Alessandro Floris, Luigi Atzori:
QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features. 130:1-130:23
- Heqian Qiu, Hongliang Li, Qingbo Wu, Hengcan Shi, Lanxiao Wang, Fanman Meng, Linfeng Xu:
Learning Offset Probability Distribution for Accurate Object Detection. 131:1-131:24
- Alessandro Floris, Simone Porcu, Luigi Atzori:
Controlling Media Player with Hands: A Transformer Approach and a Quality of Experience Assessment. 132:1-132:22
- Jingyu Li, Zhendong Mao, Hao Li, Weidong Chen, Yongdong Zhang:
Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning. 133:1-133:23
- Zeyu Ma, Siwei Wang, Xiao Luo, Zhonghui Gu, Chong Chen, Jinxing Li, Xian-Sheng Hua, Guangming Lu:
HARR: Learning Discriminative and High-Quality Hash Codes for Image Retrieval. 134:1-134:23
- Chengyang Zhang, Yong Zhang, Bo Li, Xinglin Piao, Baocai Yin:
CrowdGraph: Weakly supervised Crowd Counting via Pure Graph Neural Network. 135:1-135:23
- Jie Wang, Guoqiang Li, Jie Shi, Jinwen Xi:
Weighted Guided Optional Fusion Network for RGB-T Salient Object Detection. 136:1-136:20
- Yibo Zhang, Weiguo Lin, Junfeng Xu:
Joint Audio-Visual Attention with Contrastive Learning for More General Deepfake Detection. 137:1-137:23
- Depei Wang, Ruifeng Xu, Lianglun Cheng, Zhuowei Wang:
Knowledge-integrated Multi-modal Movie Turning Point Identification. 138:1-138:19
- Chunpu Liu, Guanglei Yang, Wangmeng Zuo, Tianyi Zang:
DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation. 139:1-139:21
- Yunyao Yan, Guoqing Xiang, Huizhu Jia, Jie Chen, Xiaofeng Huang, Xiaodong Xie:
Two-Stage Perceptual Quality Oriented Rate Control Algorithm for HEVC. 140:1-140:20
- Zongyi Li, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Boyuan Liu, Runsheng Wang, Chengxin Zhao:
Viewpoint Disentangling and Generation for Unsupervised Object Re-ID. 141:1-141:23
- Kuai Dai, Xutao Li, Huiwei Lin, Yin Jiang, Xunlai Chen, Yunming Ye, Di Xian:
TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction. 142:1-142:24
- Yingnan Ma, Chenqiu Zhao, Bingran Huang, Xudong Li, Anup Basu:
RAST: Restorable Arbitrary Style Transfer. 143:1-143:21
- Wei-Yen Hsu, Hsien-Wen Lin:
Context-detail-aware United Network for Single Image Deraining. 144:1-144:18
- Yao Liu, Gangfeng Cui, Jiahui Luo, Xiaojun Chang, Lina Yao:
Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition. 145:1-145:22
- Chengxin Chen, Pengyuan Zhang:
Modality-collaborative Transformer with Hybrid Feature Reconstruction for Robust Emotion Recognition. 146:1-146:23
- Jiafeng Huang, Tianjun Zhang, Shengjie Zhao, Lin Zhang, Yicong Zhou:
An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks. 147:1-147:23
- Jing Liu, Litao Shang, Yuting Su, Weizhi Nie, Xin Wen, Anan Liu:
Privacy-preserving Multi-source Cross-domain Recommendation Based on Knowledge Graph. 148:1-148:18
- Xingyu Liu, Zhongyun Hua, Shuang Yi, Yushu Zhang, Yicong Zhou:
Bi-directional Block Encoding for Reversible Data Hiding over Encrypted Images. 149:1-149:23
- Peng Yi, Zhongyuan Wang, Laigan Luo, Kui Jiang, Zheng He, Junjun Jiang, Tao Lu, Jiayi Ma:
Omniscient Video Super-Resolution with Explicit-Implicit Alignment. 150:1-150:23
Volume 20, Number 6, June 2024
- Amit Kumar Singh, Deepa Kundur, Mauro Conti:
Introduction to the Special Issue on Integrity of Multimedia and Multimodal Data in Internet of Things. 151:1-151:4
- Wenyuan Yang, Shaocong Wu, Jianwei Fei, Xianwang Zeng, Yuemin Ding, Zhihua Xia:
A Bitcoin-based Secure Outsourcing Scheme for Optimization Problem in Multimedia Internet of Things. 152:1-152:23
- Qingzhi Liu, Yuchen Huang, Chenglu Jin, Xiaohan Zhou, Ying Mao, Cagatay Catal, Long Cheng:
Privacy and Integrity Protection for IoT Multimodal Data Using Machine Learning and Blockchain. 153:1-153:18
- Simon Lucas Jonker, Malthe Jelstrup, Weizhi Meng, Brooke Lampe:
Detecting Post Editing of Multimedia Images using Transfer Learning and Fine Tuning. 154:1-154:22
- Carmen Bisogni, Lucia Cascone, Michele Nappi, Chiara Pero:
IoT-enabled Biometric Security: Enhancing Smart Car Safety with Depth-based Head Pose Estimation. 155:1-155:24
- Saif E. Nouma, Attila A. Yavuz:
Trustworthy and Efficient Digital Twins in Post-Quantum Era with Hybrid Hardware-Assisted Signatures. 156:1-156:30
- Fan Li, Yanxiang Chen, Haiyang Liu, Zuxing Zhao, Yuanzhi Yao, Xin Liao:
Vocoder Detection of Spoofing Speech Based on GAN Fingerprints and Domain Generalization. 157:1-157:20
- Jing Gao, Peng Li, Asif Ali Laghari, Gautam Srivastava, Thippa Reddy Gadekallu, Sidra Abbas, Jianing Zhang:
Incomplete Multiview Clustering via Semidiscrete Optimal Transport for Multimedia Data Mining in IoT. 158:1-158:20
- Zhenyu Liu, Da Li, Xinyu Zhang, Zhang Zhang, Peng Zhang, Caifeng Shan, Jungong Han:
Pedestrian Attribute Recognition via Spatio-temporal Relationship Learning for Visual Surveillance. 159:1-159:15
- Manvi Jha, Ashish Kumar Bhandari:
NSDIE: Noise Suppressing Dark Image Enhancement Using Multiscale Retinex and Low-Rank Minimization. 160:1-160:22
- Wenhao Fang, Jiayuan Xie, Hongfei Liu, Jiali Chen, Yi Cai:
Diverse Visual Question Generation Based on Multiple Objects Selection. 161:1-161:22
- Yichi Zhang, Dandan Ding, Zhan Ma, Zhu Li:
A Reconfigurable Framework for Neural Network Based Video In-Loop Filtering. 162:1-162:20
- Ronglai Zuo, Brian Mak:
Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal. 163:1-163:25
- Qinglin Liu, Quanling Meng, Xiaoqian Lv, Zonglin Li, Wei Yu, Shengping Zhang:
Human Selective Matting. 164:1-164:23
- Shenshen Li, Xing Xu, Xun Jiang, Fumin Shen, Zhe Sun, Andrzej Cichocki:
Cross-Modal Attention Preservation with Self-Contrastive Learning for Composed Query-Based Image Retrieval. 165:1-165:22
- Xizhong Wang, Rui Liu, Xin Yang, Qiang Zhang, Dongsheng Zhou:
MCFNet: Multi-Attentional Class Feature Augmentation Network for Real-Time Scene Parsing. 166:1-166:17
- Yanzhe Chen, Jiahuan Zhou, Yuxin Peng:
SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback. 167:1-167:17
- Huan Liu, Xiaolong Liu, Zichang Tan, Xiaolong Li, Yao Zhao:
PADVG: A Simple Baseline of Active Protection for Audio-Driven Video Generation. 168:1-168:19
- Yadong Huo, Qibing Qin, Jiangyan Dai, Wenfeng Zhang, Lei Huang, Chengduan Wang:
Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval. 169:1-169:23
- Yunyi Li, Fu Xiao, Wei Liang, Linqing Gui:
Multiply Complementary Priors for Image Compressive Sensing Reconstruction in Impulsive Noise. 170:1-170:22
- Weichao Zhao, Hezhen Hu, Wengang Zhou, Li Li, Houqiang Li:
Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video. 171:1-171:18
- Aashania Antil, Chhavi Dhiman:
MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofing. 172:1-172:21
- M. Shamim Hossain, Yixue Hao, Long Hu, Jia Liu, Gang Wei, Min Chen:
Immersive Multimedia Service Caching in Edge Cloud with Renewable Energy. 173:1-173:23
- Ying Ying Zhang, Shuo Zhang, Ming Hui:
Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection. 174:1-174:23
- Xuelin Liu, Jiebin Yan, Liping Huang, Yuming Fang, Zheng Wan, Yang Liu:
Perceptual Quality Assessment of Omnidirectional Images: A Benchmark and Computational Model. 175:1-175:24
- Yuhao Cheng, Yichao Yan, Wenhan Zhu, Ye Pan, Bowen Pan, Xiaokang Yang:
Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation. 176:1-176:20
- Hao Chen, Yunlong Yu, Yonghan Dong, Zheming Lu, Yingming Li, Zhongfei Zhang:
Multi-Content Interaction Network for Few-Shot Segmentation. 177:1-177:20
- Zicheng Zhang, Wei Sun, Haoning Wu, Yingjie Zhou, Chunyi Li, Zijian Chen, Xiongkuo Min, Guangtao Zhai, Weisi Lin:
GMS-3DQA: Projection-Based Grid Mini-patch Sampling for 3D Model Quality Assessment. 178:1-178:19
- Jun Lyu, Guangming Wang, M. Shamim Hossain:
Iterative Temporal-spatial Transformer-based Cardiac T1 Mapping MRI Reconstruction. 179:1-179:18
- Yuanjie Dang, Chunxia Huang, Peng Chen, Dongdong Zhao, Nan Gao, Ronghua Liang, Ruohong Huan:
Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization. 180:1-180:21
- Qiong Chen, Tianlin Huang, Qingfa Liu:
SWRM: Similarity Window Reweighting and Margin for Long-Tailed Recognition. 181:1-181:18
- Peiguang Jing, Xianyi Liu, Lijuan Zhang, Yun Li, Yu Liu, Yuting Su:
Multimodal Attentive Representation Learning for Micro-video Multi-label Classification. 182:1-182:23
- Qingbao Huang, Pijian Li, Youji Huang, Feng Shuang, Yi Cai:
Region-Focused Network for Dense Captioning. 183:1-183:20
- Lei Qi, Hongpeng Yang, Yinghuan Shi, Xin Geng:
MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization. 184:1-184:21
- Yucheng Suo, Zhedong Zheng, Xiaohan Wang, Bang Zhang, Yi Yang:
Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation. 185:1-185:18
Volume 20, Number 7, July 2024
- Roberto García, Ana Cediel, Mercè Teixidó, Rosa Gil:
Semantics and Non-fungible Tokens for Copyright Management on the Metaverse and Beyond. 186:1-186:20
- Tianxiu Xie, Keke Gai, Liehuang Zhu, Shuo Wang, Zijian Zhang:
RAC-Chain: An Asynchronous Consensus-based Cross-chain Approach to Scalable Blockchain for Metaverse. 187:1-187:24
- Yongjun Ren, Zhiying Lv, Neal N. Xiong, Jin Wang:
HCNCT: A Cross-chain Interaction Scheme for the Blockchain-based Metaverse. 188:1-188:23
- Shuang-Min Chen, Rui Xu, Jian Xu, Shiqing Xin, Changhe Tu, Chenglei Yang, Lin Lu:
QuickCSGModeling: Quick CSG Operations Based on Fusing Signed Distance Fields for VR Modeling. 189:1-189:18
- Qinnan Zhang, Zehui Xiong, Jianming Zhu, Sheng Gao, Wanting Yang:
A Privacy-preserving Auction Mechanism for Learning Model as an NFT in Blockchain-driven Metaverse. 190:1-190:24
- Han Wang, Hui Li, Abla Smahi, Feng Zhao, Yao Yao, Ching Chuen Chan, Shiyu Wang, Wenyuan Yang, Shuo-Yen Robert Li:
MIS: A Multi-Identifier Management and Resolution System in the Metaverse. 191:1-191:25
- Fei Peng, Le Qin, Min Long, Jin Li:
Detection of Adversarial Facial Accessory Presentation Attacks Using Local Face Differential. 192:1-192:28
- Xinjian Gao, Ye Pang, Yuyu Liu, Maokun Han, Jun Yu, Wei Wang, Yuanxu Chen:
Multimodal Visual-Semantic Representations Learning for Scene Text Recognition. 193:1-193:18
- Si-chao Lei, Yue-Jiao Gong, Xiaolin Xiao, Yicong Zhou, Jun Zhang:
Tensorial Evolutionary Optimization for Natural Image Matting. 194:1-194:23
- Jifan Yang, Zhongyuan Wang, Baojin Huang, Jiaxin Ai, Yuhong Yang, Zixiang Xiong:
Joint Distortion Restoration and Quality Feature Learning for No-reference Image Quality Assessment. 195:1-195:20
- Weiyao Lin, Yufeng Zhang, Wenrui Dai, Huabin Liu, John See, Hongkai Xiong:
Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations. 196:1-196:23
- Xiaofeng Qu, Li Liu, Lei Zhu, Liqiang Nie, Huaxiang Zhang:
Instance-level Adversarial Source-free Domain Adaptive Person Re-identification. 197:1-197:22
- Runyu Yang, Dong Liu, Siwei Ma, Feng Wu, Wen Gao:
Perceptual Quality-Oriented Rate Allocation via Distillation from End-to-End Image Compression. 198:1-198:22
- Liangzhe Chen, Wei Li, Xiaohui Cui, Zhenyu Wang, Stefano Berretti, Shaohua Wan:
MS-GDA: Improving Heterogeneous Recipe Representation via Multinomial Sampling Graph Data Augmentation. 199:1-199:23
- Lei Gao, Zheng Guo, Ling Guan:
An Optimal Edge-weighted Graph Semantic Correlation Framework for Multi-view Feature Representation Learning. 200:1-200:23
- Xiaoping Liang, Wanting Liu, Xianquan Zhang, Zhenjun Tang:
Robust Image Hashing via CP Decomposition and DCT for Copy Detection. 201:1-201:22
- Feng Li, Yixuan Wu, Anqi Li, Huihui Bai, Runmin Cong, Yao Zhao:
Enhanced Video Super-Resolution Network towards Compressed Data. 202:1-202:21
- Penglei Gao, Xi Yang, Rui Zhang, Kaizhu Huang:
Continuous Image Outpainting with Neural ODE. 203:1-203:16
- Jaime Ruiz-Serra, Jack White, Stephen M. Petrie, Tatiana Kameneva, Chris McCarthy:
Learning Scene Representations for Human-assistive Displays Using Self-attention Networks. 204:1-204:26
- Jinjia Peng, Song Pengpeng, Hui Li, Huibing Wang:
ReFID: Reciprocal Frequency-aware Generalizable Person Re-identification via Decomposition and Filtering. 205:1-205:20
- Carlos Cortés, Irene Viola, Jesús Gutiérrez, Jack Jansen, Shishir Subramanyam, Evangelos Alexiou, Pablo Pérez, Narciso García, Pablo César:
Delay Threshold for Social Interaction in Volumetric eXtended Reality Communication. 206:1-206:22
- JongBeom Jeong, Soonbin Lee, Eun-Seok Ryu:
DATRA-MIV: Decoder-Adaptive Tiling and Rate Allocation for MPEG Immersive Video. 207:1-207:22
- Zheng Chen, Jian Zhao, Mingyu Yang, Wengang Zhou, Houqiang Li:
Optimizing Camera Motion with MCTS and Target Motion Modeling in Multi-Target Active Object Tracking. 208:1-208:19
- Xiangming Gu, Longshen Ou, Wei Zeng, Jianan Zhang, Nicholas Wong, Ye Wang:
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing. 209:1-209:29
- Mingyu Deng, Wanyi Zhang, Jie Zhao, Zhu Wang, Mingliang Zhou, Jun Luo, Chao Chen:
A Novel Framework for Joint Learning of City Region Partition and Representation. 210:1-210:23 - Xueqiang Han, Biao Han, Jinrong Li, Congxi Song:
Multi-agent DRL-based Multipath Scheduling for Video Streaming with QUIC. 211:1-211:23 - Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Xinwei Gao, Debin Zhao:
Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling. 212:1-212:22 - Wenxi Liu, Jiaxin Cai, Qi Li, Chenyang Liao, Jingjing Cao, Shengfeng He, Yuanlong Yu:
Learning Nighttime Semantic Segmentation the Hard Way. 213:1-213:23 - Xiaoya Yu, Kejun Wu, You Yang, Qiong Liu:
WaRENet: A Novel Urban Waterlogging Risk Evaluation Network. 214:1-214:28 - Jiawei Tan, Pingan Yang, Lu Chen, Hongxing Wang:
Temporal Scene Montage for Self-Supervised Video Scene Boundary Detection. 215:1-215:19 - Jun Liu, Jiantao Zhou, Jinyu Tian, Weiwei Sun:
Recoverable Privacy-Preserving Image Classification through Noise-like Adversarial Examples. 216:1-216:27 - Xiaobo Hu, Youfang Lin, Hehe Fan, Shuo Wang, Zhihao Wu, Kai Lv:
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation. 217:1-217:22 - Baoli Sun, Xinchen Ye, Tiantian Yan, Zhihui Wang, Haojie Li, Zhiyong Wang:
Discriminative Segment Focus Network for Fine-grained Video Action Recognition. 218:1-218:20 - Tingting Han, Quan Zhou, Jun Yu, Zhou Yu, Jianhui Zhang, Sicheng Zhao:
Effective Video Summarization by Extracting Parameter-Free Motion Attention. 219:1-219:20 - Huisi Wu, Zhaoze Wang, Yifan Li, Xueting Liu, Tong-Yee Lee:
Suitable and Style-Consistent Multi-Texture Recommendation for Cartoon Illustrations. 220:1-220:26 - Shizhan Liu, Weiyao Lin, Yihang Chen, Yufeng Zhang, Wenrui Dai, John See, Hongkai Xiong:
A Unified Framework for Jointly Compressing Visual and Semantic Data. 221:1-221:24 - Yefei Sheng, Ming Tao, Jie Wang, Bing-Kun Bao:
ISF-GAN: Imagine, Select, and Fuse with GPT-Based Text Enrichment for Text-to-Image Synthesis. 222:1-222:17 - Haorao Gao, Yiming Su, Fasheng Wang, Haojie Li:
Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection. 223:1-223:24 - Xiruo Jiang, Yazhou Yao, Sheng Liu, Fumin Shen, Liqiang Nie, Xian-Sheng Hua:
Dual Dynamic Threshold Adjustment Strategy. 224:1-224:18 - Panpan Zhang, Meng Liu, Xuemeng Song, Da Cao, Zan Gao, Liqiang Nie:
Universal Relocalizer for Weakly Supervised Referring Expression Grounding. 225:1-225:23 - Xiaolong Shen, Zhedong Zheng, Yi Yang:
StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition. 226:1-226:19 - Kankana Roy:
Multimodal Score Fusion with Sparse Low-rank Bilinear Pooling for Egocentric Hand Action Recognition. 227:1-227:22 - Huiyuan Fu, Jin Liu, Ting Yu, Xin Wang, Huadong Ma:
Multi-Domain Image-to-Image Translation with Cross-Granularity Contrastive Learning. 228:1-228:21 - Hao Zhang, Meng Liu, Yuan Qi, Ning Yang, Shunbo Hu, Liqiang Nie, Wenyin Zhang:
Efficient Brain Tumor Segmentation with Lightweight Separable Spatial Convolutional Network. 229:1-229:19
Volume 20, Number 8, August 2024
- Jinliang Liu, Zhedong Zheng, Zongxin Yang, Yi Yang:
High Fidelity Makeup via 2D and 3D Identity Preservation Net. 230:1-230:24 - Junjian Huang, Hao Ren, Shulin Liu, Yong Liu, Chuanlu Lv, Jiawen Lu, Changyong Xie, Hong Lu:
Real-Time Attentive Dilated U-Net for Extremely Dark Image Enhancement. 231:1-231:19 - Mingfu Xiong, Kaikang Hu, Zhihan Lyu, Fei Fang, Zhongyuan Wang, Ruimin Hu, Khan Muhammad:
Inter-camera Identity Discrimination for Unsupervised Person Re-identification. 232:1-232:18 - Jiaqi Yu, Jinhai Yang, Hua Yang, Renjie Pan, Pingrui Lai, Guangtao Zhai:
Psychology-Guided Environment Aware Network for Discovering Social Interaction Groups from Videos. 233:1-233:23 - Qi Liu, Xinchen Liu, Kun Liu, Xiaoyan Gu, Wu Liu:
SigFormer: Sparse Signal-guided Transformer for Multi-modal Action Segmentation. 234:1-234:22 - Jun Lyu, Shouang Yan, M. Shamim Hossain:
DBGAN: Dual Branch Generative Adversarial Network for Multi-Modal MRI Translation. 235:1-235:22 - Dejun Zhang, Mian Zhang, Xuefeng Tan, Jun Liu:
Bridging the Domain Gap in Scene Flow Estimation via Hierarchical Smoothness Refinement. 236:1-236:21 - Ning Chen, Zhipeng Cheng, Xuwei Fan, Zhang Liu, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani:
Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception. 237:1-237:28 - Jiayu Yang, Chunhui Yang, Fei Xiong, Yongqi Zhai, Ronggang Wang:
Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement. 238:1-238:21 - Xiaoling Gu, Junkai Zhu, Yongkang Wong, Zizhao Wu, Jun Yu, Jianping Fan, Mohan S. Kankanhalli:
Recurrent Appearance Flow for Occlusion-Free Virtual Try-On. 239:1-239:17 - Yuanjie Lyu, Penggang Qin, Tong Xu, Chen Zhu, Enhong Chen:
InteractNet: Social Interaction Recognition for Semantic-rich Videos. 240:1-240:21 - Mrinmoy Bhattacharjee, S. R. Mahadeva Prasanna, Prithwijit Guha:
Exploration of Speech and Music Information for Movie Genre Classification. 241:1-241:19 - Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara:
Towards Retrieval-Augmented Architectures for Image Captioning. 242:1-242:22 - Kaihui Yang, Junwei Han, Guangyu Guo, Chaowei Fang, Yingzi Fan, Lechao Cheng, Dingwen Zhang:
Progressive Adapting and Pruning: Domain-Incremental Learning for Saliency Prediction. 243:1-243:19 - Lv Tang, Xinfeng Zhang:
High Efficiency Deep-learning Based Video Compression. 244:1-244:23 - Pedro Gomes, Silvia Rossi, Laura Toni:
AGAR - Attention Graph-RNN for Adaptative Motion Prediction of Point Clouds of Deformable Objects. 245:1-245:25 - Jiabo Ye, Junfeng Tian, Ming Yan, Haiyang Xu, Qinghao Ye, Yaya Shi, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin:
UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet. 246:1-246:28 - Wei Zhou, Qi Yang, Wu Chen, Qiuping Jiang, Guangtao Zhai, Weisi Lin:
Blind Quality Assessment of Dense 3D Point Clouds with Structure Guided Resampling. 247:1-247:21 - Yuli Zhao, Yin Zhang, Francis C. M. Lau, Hai Yu, Zhiliang Zhu, Bin Zhang:
Expanding-Window Zigzag Decodable Fountain Codes for Scalable Multimedia Transmission. 248:1-248:24 - Xuanyu Jin, Ni Li, Wanzeng Kong, Jiajia Tang, Bing Yang:
Unbiased Semantic Representation Learning Based on Causal Disentanglement for Domain Generalization. 249:1-249:20 - Bo Peng, Lin Sun, Jianjun Lei, Bingzheng Liu, Haifeng Shen, Wanqing Li, Qingming Huang:
Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning. 250:1-250:19 - Yang Yang, Shuailong Qiu, Lanling Zeng, Zhigeng Pan:
Detail-preserving Joint Image Upsampling. 251:1-251:23 - Xiao Kang, Xingbo Liu, Wen Xue, Xiushan Nie, Yilong Yin:
Online Cross-modal Hashing With Dynamic Prototype. 252:1-252:18 - Yuqing Yang, Boris Joukovsky, José Oramas Mogrovejo, Tinne Tuytelaars, Nikos Deligiannis:
SNIPPET: A Framework for Subjective Evaluation of Visual Explanations Applied to DeepFake Detection. 253:1-253:29 - Jinwang Pan, Xianming Liu, Yuanchao Bai, Deming Zhai, Junjun Jiang, Debin Zhao:
Illumination-Aware Low-Light Image Enhancement with Transformer and Auto-Knee Curve. 254:1-254:23 - Lohic Fotio Tiotsop, Antonio Servetti, Peter Pocta, Glenn Van Wallendael, Marcus Barkowsky, Enrico Masala:
Multiple Image Distortion DNN Modeling Individual Subject Quality Assessment. 255:1-255:27 - Yunhui Xu, Youru Li, Muhao Xu, Zhenfeng Zhu, Yao Zhao:
HKA: A Hierarchical Knowledge Alignment Framework for Multimodal Knowledge Graph Completion. 256:1-256:19 - Li Zhou, Zhenyu Liu, Yutong Li, Yuchi Duan, Huimin Yu, Bin Hu:
Multi Fine-Grained Fusion Network for Depression Detection. 257:1-257:23 - Chenlei Lv, Dan Zhang, Shengling Geng, Zhongke Wu, Hui Huang:
Color Transfer for Images: A Survey. 258:1-258:29 - Zhihao Zhang, Jun Wang, Shengjie Li, Lei Jin, Hao Wu, Jian Zhao, Bo Zhang:
Review and Analysis of RGBT Single Object Tracking Methods: A Fusion Perspective. 259:1-259:27 - Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar:
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey. 260:1-260:24 - Yuankun Liu, Xiang Yuan, Haochen Li, Zhijie Tan, Jinsong Huang, Jingjie Xiao, Weiping Li, Tong Mo:
SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval. 261:1-261:28