default search action
5th MIPR 2022: Virtual Event, USA
- 5th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2022, Virtual Event, USA, August 2-4, 2022. IEEE 2022, ISBN 978-1-6654-9548-6
- Zhongzheng Yuan, Samyak Rawlekar, Siddharth Garg, Elza Erkip, Yao Wang:
Feature Compression for Rate Constrained Object Detection on the Edge. 1-6 - Zhuoyi Wang, Yibo Hu, Latifur Khan, Kevin W. Hamlen, Bhavani Thuraisingham:
CAPT: Contrastive Pre-Training based Semi-Supervised Open-Set Learning. 7-13 - Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen:
A Robust Collaborative Learning Framework Using Data Digests and Synonyms to Represent Absent Clients. 14-19 - Qisheng He, Soumyanil Banerjee, Loren Schwiebert, Ming Dong:
AgileGCN: Accelerating Deep GCN with Residual Connections using Structured Pruning. 20-26 - Chris Henry, Birendra Kathariya, M. Salman Asif, Zhu Li, George York:
Aerial Image Classification through Thin Lensless Camera. 27-30 - Thanh Hong-Phuoc, Ling Guan:
Learning Rotational Invariant Dictionary for Sparse Coding based Key-point Detection. 35-40 - Maryna Veksler, Ramazan Aygun, Kemal Akkaya, S. Sitharama Iyengar:
Video Origin Camera Identification using Ensemble CNNs of Positional Patches. 41-46 - Lei Gao, Ling Guan:
Interpretable Learning-Based Multi-Modal Hashing Analysis for Multi-View Feature Representation Learning. 47-52 - Ju Wang, Wookjin Choi, Igor Schtau, Taylor Ferro, Wei-bang Chen, Cutrell Trott, Grant Patterson:
Improving Angular Estimation Using a Deep CNN network in 6D pose estimation. 53-58 - Jianqiang Wang, Zhan Ma:
Sparse Tensor-based Point Cloud Attribute Compression. 59-64 - Yi Chen, Yunhao Mao, Shiqi Wang, Xianguo Zhang, Sam Kwong:
Machine-Learning Based High Efficiency Rate Control for AV1. 65-70 - Hoontaek Oh, Jerry D. Gibson:
Recursive Randomized Tree Coding of Speech. 71-76 - Kyriakos Lite, Bernhard Rinner:
Information-Seeking in Localization and Mission Planning of Multi-Agent Systems. 77-83 - Ziheng Zhang, Chang-Hong Fu, Kai Xie, Hong Hong, Guan-Ming Su:
Fast VVC Intra Coding by Skipping Redundant Coding Block Structures and Unnecessary Directional Partition. 84-89 - Ranjit Kumar Tulabandu, Jayasanker Jayaprakash, Sanampudi Venkata Rao, Cherma Rajan A, Neeraj Gadgil, Frank Galligan, Wan-Teh Chang:
Evolution of AVIF Encoder: Speed and Memory Optimizations. 90-95 - Yixiang Mao, Yueyu Hu, Yao Wang:
Learning to Predict on Octree for Scalable Point Cloud Geometry Coding. 96-102 - Zixiao Yu, Chenyu Yu, Haohong Wang, Jian Ren:
Enabling Automatic Cinematography with Reinforcement Learning. 103-108 - Yang Lei, Viktor Shkolnikov, Daisy Xin:
Spatially Isotropic 3D Volumetric Reconstruction of Live Biological Cells with Multi-View Geometry. 109-114 - Omkar N. Kulkarni, Shashank Arora, Pradeep K. Atrey:
GARGI: Selecting Gaze-Aware Representative Group Image from a Live Photo. 115-120 - Shlomo Dubnov, Gérard Assayag, Vignesh Gokul:
Creative Improvised Interaction with Generative Musical Systems. 121-126 - Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho:
Real-Time Super-Resolution for Real-World Images on Mobile Devices. 127-132 - Yuxin Lin, Jian Li, Lanqing Guo, Bihan Wen:
RE2L: A Real-World Dataset for Outdoor Low-Light Image Enhancement. 133-138 - Peixi Wu, Ge Li, Thomas H. Li:
MOAC: Multi-level Perception Optimizer Based on Dual Augmented Cost for Structure- from-Motion. 139-145 - Congying Cao, Lijun Zhao, Jinjing Zhang, Xinlu Wang, Anhong Wang:
D2-UTransforIner: Deep Modulated Dual-UTransformer for Multiple Description Image Enhancement. 146-151 - Hao Cheng, Joey Tianyi Zhou, Wee Peng Tay, Bihan Wen:
Attentive Graph Neural Networks for Few-Shot Learning. 152-157 - Daisuke Miyazaki, Hodaka Tanida:
Image enhancement for dichromats using image pyramid based on saturation. 158-161 - Akihito Watanabe, Daisuke Miyazaki:
Surface normal estimation of thin transparent objects from polarization of transmitted light. 162-165 - Liangshan Lou, Ke Lu, Jian Xue:
Skipped-Connection Transformer for Image Captioning*. 166-171 - Dexiang Hong, Guorong Li, Bineng Zhong, Zhenjun Han, Li Su, Qingming Huang:
CRNet: Collaborative Refinement Network for Self-Supervised Video Object Segmentation. 172-177 - Chin-Chia Yang, Yi-Chou Chen, Shan-Ling Chen, Homer H. Chen:
Disparity-Guided Light Field Video Synthesis with Temporal Consistency. 178-183 - Fityanul Akhyar, Ledya Novamizanti, Trianusa Putra, Elvin Nur Furqon, Ming-Ching Chang, Chih-Yang Lin:
Lightning YOLOv4 for a Surface Defect Detection System for Sawn Lumber. 184-189 - Kazuhiro Yamawaki, Xian-Hua Han:
Deep Unsupervised Blind Learning for Single Image Super Resolution. 190-193 - Da Huo, Marc A. Kastner, Takahiro Komamizu, Ichiro Ide:
Action Semantic Alignment for Image Captioning. 194-197 - Chengwei Wei, C.-C. Jay Kuo, Rafael Luiz Testa, Ariane Machado-Lima, Fátima L. S. Nunes:
ExpressionHop: A Lightweight Human Facial Expression Classifier. 198-203 - Praneet Singh, Haoyu Chen, Edward J. Delp, Amy R. Reibman:
Evaluating Image Quality Estimators for Face Matching. 204-209 - Jiaran Zhou, Yuezun Li:
Detection-by-Simulation: Exposing DeepFake via Simulating Forgery using Face Reconstruction. 210-215 - Nathan Galea, Dylan Seychell:
Facial Expression Recognition in the Wild: Dataset Configurations. 216-219 - Pengcheng Gao, Bin Huang, Jiayi Lyu, Haifeng Ma, Jian Xue:
A Local-Global Metric Learning Method for Facial Expression Animation. 220-223 - Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu:
Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces. 224-227 - Zinan Xiong, Chenxi Wang, Ying Li, Yan Luo, Yu Cao:
Swin-Pose: Swin Transformer Based Human Pose Estimation. 228-233 - Astha Verma, A. Venkata Subramanyam, Rajiv Ratn Shah:
Wasserstein Metric Attack on Person Re-identification. 234-239 - Hirotaka Kato, Takatsugu Hirayama, Keisuke Doman, Ichiro Ide, Yasutomo Kawanishi, Takahiro Komamizu, Daisuke Deguchi, Hiroshi Murase:
Intuitive Gait Modeling using Mimetic-Words for Gait Description and Generation. 240-245 - Wei-An Teng, Su-Ling Yeh, Homer H. Chen:
Comparison of Virtual-Real Integration Efficiency between Light Field and Conventional Near-Eye AR Displays. 246-251 - Vineet Joshi, A. V. Subramanyam:
Contextual Active Learning for Person Re- Identification. 252-257 - Junyi Liu, Esha Naidu, Jialian Wu, Shira Gabriel, Edward Steinfeld, Junsong Yuan:
Personalized Prediction of Indoor Comfort Using Graph Convolutional Matrix Completion. 258-261 - Gautham Vinod, Zeman Shao, Fengqing Zhu:
Image Based Food Energy Estimation With Depth Domain Adaptation. 262-267 - Zixiao Yu, Enhao Guo, Haohong Wang, Jian Ren:
Bridging Script and Animation Utilizing a New Automatic Cinematography Model. 268-273 - Mehmet N. Akcay, Burak Kara, Ali C. Begen, Saba Ahsan, Igor D. D. Curcio, Emre Aksu:
Rate-Adaptive Streaming of 360-Degree Videos with Head-Motion-Aware Viewport Margins. 274-280 - Qian Zhou, Klara Nahrstedt:
Ultra-Sparse 360-Degree Camera View Synthesis for Immersive Virtual Tourism. 281-286 - Maria E. Presa Reyes, Yudong Tao, Rui Ma, Shu-Ching Chen, Mei-Ling Shyu:
Multi-Source Weak Supervision Fusion for Disaster Scene Recognition in Videos. 287-292 - Luntian Mou, Yiyuan Zhao, Chao Zhou, Baocai Yin, Wen Gao, Ramesh C. Jain:
A Review of Personalized Health Navigation for Drivers. 293-299 - Saeed Ranjbar Alvar, Korcan Uyanik, Ivan V. Bajic:
License Plate Privacy in Collaborative Visual Analysis of Traffic Scenes. 300-305 - Neha Kumari, Min Chen:
Malware and Piracy Detection in Android Applications. 306-311 - Haoming Guo, Tianyi Huang, Huixuan Huang, Mingyue Fan, Gerald Friedland:
A Systematic Review of Multimodal Approaches to Online Misinformation Detection. 312-317 - Chih-Fan Hsu, Jing-Lun Huang, Feng-Hao Liu, Ming-Ching Chang, Wei-Chao Chen:
FedTrust: Towards Building Secure Robust and Trustworthy Moderators for Federated Learning. 318-323 - Kratika Bhagtani, Amit Kumar Singh Yadav, Emily R. Bartusiak, Ziyue Xiang, Ruiting Shao, Sriram Baireddy, Edward J. Delp:
An Overview of Recent Work in Multimedia Forensics. 324-329 - Tankut Akgul, Deniz Ugur, Ali C. Begen:
Automated Adaptive Playback for Encoder-Adjudicated Live Sports. 330-335 - Nikolaos Passalis, Maria Tzelepi, Polychronis Charitidis, Stavros Doropoulos, Stavros Vologiannidis, Anastasios Tefas:
Deep Video Stream Information Analysis and Retrieval: Challenges and Opportunities. 336-341 - Yuwei Chen, Ming-Ching Chang:
Towards Multimodal Semantic Consistency Analysis of Long Form Articles. 342-347 - Jiajun Song, Weiqing Min, Yuxin Liu, Zhuo Li, Shuqiang Jiang, Yong Rui:
A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval. 348-353 - Jawaher Alghamdi, Yuqing Lin, Suhuai Luo:
Modeling Fake News Detection Using BERT-CNN-BiLSTM Architecture. 354-357 - Zheng Guo, Thanh Hong-Phuoc, Naimul Khan, Ling Guan:
A Highly Optimized GPU Batched Elasticnet Solver (BENS) with Application to Real- Time Keypoint Detection for Image Retrieval. 358-361 - Keiichi Suekane, Ryo Osawa, Aozora Inagaki, Taiga Matsui, Tomohiro Tanabe, Keita Ishikawa, Tomohiro Takagi:
Personalized Fashion Sequential Recommendation with Visual Feature Based on Conditional Hierarchical VAE. 362-365 - Chengxuan Huang, Dalei Wu, Yu Liang:
Adaptive Acquisition of Airborne Lidar Point Cloud Based on Deep Reinforcement Learning. 366-371 - Gabriel Lugo Bustillo, Amit Upreti, Irene Cheng:
Multiscale point feature object localization for hydrant surveying using LiDAR. 372-378 - Yuexi Zhang, Ming Chen, Yikang Li, Jenhao Hsiao, Octavia I. Camps, Chiuman Ho:
Generic Action Start Detection. 379-382 - Cheng Yang, Weigang Zhang:
Weakly Supervised Temporal Action Localization Through Contrastive Learning. 383-386 - Tom Liao, Jun-Cheng Chen, Shyh-Kang Jeng, Chunhwei Tai:
Cross-Domain Knowledge Transfer for Skeleton-based Action Recognition based on Graph Convolutional Gradient Reversal Layer. 387-390 - Ziruo Yi, Eduardo Blanco, Heng Fan, Mark V. Albert:
BAPO: A Large-Scale Multimodal Corpus for Ball Possession Prediction in American Football Games. 391-394 - Oguz M. Aranay, Pradeep K. Atrey:
Active Genetic Learning with Evidential Uncertainty for Identifying Mushroom Toxicity. 395-400 - Garima Singhal, Priyankar Choudhary, Vusirikala Abhishek, Seela Sweety, Srinivas Subramanian, Neeraj Goel:
Cattle Collar: An End-to-End Multi-Model Framework for Cattle Monitoring. 401-407 - Sushil Ghildiyal, Neeraj Goel, Mukesh Saini:
Cloud Removal in Satellite Imagery Using Adversarial Network and RGB-Optical Data Fusion. 407-412 - Pratham Goyal, Anjali Raj, Puneet Kumar, Kishore Babu Nampalle:
Automatic Evaluation of Machine Generated Feedback For Text and Image Data. 413-418 - Charan Charupalli, Karthick Seshadri:
Fine-tuning the Robust Temporal Feature Magnitude Model for Enhancing the Accuracy of Anomaly Detection. 419-424 - Kishore Babu Nampalle, Balasubramanian Raman:
An efficient multi-functional deep learning model for effective medical image classification using skin lesion database. 425-429
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.