default search action
28th MMM 2022: Phu Quoc, Vietnam
- Björn Þór Jónsson, Cathal Gurrin, Minh-Triet Tran, Duc-Tien Dang-Nguyen, Anita Min-Chun Hu, Huynh Thi Thanh Binh, Benoit Huet:
MultiMedia Modeling - 28th International Conference, MMM 2022, Phu Quoc, Vietnam, June 6-10, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13142, Springer 2022, ISBN 978-3-030-98354-3
Poster Papers
- Sheng Kang, Yang Wang, Yang Cao, Zheng-Jun Zha:
Long-Range Feature Dependencies Capturing for Low-Resolution Image Classification. 3-14 - Pengjian Yang, Jun Wang, Guangyu Zhong, Pengyuan Zhang, Lai Zhang, Fan Liang, Jianxin Yang:
An IBC Reference Block Enhancement Model Based on GAN for Screen Content Video Coding. 15-26 - Ruijing Zhao, Kai Zhu, Yang Cao, Zheng-Jun Zha:
AS-Net: Class-Aware Assistance and Suppression Network for Few-Shot Learning. 27-39 - Shengbin Meng, Chunyu Qiao, Junlin Li, Yue Wang, Zongming Guo:
DIG: A Data-Driven Impact-Based Grouping Method for Video Rebuffering Optimization. 40-51 - Yu-Heng Huang, Wei-Ta Chu:
Indie Games Popularity Prediction by Considering Multimodal Features. 52-61 - Changjian Zhu, Hong Zhang, Ying Wei, Nan He, Qiuming Liu:
An Iterative Correction Phase of Light Field for Novel View Reconstruction. 62-72 - Fan Wang, Lei Luo, En Zhu, Siwei Wang:
Multi-object Tracking with a Hierarchical Single-Branch Network. 73-83 - Ani Withöft, Larbi Abdenebaoui, Susanne Boll:
ILMICA - Interactive Learning Model of Image Collage Assessment: A Transfer Learning Approach for Aesthetic Principles. 84-96 - Zhiwei Zha, Pengfei Zhou, Cong Bai:
Exploring Implicit and Explicit Relations with the Dual Relation-Aware Network for Image Captioning. 97-108 - Dapeng Zhao, Yue Qi:
Generative Landmarks Guided Eyeglasses Removal 3D Face Reconstruction. 109-120 - Xuemei Jia, Xian Zhong, Mang Ye, Wenxuan Liu, Wenxin Huang, Shilei Zhao:
Patching Your Clothes: Semantic-Aware Learning for Cloth-Changed Person Re-Identification. 121-133 - Yuejin Sun, Yang Wang, Yang Cao, Zheng-Jun Zha:
Lightweight Wavelet-Based Network for JPEG Artifacts Removal. 134-145 - Jihun Kang, Daichi Haraguchi, Seiya Matsuda, Akisato Kimura, Seiichi Uchida:
Shared Latent Space of Font Shapes and Their Noisy Impressions. 146-157 - Weiran Wang, Huijun Di, Lingxiao Song:
Reconstructing 3D Contour Models of General Scenes from RGB-D Sequences. 158-170 - Jingzhong Qi, Na Qi, Qing Zhu:
SUnet++: Joint Demosaicing and Denoising of Extreme Low-Light Raw Image. 171-181 - Alexander Theus, Luca Rossetto, Abraham Bernstein:
HyText - A Scene-Text Extraction Method for Video Retrieval. 182-193 - Xianghong Huang, Qun Yang, Shaohan Liu:
Depthwise-Separable Residual Capsule for Robust Keyword Spotting. 194-204 - Dengshi Li, Lanxin Zhao, Jing Xiao, Jiaqi Liu, Duanzheng Guan, Qianrui Wang:
Adaptive Speech Intelligibility Enhancement for Far-and-Near-end Noise Environments Based on Self-attention StarGAN. 205-217 - Donnaphat Trakulwaranont, Marc A. Kastner, Shin'ichi Satoh:
Personalized Fashion Recommendation Using Pairwise Attention. 218-229 - Hongyan Wu, Haiyun Guo, Qinghai Miao, Min Huang, Jinqiao Wang:
Graph Neural Networks Based Multi-granularity Feature Representation Learning for Fine-Grained Visual Categorization. 230-242 - Yi Ren, Min Zhang, Hongyu Zhou, Ji Liu:
Skeletonization Based on K-Nearest-Neighbors on Binary Image. 243-254 - Liyan Chen, Haoran Yang, Kunhong Liu:
Classroom Attention Estimation Method Based on Mining Facial Landmarks of Students. 255-266 - Lei Zhang, Xiaoming Zhao, Xueqiang Song, Yuwei Fang, Dong Li, Haizhou Wang:
A Novel Chinese Sarcasm Detection Model Based on Retrospective Reader. 267-278 - Mareike Gabele, Andrea Thoms, Simon Schröer, Steffi Hußlein, Christian Hansen:
Effects and Combination of Tailored Browser-Based and Mobile Cognitive Software Training. 279-291 - Shuang Jin, Na Qi, Qing Zhu, Haoran Ouyang:
Progressive GAN-Based Transfer Network for Low-Light Image Enhancement. 292-304 - Na Jiang, Zhaofa Wang, Peng Xu, Xinyue Wu, Lei Zhang:
Rethinking Shared Features and Re-ranking for Cross-Modality Person Re-identification. 305-317 - Ngan Hoang Vo, Khoa D. Phan, Anh-Duy Tran, Duc-Tien Dang-Nguyen:
Adversarial Attacks on Deepfake Detectors: A Practical Analysis. 318-330 - Scott McCrae, Kehan Wang, Avideh Zakhor:
Multi-modal Semantic Inconsistency Detection in Social Media News Posts. 331-343 - Hanyu Li, Xu Zhang, Ying Xia:
EEG Emotion Recognition Based on Dynamically Organized Graph Neural Network. 344-355 - Yajie Wang, Yanyan Xie, Yanyan Wu, Kai Liang, Jilin Qiao:
An Unsupervised Multi-scale Generative Adversarial Network for Remote Sensing Image Pan-Sharpening. 356-368 - Apostolos Panagiotopoulos, Giorgos Kordopatis-Zilos, Symeon Papadopoulos:
Leveraging Selective Prediction for Reliable Image Geolocation. 369-381 - Hongying Zheng, Yawen Huang, Lin Li, Di Xiao:
Compressive Sensing-Based Image Encryption and Authentication in Edge-Clouds. 382-393 - Jesús Aguilar Armijo, Ekrem Çetinkaya, Christian Timmerer, Hermann Hellwagner:
ECAS-ML: Edge Computing Assisted Adaptation Scheme with Machine Learning for HTTP Adaptive Streaming. 394-406 - Shiyi Liu, Zhenyu Wang, Ke Qiu, Jiayu Yang, Ronggang Wang:
Fast CU Depth Decision Algorithm for AVS3. 407-418 - Li Li, Liansheng Zhuang:
MEViT: Motion Enhanced Video Transformer for Video Classification. 419-430 - Thuan Trong Nguyen, Thuan Q. Nguyen, Long Duong, Nguyen D. Vo, Khang Nguyen:
CDeRSNet: Towards High Performance Object Detection in Vietnamese Document Images. 431-442
Demonstration Papers
- Werner Bailer:
Making Few-Shot Object Detection Simpler and Less Frustrating. 445-451 - Klaus Jung, Kai Uwe Barthel, Nico Hezel, Konstantin Schall:
PicArrange - Visually Sort, Search, and Explore Private Images on a Mac Computer. 452-457 - Kim I. Schild, Alexandra M. Bagi, Magnus Holm Mamsen, Omar Shahbaz Khan, Jan Zahálka, Björn Þór Jónsson:
XQM: Search-Oriented vs. Classifier-Oriented Relevance Feedback on Mobile Phones. 458-464 - Ekrem Çetinkaya, Minh Nguyen, Christian Timmerer:
MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks. 465-472 - Vasileios Sitokonstantinou, Alkiviadis Koukos, Thanassis Drivas, Charalampos Kontoes, Vassilia Karathanassi:
DataCAP: A Satellite Datacube and Crowdsourced Street-Level Images for the Monitoring of the Common Agricultural Policy. 473-478 - Ly-Duyen Tran, Diarmuid Kennedy, Liting Zhou, Binh T. Nguyen, Cathal Gurrin:
A Virtual Reality Reminiscence Interface for Personal Lifelogs. 479-484
Video Browser Showdown 2022
- Nico Hezel, Konstantin Schall, Klaus Jung, Kai Uwe Barthel:
Efficient Search and Browsing of Large-Scale Video Collections with Vibro. 487-492 - Silvan Heller, Rahel Arnold, Ralph Gasser, Viktor Gsteiger, Mahnaz Parian-Scherb, Luca Rossetto, Loris Sauter, Florian Spiess, Heiko Schuldt:
Multi-modal Interactive Video Retrieval with Temporal Queries. 493-498 - Florian Spiess, Ralph Gasser, Silvan Heller, Mahnaz Parian-Scherb, Luca Rossetto, Loris Sauter, Heiko Schuldt:
Multi-modal Video Retrieval in Virtual Reality with vitrivr-VR. 499-504 - Jakub Lokoc, Frantisek Mejzlík, Tomás Soucek, Patrik Dokoupil, Ladislav Peska:
Video Search with Context-Aware Ranker and Relevance Feedback. 505-510 - Omar Shahbaz Khan, Ujjwal Sharma, Björn Þór Jónsson, Dennis C. Koelma, Stevan Rudinac, Marcel Worring, Jan Zahálka:
Exquisitor at the Video Browser Showdown 2022. 511-517 - Thao-Nhu Nguyen, Bunyarit Puangthamawathanakun, Graham Healy, Binh T. Nguyen, Cathal Gurrin, Annalina Caputo:
Videofall - A Hierarchical Search Engine for VBS2022. 518-523 - Sangmin Lee, Sungjune Park, Yong Man Ro:
IVIST: Interactive Video Search Tool in VBS 2022. 524-529 - Stelios Andreadis, Anastasia Moumtzidou, Damianos Galanopoulos, Nick Pantelidis, Konstantinos Apostolidis, Despoina Touska, Konstantinos Gkountakos, Maria Pegia, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris:
VERGE in VBS 2022. 530-536 - Tu-Khiem Le, Van-Tu Ninh, Mai-Khiem Tran, Graham Healy, Cathal Gurrin, Minh-Triet Tran:
AVSeeker: An Active Video Retrieval Engine at VBS2022. 537-542 - Giuseppe Amato, Paolo Bolettieri, Fabio Carrara, Fabrizio Falchi, Claudio Gennaro, Nicola Messina, Lucia Vadicamo, Claudio Vairo:
VISIONE at Video Browser Showdown 2022. 543-548 - Zhixin Ma, Jiaxin Wu, Zhijian Hou, Chong-Wah Ngo:
Reinforcement Learning-Based Interactive Video Search. 549-555 - Khanh Ho, Vu Xuan Dinh, Hong-Quang Nguyen, Khiem Le, Khang Dinh Tran, Tien Do, Tien-Dung Mai, Thanh Duc Ngo, Duy-Dinh Le:
UIT at VBS 2022: An Unified and Interactive Video Retrieval System with Temporal Search. 556-561 - Minh-Triet Tran, Nhat Hoang-Xuan, Hoang-Phuc Trang-Trung, Thanh-Cong Le, Mai-Khiem Tran, Minh-Quan Le, Tu-Khiem Le, Van-Tu Ninh, Cathal Gurrin:
V-FIRST: A Flexible Interactive Retrieval System for Video at VBS 2022. 562-568 - Andreas Leibetseder, Klaus Schoeffmann:
diveXplore 6.0: ITEC's Interactive Video Exploration System at VBS 2022. 569-574 - Duc-Tuan Luu, Khanh-An C. Quan, Thinh-Quyen Nguyen, Van-Son Hua, Minh-Chau Nguyen, Minh-Triet Tran, Vinh-Tiep Nguyen:
CDC: Color-Based Diffusion Model with Caption Embedding in VBS 2022. 575-579 - Aaron Duane, Björn Þór Jónsson:
ViRMA: Virtual Reality Multimedia Analytics at Video Browser Showdown 2022. 580-585
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.