


default search action
MMM 2023, Bergen, Norway - Part II
- Duc-Tien Dang-Nguyen
, Cathal Gurrin
, Martha A. Larson
, Alan F. Smeaton, Stevan Rudinac
, Minh-Son Dao
, Christoph Trattner
, Phoebe Chen
:
MultiMedia Modeling - 29th International Conference, MMM 2023, Bergen, Norway, January 9-12, 2023, Proceedings, Part II. Lecture Notes in Computer Science 13834, Springer 2023, ISBN 978-3-031-27817-4
Multimedia Processing and Applications
- Shuo Chen, Di Li, Bobo Ju, Linhua Jiang, Dongfang Zhao:
Transparent Object Detection with Simulation Heatmap Guidance and Context Spatial Attention. 3-15 - Tianrun Chen, Chenglong Fu, Ying Zang, Lanyun Zhu
, Jia Zhang, Papa Mao, Lingyun Sun:
Deep3DSketch+: Rapid 3D Modeling from Single Free-Hand Sketches. 16-28 - Yi-Ting Yang, Wei-Ta Chu:
Manga Text Detection with Manga-Specific Data Augmentation and Its Applications on Emotion Analysis. 29-40 - ShanShan Zhong, Wushao Wen, Jinghui Qin:
SPEM: Self-adaptive Pooling Enhanced Attention Module for Image Recognition. 41-53 - Patrik Veselý
, Ladislav Peska
:
Less Is More: Similarity Models for Content-Based Video Retrieval. 54-65 - Wanliang Wang, Fangsen Xing, Jiacheng Chen, Hangyao Tu:
Edge Assisted Asymmetric Convolution Network for MR Image Super-Resolution. 66-78 - Weiyan Chen
, Changjian Zhu
, Shan Zhang
, Sen Xiang
:
An Occlusion Model for Spectral Analysis of Light Field Signal. 79-90 - Tianxing Feng, Zhe Zhang, Kaiqiang Xiong, Ronggang Wang:
Context-Guided Multi-view Stereo with Depth Back-Projection. 91-102 - Wei Wang, Peng Lu, Xujun Peng, Wang Yin
, Zhaoran Zhao:
RLSCNet: A Residual Line-Shaped Convolutional Network for Vanishing Point Detection. 103-114 - Jiajun Ouyang, Qingxuan Lv, Shu Zhang, Junyu Dong:
Energy Transfer Contrast Network for Unsupervised Domain Adaption. 115-126 - Xuran Deng, Chuanbin Liu, Zhiying Lu:
Recombining Vision Transformer Architecture for Fine-Grained Visual Categorization. 127-138 - Ming Gao, Shilian Wu, Zengfu Wang:
A Length-Sensitive Language-Bound Recognition Network for Multilingual Text Recognition. 139-150 - Yuan Zhang, Xiang Tian, Ziyang Zhang, Xiangmin Xu:
Lightweight Multi-level Information Fusion Network for Facial Expression Recognition. 151-163 - Duc-Tien Dang-Nguyen, Vegard Velle Sjøen, Dinh-Hai Le, Thien-Phu Dao, Anh-Duy Tran
, Minh-Triet Tran:
Practical Analyses of How Common Social Media Platforms and Photo Storage Services Handle Uploaded Images. 164-176 - Kai Ye, Haoqin Ji, Yuan Li, Lei Wang, Peng Liu, Linlin Shen:
CCF-Net: A Cascade Center-Based Framework Towards Efficient Human Parts Detection. 177-189 - Yuhang Li, Feifan Cai, Yifei Tu, Youdong Ding:
Low-Light Image Enhancement Under Non-uniform Dark. 190-201 - Fucai Gong
, Yuchen Xie
, Le Jiang
, Keming Chen
, Yunxin Liu
, Xiaozhou Ye
, Ye Ouyang
:
A Proposal-Improved Few-Shot Embedding Model with Contrastive Learning. 202-214 - Haoqi Xu, Jian Hou, Huaqiang Yuan:
Weighted Multi-view Clustering Based on Internal Evaluation. 215-227 - Zhiqi Yan, Shuang Liang:
BENet: Boundary Enhance Network for Salient Object Detection. 228-239 - Trong-Hieu Nguyen Mau
, Quoc-Huy Trinh
, Nhat-Tan Bui
, Phuoc-Thao Vo Thi
, Minh-Van Nguyen
, Xuan-Nam Cao
, Minh-Triet Tran
, Hai-Dang Nguyen
:
PEFNet: Positional Embedding Feature for Polyp Segmentation. 240-251 - Daniele Lorenzi
, Farzad Tashtarian
, Hadi Amirpour
, Christian Timmerer
, Hermann Hellwagner
:
MCOM-Live: A Multi-Codec Optimization Model at the Edge for Live Streaming. 252-264 - Jinxin Guo, Jiaqiang Zhang, Xiaojing Zhang, Ming Ma:
LAE-Net: Light and Efficient Network for Compressed Video Action Recognition. 265-276 - Yunhong Li
, Shuai Li
, Zhenhua Yu
:
DARTS-PAP: Differentiable Neural Architecture Search by Polarization of Instance Complexity Weighted Architecture Parameters. 277-288 - Song Chen
, Chong Wang
, Weijie Liu
, Zhengjie Ye
, Jiacheng Deng
:
Pseudo-label Diversity Exploitation for Few-Shot Object Detection. 289-300 - Xinjia Xie
, Feng Liu, Shun Gai, Zhen Huang, Minghao Hu, Ankun Wang:
HSS: A Hierarchical Semantic Similarity Hard Negative Sampling Method for Dense Retrievers. 301-312 - Jingsen Fang
, Shoudong Shi, Yi Fang, Zheng Huo:
Realtime Sitting Posture Recognition on Embedded Device. 313-324 - Georgios Loupas, Theodora Pistola, Sotiris Diplaris, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris:
Comparison of Deep Learning Techniques for Video-Based Automatic Recognition of Greek Folk Dances. 325-336 - Yingnan Fu, Shu Zheng, Wenyuan Cai, Ming Gao, Cheqing Jin, Aoying Zhou:
Dynamic Feature Selection for Structural Image Content Recognition. 337-349 - Ke Dong, Hao Peng, Jie Che:
Dynamic-Static Cross Attentional Feature Fusion Method for Speech Emotion Recognition. 350-361 - Aimei Dong, Sidi Liu:
Research on Multi-task Semantic Segmentation Based on Attention and Feature Fusion Method. 362-373 - Minyan Zheng
, Jianping Luo
:
Space-Time Video Super-Resolution 3D Transformer. 374-385 - Despoina Touska
, Konstantinos Gkountakos
, Theodora Tsikrika, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris:
Graph-Based Data Association in Multiple Object Tracking: A Survey. 386-398 - Chaoqun Niu, Yuan Li, Jian Wang, Jizhe Zhou, Tu Xiong, Dong Yu, Huili Guo, Lin Zhang, Weibo Liang, Jiancheng Lv:
Multi-view Adaptive Bone Activation from Chest X-Ray with Conditional Adversarial Nets. 399-410 - Wei Luo, Mengying Xu, Hanjiang Lai:
Multimodal Reconstruct and Align Net for Missing Modality Problem in Sentiment Analysis. 411-422 - Ping Feng, Hanyun Zhang, Yingying Sun, Zhenjun Tang:
Lightweight Image Hashing Based on Knowledge Distillation and Optimal Transport for Face Retrieval. 423-434 - Shengwei Zhao, Yuying Liu, Shaoyi Du, Zhiqiang Tian, Ting Qu, Linhai Xu:
CMFG: Cross-Model Fine-Grained Feature Interaction for Text-Video Retrieval. 435-445 - Xiaoqiong Liu, Yuewei Lin, Qing Yang, Heng Fan:
Transferable Adversarial Attack on 3D Object Tracking in Point Cloud. 446-458 - Xiangqi Gan
, Changjian Zhu
, Mengqin Bai
, Ying Wei
, Weiyan Chen
:
A Spectrum Dependent Depth Layered Model for Optimization Rendering Quality of Light Field. 459-470 - Jing Yang
, Junwen Chen
, Keiji Yanai
:
Transformer-Based Cross-Modal Recipe Embeddings with Large Batch Training. 471-482 - Yuanhang Yin, Yang Hua
, Tao Song
, Ruhui Ma, Haibing Guan:
Self-supervised Multi-object Tracking with Cycle-Consistency. 483-495 - Chih-Wei Lin, Zhongsheng Chen, Xiuping Huang, Suhui Yang:
Video-Based Precipitation Intensity Recognition Using Dual-Dimension and Dual-Scale Spatiotemporal Convolutional Neural Network. 496-509 - Elissavet Batziou, Konstantinos Ioannidis, Ioannis Patras, Stefanos Vrochidis, Ioannis Kompatsiaris:
Low-Light Image Enhancement Based on U-Net and Haar Wavelet Pooling. 510-522 - Vijay John, Yasutomo Kawanishi
:
Audio-Visual Sensor Fusion Framework Using Person Attributes Robust to Missing Visual Modality for Person Recognition. 523-535 - Xinxin Zhang
, Shanliang Pan, Chengwu Qian, Jiadong Yuan:
Rumor Detection on Social Media by Using Global-Local Relations Encoding Network. 536-548 - Jinmeng Wu, Pengcheng Shu, Hanyu Hong, Xingxun Li, Lei Ma, Yaozong Zhang, Ying Zhu, Lei Wang:
Unsupervised Encoder-Decoder Model for Anomaly Prediction Task. 549-561 - Hongfeng Han, Zhiwu Lu, Ji-Rong Wen:
CTDA: Contrastive Temporal Domain Adaptation for Action Segmentation. 562-574 - Zhaoyong Yan, Liyan Ma, Xiangfeng Luo, Yan Sun:
Multi-scale and Multi-stage Deraining Network with Fourier Space Loss. 575-586 - Wenhua Gao, Lanju Zhang, Hao Yang, Yuan Zhang, Jinyao Yan, Tao Lin:
DHP: A Joint Video Download and Dynamic Bitrate Adaptation Algorithm for Short Video Streaming. 587-598 - Ting Pan, Fei Wang, Junzhou Xie, Weifeng Liu:
Generating New Paintings by Semantic Guidance. 599-610 - Maria Siopi, Giorgos Kordopatis-Zilos
, Polychronis Charitidis, Ioannis Kompatsiaris, Symeon Papadopoulos:
A Multi-Stream Fusion Network for Image Splicing Localization. 611-622 - Alexandros Oikonomidis
, Maria Pegia
, Anastasia Moumtzidou
, Ilias Gialampoukidis
, Stefanos Vrochidis
, Ioannis Kompatsiaris
:
Fusion of Multiple Classifiers Using Self Supervised Learning for Satellite Image Change Detection. 623-634 - Kazutoshi Shinoda, Yuki Takezawa
, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo:
Improving the Robustness to Variations of Objects and Instructions with a Neuro-Symbolic Approach for Interactive Instruction Following. 635-646 - Jiaqin Lin, Shaoyi Du, Yuying Liu, Zhiqiang Tian, Ting Qu, Nanning Zheng:
Interpretable Driver Fatigue Estimation Based on Hierarchical Symptom Representations. 647-658 - Ly-Duyen Tran, Dongyun Nie, Liting Zhou, Binh T. Nguyen, Cathal Gurrin
:
VAISL: Visual-Aware Identification of Semantic Locations in Lifelog. 659-670 - Xin Zhao, Zhihang Ren:
Multi-scale Gaussian Difference Preprocessing and Dual Stream CNN-Transformer Hybrid Network for Skin Lesion Segmentation. 671-682 - Peijie Dong, Xin Niu, Zimian Wei, Hengyue Pan, Dongsheng Li, Zhen Huang:
AutoRF: Auto Learning Receptive Fields with Spatial Pooling. 683-694 - Zhihong Wu, Xiwen Qu, Jun Huang, Xuangou Wu:
In-Air Handwritten Chinese Text Recognition with Attention Convolutional Recurrent Network. 695-707
BNI: Brave New Ideas
- Thu Nguyen, Andrea M. Storås, Vajira Thambawita, Steven Alexander Hicks, Pål Halvorsen, Michael A. Riegler:
Multimedia Datasets: Challenges and Future Possibilities. 711-717 - Zhengyu Zhao, Nga Dang, Martha A. Larson:
The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial Images. 718-725
Research2Biz
- Fredrik Håland Jensen, Oda Elise Nordberg, Andy Opel, Lars Nyre:
Students Take Charge of Climate Communication. 729-735
Demo
- Yibo Hu, Chenghao Yan, Chenyu Cao, Haorui Wang, Bin Wu:
Social Relation Graph Generation on Untrimmed Video. 739-744 - Jonathan Geffen
:
Improving Parent-Child Co-play in a Roblox Game. 745-750 - Victor Adriel de Jesus Oliveira
, Gernot Rottermanner
, Magdalena Boucher
, Stefanie Größbacher
, Peter Judmaier
, Werner Bailer
, Georg Thallinger
, Thomas Kurz
, Jakob Frank, Christoph Bauer, Gabriele Fröschl, Michael Batlogg:
Taylor - Impersonation of AI for Audiovisual Content Documentation and Search. 751-757 - Daiki Shimizu, Keiji Yanai:
Virtual Try-On Considering Temporal Consistency for Videoconferencing. 758-763

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.