default search action
18th VISIGRAPP 2023: Lisbon, Portugal - Volume 5: VISAPP
- Petia Radeva, Giovanni Maria Farinella, Kadi Bouatouch:
Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2023, Volume 5: VISAPP, Lisbon, Portugal, February 19-21, 2023. SCITEPRESS 2023, ISBN 978-989-758-634-7
Invited Speakers
- Alexandru C. Telea:
Beyond the Third Dimension: How Multidimensional Projections and Machine Learning Can Help Each Other. 5-16 - Ferran Argelaguet:
The Infinite Loop. VISIGRAPP 2023: 17 - Vincent Hayward:
Human Tactile Mechanics and the Design of Haptic Interfaces. VISIGRAPP 2023: 19 - Liang Zheng:
Data-Centric Computer Vision. VISIGRAPP 2023: 21
Image and Video Understanding
- Peter Lorenz, Margret Keuper, Janis Keuper:
Unfolding Local Growth Rate Estimates for (Almost) Perfect Adversarial Detection. 27-38 - Wonwoo Jo, Kyungshin Lee, Jaewon Baik, Sang-Sun Lee, Dongho Choi, Hyunkyoo Park:
DaDe: Delay-Adaptive Detector for Streaming Perception. 39-46 - Warren Jouanneau, Aurélie Bugeau, Marc Palyart, Nicolas Papadakis, Laurent Vézard:
A Patch-Based Architecture for Multi-Label Classification from Single Positive Annotations. 47-58 - Maya Antoun, Daniel C. Asmar:
Human Object Interaction Detection Primed with Context. 59-68 - Bilal Abdulrahman, Zhigang Zhu:
Absolute-ROMP: Absolute Multi-Person 3D Mesh Prediction from a Single Image. 69-79 - Annika Mütze, Matthias Rottmann, Hanno Gottschalk:
Semi-Supervised Domain Adaptation with CycleGAN Guided by Downstream Task Awareness. 80-90 - Xuan Wang, Hao Tang, Zhigang Zhu:
A General Context Learning and Reasoning Framework for Object Detection in Urban Scenes. 91-102 - Jinlai Ning, Haoyan Guan, Michael W. Spratling:
Rethinking the Backbone Architecture for Tiny Object Detection. 103-114 - Floris De Feyter, Bram Claes, Toon Goedemé:
Rotation Equivariance for Diamond Identification. 115-123 - Cyril Li, Christophe Ducottet, Sylvain Desroziers, Maxime Moreaud:
Toward Few Pixel Annotations for 3D Segmentation of Material from Electron Tomography. 124-131 - Devashish Lohani, Carlos Fernando Crispim Junior, Quentin Barthélemy, Sarah Bertrand, Lionel Robinault, Laure Tougne Rodet:
Leveraging Unsupervised and Self-Supervised Learning for Video Anomaly Detection. 132-143 - Afshin Dini, Esa Rahtu:
Visual Anomaly Detection and Localization with a Patch-Wise Transformer and Convolutional Model. 144-152 - Christian Limberg, Andrew Melnik, Helge J. Ritter, Helmut Prendinger:
YOLO: You Only Look 10647 Times. 153-160 - Arun Kumar Subramanian, Anoop M. Namboodiri:
On Attribute Aware Open-Set Face Verification. 161-172 - Joaquin Palma-Ugarte, Laura Jovani Estacio Cerquin, Victor Flores-Benites, Rensso Mora Colque:
A Lightweight Gaussian-Based Model for Fast Detection and Classification of Moving Objects. 173-184 - Ryosuke Miyake, Tetsu Matsukawa, Einoshin Suzuki:
Image Generation from a Hyper Scene Graph with Trinomial Hyperedges. 185-195 - João Soares, Luís Magalhães, Rafaela Pinho, Mehrab K. Allahdad, Manuel Ferreira:
Automatic Defect Detection in Leather. 196-204 - Saeed Bakhshi Germi, Esa Rahtu:
IFMix: Utilizing Intermediate Filtered Images for Domain Adaptation in Classification. 205-211 - Pouya Shiri, Amirali Baniasadi:
DeepCaps+: A Light Variant of DeepCaps. 212-220 - Zihao Guo, Fei Li, Rujie Liu, Ryo Ishida, Genta Suzuki:
Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection. 221-229 - Mohamed Ilyes Lakhal, Oswald Lanz, Andrea Cavallaro:
Multi-View Video Synthesis Through Progressive Synthesis and Refinement. 230-238 - Muhammad Ali, Omar Alsuwaidi, Salman Khan:
BGD: Generalization Using Large Step Sizes to Attract Flat Minima. 239-249 - Mridula Vijendran, Frederick W. B. Li, Hubert P. H. Shum:
Tackling Data Bias in Painting Classification with Style Transfer. 250-261 - Arnav Varma, Elahe Arani, Bahram Zonooz:
Dynamically Modular and Sparse General Continual Learning. 262-273 - Pedro V. V. Paiva, Josué J. G. Ramos, Marina L. Gavrilova, Marco A. G. Carvalho:
Emotion Transformer: Attention Model for Pose-Based Emotion Recognition. 274-281 - Francesco Pasti, Nicola Bellotto:
Evaluation of Computer Vision-Based Person Detection on Low-Cost Embedded Systems. 282-293 - Otto Brookes, Majid Mirmehdi, Hjalmar S. Kühl, Tilo Burghardt:
Triple-stream Deep Metric Learning of Great Ape Behavioural Actions. 294-302 - David Dueñas Gaviria, Md Mostafa Kamal Saker, Petia Radeva:
Efficient Deep Learning Ensemble for Skin Lesion Classification. 303-314 - Barbara Caroline Benato, Alexandre Xavier Falcão, Alexandru Cristian Telea:
Linking Data Separation, Visual Separation, and Classifier Performance Using Pseudo-labeling by Contrastive Learning. 315-324 - Emilie Mathian, Huidong Liu, Lynnette Fernandez-Cuesta, Dimitris Samaras, Matthieu Foll, Liming Chen:
HaloAE: A Local Transformer Auto-Encoder for Anomaly Detection and Localization Based on HaloNet. 325-337 - Aditya Kallappa, Sandeep Nagar, Girish Varma:
FInC Flow: Fast and Invertible k × k Convolutions for Normalizing Flows. 338-348 - Thomas Duboudin, Emmanuel Dellandréa, Corentin Abgrall, Gilles Hénaff, Liming Chen:
Learning Less Generalizable Patterns for Better Test-Time Adaptation. 349-358 - Silas Evandro Nachif Fernandes, Leandro A. Passos, Danilo S. Jodas, Marco Akio, André N. de Souza, João Paulo Papa:
A Multi-Class Probabilistic Optimum-Path Forest. 361-368 - Rasna A. Amit, C. Krishna Mohan:
Quantitative Analysis to Find the Optimum Scale Range for Object Representations in Remote Sensing Images. 369-379 - Pawel Majewski, Piotr Lampa, Robert Burduk, Jacek Reiner:
Mixing Augmentation and Knowledge-Based Techniques in Unsupervised Domain Adaptation for Segmentation of Edible Insect States. 380-387 - Kirill Prokofiev, Vladislav Sovrasov:
Combining Metric Learning and Attention Heads for Accurate and Efficient Multilabel Image Classification. 388-396 - Kira Maag, Matthias Rottmann:
False Negative Reduction in Semantic Segmentation Under Domain Shift Using Depth Estimation. 397-408 - Jonay Suárez-Ramírez, Alejandro Betancor-Del-Rosario, Daniel Santana-Cedrés, Nelson Monzón:
Exploring Deep Learning Capabilities for Coastal Image Segmentation on Edge Devices. 409-418 - Yuka Ogino, Yuho Shoji, Takahiro Toizumi, Ryoma Oami, Masato Tsukada:
Fast Eye Detector Using Siamese Network for NIR Partial Face Images. 419-428 - Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
Understanding of Feature Representation in Convolutional Neural Networks and Vision Transformer. 429-436 - Eduardo de O. Andrade, Igor Garcia Ballhausen Sampaio, Joris Guérin, José Viterbo:
Combining Two Adversarial Attacks Against Person Re-Identification Systems. 437-444 - Takahiro Suzuki, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
1D-SalsaSAN: Semantic Segmentation of LiDAR Point Cloud with Self-Attention. 445-452 - Yuki Saito, Hideo Saito, Vincent Frémont:
Monocular Depth Estimation for Tilted Images via Gravity Rectifier. 453-463 - Michael Danner, Bakir Hadzic, Robert Radloff, Xueping Su, Le Ping Peng, Thomas Weber, Matthias Rätsch:
Overcome Ethnic Discrimination with Unbiased Machine Learning for Facial Data Sets. 464-471 - Sayeh Gholipour Picha, Dawood Al Chanti, Alice Caplier:
How far Generated Data Can Impact Neural Networks Performance? 472-479 - Timothée Fréville, Charles Hamesse, Benoît Pairet, Rob Haelterman:
Object Detection in Floor Plans for Automated VR Environment Generation. 480-486 - Simon Thomine, Hichem Snoussi, Mahmoud Soua:
MixedTeacher: Knowledge Distillation for Fast Inference Textural Anomaly Detection. 487-494 - Jose Huaman, Felix O. Sumari H., Luigy Machaca, Esteban Clua, Joris Guérin:
Benchmarking Person Re-Identification Datasets and Approaches for Practical Real-World Implementations. 495-502 - Taiki Yano, Nobutaka Kimura, Kiyoto Ito:
Surface-Graph-Based 6DoF Object-Pose Estimation for Shrink-Wrapped Items Applicable to Mixed Depalletizing Robots. 503-511 - Farzan Heidari, Michael A. Bauer:
Impact of Vehicle Speed on Traffic Signs Missed by Drivers. 512-519 - Abir Fathallah, Mounim A. El-Yacoubi, Najoua Essoukri Ben Amara:
Transfer Learning for Word Spotting in Historical Arabic Documents Based Triplet-CNN. 520-527 - Zhangchi Lu, Mertcan Cokbas, Prakash Ishwar, Janusz Konrad:
Estimating Distances Between People Using a Single Overhead Fisheye Camera with Application to Social-Distancing Oversight. 528-535 - Luis E. Chuquimarca, Boris Xavier Vintimilla, Sergio A. Velastin:
Banana Ripeness Level Classification Using a Simple CNN Model Trained with Real and Synthetic Datasets. 536-543 - Reshawn Ramjattan, Rajeev Ratan, Shiva Ramoudith, Patrick Hosein, Daniele Mazzei:
Using Continual Learning on Edge Devices for Cost-Effective, Efficient License Plate Detection. 544-550 - Daniel Perazzo, Thiago de Souza, Pietro Masur, Eduardo de Amorim, Pedro de Oliveira, Kelvin B. da Cunha, Lucas Maggi, Francisco Simões, Veronica Teichrieb, Lucas N. Kirsten:
FedBID and FedDocs: A Dataset and System for Federated Document Analysis. 551-558 - Yuki Hirose, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi:
An Experimental Consideration on Gait Spoofing. 559-566 - Masaya Mizuno, Yasutomo Kawanishi, Tomohiro Fujita, Daisuke Deguchi, Hiroshi Murase:
Subjective Baggage-Weight Estimation from Gait: Can You Estimate How Heavy the Person Feels? 567-574 - Amal El Kaid, Karim Baïna, Jamal Baïna, Vincent Barra:
Real-World Case Study of a Deep Learning Enhanced Elderly Person Fall Video-Detection System. 575-582 - Chenyu Wang, Toshio Endo, Takahiro Hirofuchi, Tsutomu Ikegami:
Pyramid Swin Transformer: Different-Size Windows Swin Transformer for Image Classification and Object Detection. 583-590 - Ali Raza, Muhammad Haroon Yousaf, Sergio A. Velastin, Serestina Viriri:
Human Fall Detection from Sequences of Skeleton Features using Vision Transformer. 591-598 - Yuichi Kamata, Moyuru Yamada, Takayuki Okatani:
Self-Modularized Transformer: Learn to Modularize Networks for Systematic Generalization. 599-606 - Rina Tagami, Hiroki Kobayashi, Shuichi Akizuki, Manabu Hashimoto:
Fast and Reliable Template Matching Based on Effective Pixel Selection Using Color and Intensity Information. 607-614 - Jack W. Barker, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon:
Robust Semi-Supervised Anomaly Detection via Adversarially Learned Continuous Noise Corruption. 615-625 - Nam Tuan Ly, Atsuhiro Takasu:
An End-to-End Multi-Task Learning Model for Image-based Table Recognition. 626-634 - Juan Pablo Lagos, Esa Rahtu:
PanDepth: Joint Panoptic Segmentation and Depth Completion. 635-643 - Viral Parekh, Karimulla Shaik:
Multi-Scale Feature Based Fashion Attribute Extraction Using Multi-Task Learning for e-Commerce Applications. 644-651 - Patrick Feifel, Frank Bonarens, Frank Köster:
Domain Adaptive Pedestrian Detection Based on Semantic Concepts. 652-659 - Jalila Filali, Denis Laurendeau, Steeve D. Côté:
Environmental Information Extraction Based on YOLOv5-Object Detection in Videos Collected by Camera-Collars Installed on Migratory Caribou and Black Bears in Northern Quebec. 660-667 - Souha Mansour, Saoussen Ben Jabra, Ezzeddine Zagrouba:
A Robust Deep Learning-Based Video Watermarking Using Mosaic Generation. 668-675 - Pawel Foszner, Agnieszka Szczesna, Luca Ciampi, Nicola Messina, Adam Cygan, Bartosz Bizon, Michal Cogiel, Dominik Golba, Elzbieta Macioszek, Michal Staniszewski:
CrowdSim2: An Open Synthetic Benchmark for Object Detectors. 676-683 - Guilherme Gadelha, Herman Martins Gomes, Leonardo Vidal Batista:
Neural Architecture Search in the Context of Deep Multi-Task Learning. 684-691 - Deisy Chaves, Nancy Agarwal, Eduardo Fidalgo, Enrique Alegre:
A Data Augmentation Strategy for Improving Age Estimation to Support CSEM Detection. 692-699 - Ryouichi Furukawa, Kazuhiro Hotta:
Shuffle Mixing: An Efficient Alternative to Self Attention. 700-707 - Takahiro Mano, Sota Kato, Kazuhiro Hotta:
Semantic Segmentation by Semi-Supervised Learning Using Time Series Constraint. 708-714 - Floris De Feyter, Toon Goedemé:
Joint Training of Product Detection and Recognition Using Task-Specific Datasets. 715-722 - Simon Mariani, Sander R. Klomp, Rob Romijnders, Peter H. N. de With:
The Effect of Covariate Shift and Network Training on Out-of-Distribution Detection. 723-730 - Ayato Takama, Sota Kato, Satoshi Kamiya, Kazuhiro Hotta:
Improvement of Vision Transformer Using Word Patches. 731-736 - Ana Paula dos Santos Dantas, Gabriel Bianchin de Oliveira, Daiane Mendes de Oliveira, Hélio Pedrini, Cid C. de Souza, Zanoni Dias:
Algorithmic Fairness Applied to the Multi-Label Classification Problem. 737-744 - Mohamed Dhouioui, Tarek Frikha, Hassen Drira, Mohamed Abid:
A Novel 3D Face Reconstruction Model from a Multi-Image 2D Set. 745-753 - Laure Acin, Pierre Jacob, Camille Simon Chane, Aymeric Histace:
VK-SITS: Variable Kernel Speed Invariant Time Surface for Event-Based Recognition. 754-761 - Romain Guesdon, Carlos Fernando Crispim Junior, Laure Tougne Rodet:
Synthetic Driver Image Generation for Human Pose-Related Tasks. 762-769 - Galina Zalesskaya, Bogna Bylicka, Eugene Liu:
How to Train an Accurate and Efficient Object Detection Model on any Dataset. 770-778 - Mircea Paul Muresan, Robert Schlanger, Radu Danescu, Sergiu Nedevschi:
Real-Time Obstacle Detection using a Pillar-based Representation and a Parallel Architecture on the GPU from LiDAR Measurements. 779-787 - Yuka Nokihara, Ryosuke Hori, Ryo Hachiuma, Hideo Saito:
Prediction of Shuttle Trajectory in Badminton Using Player's Position. 788-795 - Maria Pateraki, Panagiotis Sapoutzoglou, Manolis I. A. Lourakis:
Crane Spreader Pose Estimation from a Single View. 796-805 - Nikolaos Poulopoulos, Emmanouil Z. Psarakis:
Few-Shot Gaze Estimation via Gaze Transfer. 806-813 - João Almeida, Gonçalo Cruz, Diogo Silva, Tiago Oliveira:
Application of Deep Learning to the Detection of Foreign Object Debris at Aerodromes' Movement Area. 814-821 - Hadjer Boughanem, Haythem Ghazouani, Walid Barhoumi:
YCbCr Color Space as an Effective Solution to the Problem of Low Emotion Recognition Rate of Facial Expressions In-The-Wild. 822-829 - Felipe Moreno Vera, Edgar Medina, Jorge Poco:
WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification. 830-837 - Xuehao Liu, Sarah Jane Delany, Susan McKeever:
Applying Positional Encoding to Enhance Vision-Language Transformers. 838-845 - Odalisio L. S. Neto, Felipe G. Oliveira, João M. B. Cavalcanti, José L. S. Pio:
Brazilian Banknote Recognition Based on CNN for Blind People. 846-853 - Natália F. de C. Meira, Ricardo C. Câmara de M. Santos, Mateus C. Silva, Eduardo José da S. Luz, Ricardo A. R. Oliveira:
Towards an Automatic System for Generating Synthetic and Representative Facial Data for Anonymization. 854-861 - Chintan Tundia, Rajiv Kumar, Om P. Damani, G. Sivakumar:
FPCD: An Open Aerial VHR Dataset for Farm Pond Change Detection. 862-869 - Rajiv Kumar, G. Sivakumar:
DEff-GAN: Diverse Attribute Transfer for Few-Shot Image Synthesis. 870-877 - Poulami Sinhamahapatra, Lena Heidemann, Maureen Monnet, Karsten Roscher:
Towards Human-Interpretable Prototypes for Visual Assessment of Image Classification Models. 878-887 - Wafa Aissa, Marin Ferecatu, Michel Crucianu:
Curriculum Learning for Compositional Visual Reasoning. 888-897 - Hayato Yumiya, Daisuke Deguchi, Yasutomo Kawanishi, Hiroshi Murase:
End-to-End Gaze Grounding of a Person Pictured from Behind. 898-905 - Mattias Billast, Kevin Mets, Tom De Schepper, José Oramas, Steven Latré:
Human Motion Prediction on the IKEA-ASM Dataset. 906-914
Motion, Tracking and Stereo Vision
- Léo Renaut, Heike Frei, Andreas Nüchter:
Smoothed Normal Distribution Transform for Efficient Point Cloud Registration During Space Rendezvous. 919-930 - Norio Tagawa, Ming Yang:
On Computing Three-Dimensional Camera Motion from Optical Flow Detected in Two Consecutive Frames. 931-942 - Alexander Dolokov, Niek Andresen, Katharina Hohlbaum, Christa Thöne-Reineke, Lars Lewejohann, Olaf Hellwich:
Upper Bound Tracker: A Multi-Animal Tracking Solution for Closed Laboratory Settings. 945-952 - Dominik Penk, Maik Horn, Christoph Strohmeyer, Frank Bauer, Marc Stamminger:
DeNos22: A Pipeline to Learn Object Tracking Using Simulated Depth. 953-962 - Mahmoud Z. Khairallah, Abanob Soliman, Fabien Bonardi, David Roussel, Samia Bouchafa:
Flow-Based Visual-Inertial Odometry for Neuromorphic Vision Sensors Using non-Linear Optimization with Online Calibration. 963-973 - Isabella de Andrade, João Paulo Lima:
Multi-Camera 3D Pedestrian Tracking Using Graph Neural Networks. 974-981 - Ritsuki Hasegawa, Fumihiko Sakaue, Jun Sato:
3D Human Body Reconstruction from Head-Mounted Omnidirectional Camera and Light Sources. 982-989 - Akira Nagatsu, Fumihiko Sakaue, Jun Sato:
3D Reconstruction of Occluded Luminous Objects. 990-996 - Johannes Künzel, Darko Vehar, Rico Nestler, Karl-Heinz Franke, Anna Hilsmann, Peter Eisert:
System for 3D Acquisition and 3D Reconstruction Using Structured Light for Sewer Line Inspection. 997-1006 - João Marcelo X. N. Teixeira, Narjara Pimentel, Eder Barbier, Enrico Bernard, Veronica Teichrieb, Gimena Chaves:
Low-Cost 3D Reconstruction of Caves. 1007-1014 - Junesuk Lee, Soon-Yong Park:
3D Mapping of Indoor Parking Space Using Edge Consistency Census Transform Stereo Odometry. 1015-1020 - Ilias Lazarou, Anastasios L. Kesidis, Andreas Tsatsaris:
Real-Time Monitoring of Crowd Panic Based on Biometric and Spatiotemporal Data. 1021-1027
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.