default search action
21st PRICAI 2024: Kyoto, Japan - Part III
- Rafik Hadfi, Patricia Anthony, Alok Sharma, Takayuki Ito, Quan Bai:
PRICAI 2024: Trends in Artificial Intelligence - 21st Pacific Rim International Conference on Artificial Intelligence, PRICAI 2024, Kyoto, Japan, November 18-24, 2024, Proceedings, Part III. Lecture Notes in Computer Science 15283, Springer 2025, ISBN 978-981-96-0121-9
Large Language Models
- Jing Xiao, Guijin Lin, Ping Li:
MLRQA: A Dataset with Multimodal Logical Reasoning Challenges. 3-14 - Huizhong Ji, Rafal Rzepka:
Fame Bias - Large Language Models Change Their Judgement Depending on Personal Name. 15-20 - Yajing Tan, Yuwei Huang, Qiqi Duan, Yijun Yang, Yuhui Shi:
Distributed Population-Based Simultaneous Perturbation Stochastic Approximation for Fine-Tuning Large Language Models. 21-26 - Yimin Du, Bi Zeng, Qingmao Wei, Boquan Zhang, Huiting Hu:
Transformer-Mamba-Based Trident-Branch RGB-T Tracker. 27-40 - Qinyan Dai, Yuxiang Lu, Chunlin Wang, Hongtao Lu:
MMAT: Multi-scale Multi-attention Transformer for Fine-Grained Wild Fungi Visual Classification. 41-53 - Yunlong Fan, Zhiheng Yang, Baixuan Li, Zhiqiang Gao:
Enhancing Parameter-Efficient Transformers with Contrastive Syntax and Regularized Dropout for Neural Machine Translation. 54-65
Computer Vision
- Kexin Bao, Fanzhao Lin, Ruyue Liu, Shiming Ge:
DB-FSCIL: Few-Shot Class-Incremental Learning Using Dual Bridges. 69-75 - Biyi Chen, Zebing Wei, WenJie Lei, Chengguang Wang:
GMMotion: Neighborhood Information Matters for Online Multi-pedestrian Tracking. 76-88 - Yufeng Chen, Guanghui Yue, Weide Liu, Chenlei Lv, Ruomei Wang, Fan Zhou, Baoquan Zhao:
Predicting Plain Text Imageability for Faithful Prompt-Conditional Image Generation. 89-95 - Chengkun Diao, Jinyu Shi:
BFNet: A Bi-frequency Fusion Semantic Segmentation Network for High-Resolution Remote Sensing Images. 96-108 - Huyen Thi Dinh, Kim Ngan Nguyen, Phuong Anh Le, Viet Hoang Nguyen:
An Improved Model of Detecting Ground Military Targets from Horizontal View. 109-121 - Jingyun Fang, Tianyang Dong, Jing Fan:
A Copy-Paste Data Augmentation Method for Urban Tree Detection. 122-133 - Shiwei Fang, Yu Xiang, Jun Zhang, Wenyong Wang:
A Novel Geometric-Encoded and Feature-Fused Model for Pressure Distribution Prediction on Airfoils. 134-146 - Haijun Huang, Teng Tian, Jing Zhao, Yidong Gu, Ruwang Jiao, Tao Peng:
Artificial Intelligence-Guided Fully-Automatic Renal Segmentation. 147-157 - Nguyen-Khang Le, Dieu-Hien Nguyen, Le-Minh Nguyen:
Integrating Vision-Tool to Enhance Visual-Question-Answering in Special Domains. 158-169 - Jiafeng Li:
AGLTN: Attention-Based Global-Local Transformer Network for Ultra-high Resolution Images. 170-181 - Jiaqi Li, Jiawei Wang, Jiahao He, Ming Ma:
GAMF-Net: A Lightweight Network for Semantic Segmentation of Land Cover Recognition in Open-Pit Coal Mining Areas. 182-194 - Xiaoyang Li, Wenzhu Yang, Zhenchao Cui:
Action Recognition Based on Multi-perspective Feature Excitation. 195-207 - Zeyu Li, Hanxiang Yang, Sheng Yang, Xiongxin Tang, Fanjiang Xu, Qiao Chen:
HQPAFT: Enhancing Low-Light Images with High-Quality Priors and Advanced Feature Transformations Using Only Normal Light Images. 208-219 - Bin Ma, Chunxin Zhao, Ruihe Ma, Yongjin Xian, Chunpeng Wang:
A Reversible Data Hiding in Encryption Domain for JPEG Image Based on Controllable Ciphertext Range of Paillier Homomorphic Encryption Algorithm. 220-232 - Gaoyuan Miao, Rong Quan, Cong Pan, Zhiheng Hu, Jie Qin:
BEVTemp: Enhancing Vision-Based Roadside 3D Object Detection with Temporal Information. 233-245 - Shun Qin, WenZhuo Han, Jinlai Zhang, Wenqi Yang, Kai Gao, Jin Li:
CPNet: Controllable Point Cloud Generation Network Using Part-Level Information. 246-257 - Chaomin Shen, Hao Huang, Zhongyi Zhou:
AffViT: Fast Affine Medical Image Registration with Convolutional Vision Transformer. 258-270 - Shouhong Wan, Sizhe Chen, Xiaoting Li, Peiquan Jin:
An Instance and Cloud Masks Guided Multi-source Fusion Network for Remote Sensing Object Detection. 271-283 - Kaixuan Wang, Lin Qi, Shiyu Qin, Kai Luo, Yakun Ju, Xia Li, Junyu Dong:
Image Gradient-Aided Photometric Stereo Network. 284-296 - Wenlong Wang, Pinyan Hua:
Enhancing Object Detection Accuracy with Hybrid Supervision and Trans-Stage Interaction. 297-308 - Yiqi Wang, Aiqing Zhu, Junbin Yuan, Qingzhen Xu:
Adaptive Threshold-Driven Semi-Supervised Facial Expression Recognition. 309-320 - Qiqiang Xia, Junhong Chen, Tianxiao Li, Yiheng Huang, Muhammad Asim, Nick Michiels, Wenyin Liu:
3D-HRFC: 3D-Aware Image Generation at High Resolution with Faster Convergence. 321-332 - He Xiao, Qingping Jiang, Songhao Guo, Jiahui Yang, Qiuming Liu:
AF-SSD: Self-attention Fusion Sampling and Fuzzy Classification for Enhanced Small Object Detection. 333-346 - Weizhi Xie, Yifeng Yao, Pengcheng Li:
A Facial Expression Recognition Model Based on a Hybrid Attention Mechanism with Multiple Information Spaces and Channels. 347-359 - Yuying Xie, Huahu Xu, Xingyuan Chen, Yuzhe Huang:
A Meta-learning Method for Generalizable Face Forgery Detection. 360-366 - Yuchen Yang, Lianrui Mu, Jiedong Zhuang, Xiaoyu Liang, Jiangnan Ye, Haoji Hu:
Data-Free Quantization of Vision Transformers Through Perturbation-Aware Image Synthesis. 367-379 - Yifeng Yao, Bei He, Minsheng Tan, Xiang Li, Zhenzhen Hu, Xingxing Duan, Lingna Chen:
HMM-VMamba: High-Order Morphological Method Vision Mamba for Medical Image Segmentation. 380-391 - Junyao Zhang, Kei Shimonishi, Hirotada Ueda, Kazuaki Kondo, Yuichi Nakamura:
Evaluating Subtle Positive-Negative Facial Expression Transitions for Monitoring Changes in Personal Internal States. 392-404 - Maoyu Zhang, Hai Xu, Fanfan Yan, Haoran Ding, Meng Guo:
Image Generation Method for Addressing Class Imbalance in Small-Sample Pulsar Candidates. 405-417 - Jiayu Zhao, Feifei Wei, Anqi Liang, Kuizhi Mei:
Efficient Matrix-Based Multi-view Projection Features Combined for Multi-modal 3D Semantic Segmentation. 418-429 - Chenyu Zhou, Xiuhong Li, Zhe Li, Fan Chen, Jiabao Sheng, Bin Chen, Haoyu Wang:
Enhancing Multimodal Rumor Detection with Statistical Image Features and Modal Alignment via Contrastive Learning. 430-442 - Siyue Zhou, Qun Guan, Chunlei Peng, Decheng Liu, Yu Zheng:
Audio-Driven Face Photo-Sketch Video Generation. 443-455 - Qingmeng Zhu, Yanan He, Tianxing Lan, Ziyin Gu, Yi Li, Qihuan Wu, Zhipeng Yu, Hao He:
A Decoupling Video Frame Selection Method for Action Recognition. 456-468
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.