7th PRCV 2024: Urumqi, China - Part XI
- Zhouchen Lin, Ming-Ming Cheng, Ran He, Kurban Ubul, Wushouer Silamu, Hongbin Zha, Jie Zhou, Cheng-Lin Liu:
Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Urumqi, China, October 18-20, 2024, Proceedings, Part XI. Lecture Notes in Computer Science 15041, Springer 2025, ISBN 978-981-97-8794-4
Feature Extraction and Feature Selection
- Xianshan Li, Yuan Dong, Xingxing Ning, Pengwei Zhang, Fengda Zhao:
ADAL-GCN: Action Description Aided Learning Graph Convolution Network for Early Action Prediction. 3-22
- Xingyu Yang, Lidong Yao, Chunyan Liu, Yang Li, Yunlong Zhao:
Credit-Based Negative Sample Denoising in Contrastive Learning. 23-35
- Shiguang Wang, Zhongyu Zhang, Jian Cheng:
UDD: Dataset Distillation via Mining Underutilized Regions. 36-50
- Meng Xie, Haibing An, Songjun Han, Jiangtao Mao, Ying Jiang, Jiajia Wang:
BertTab: Table Learning with Feature Descriptions and Context. 51-65
- Menglin Wu, Anran Yang, Qingren Jia, Luo Chen, Zhinong Zhong, Juan Chen, Ning Jing:
SemVG: Semantic Fused Feature Extraction Network for Visual Geo-Localization Under Urban Street Scenes. 66-80
- Hang Xu, Jiaye Li:
Efficient Discriminative Feature Selection with Grouping Relative Comparison. 81-96
- Rongna Xie, Xiaoyu Chen, Xinru Zhang, Guozhen Shi:
Block Cipher Algorithm Identification Based on CNN-Transformer Fusion Model. 97-110
- Xusheng Gu, Changjie Qiu, Xiuhong Lin, Xinjie Yang, Yu Zang, Cheng Wang:
DSTF: Dual-Stream Spatio-Temporal Fusion Network for Event-Based Data. 111-125
Optimization and Learning Methods
- Ming Chen, Ning He, Chen Hong:
Continuous Multi-Agent Path Finding for Drone Delivery. 129-141
- Yeran Wang, ZhengHong Zhong, Junli Zhao, Shaoqing Gong, Zhenkuan Pan, Weibo Wei:
PottsNN: A Variational Neural Network Based on Potts Model for Image Segmentation. 142-156
- Sidan Zhu, Dixin Luo:
Enhancing Multi-modal Contrastive Learning via Optimal Transport-Based Consistent Modality Alignment. 157-171
- Chuan Li, Xiao Teng, Yan Ding, Changjian Wang, Zheng Qin, Long Lan, Jing Zhang:
Instance-Level Scaling and Dynamic Margin-Alignment Knowledge Distillation. 172-186
- Li Li, Youyi Song, Xiang Dong, Peng Yang, Tianfu Wang, Baiying Lei:
Misclassification Detection via Counterexample Learning for Trustworthy Cervical Cancer Screening. 187-200
Performance Evaluation and Benchmarks
- Xiaoyi Han, Nan Pu, Zunlei Feng, Yijun Bei, Qifei Zhang, Lechao Cheng, Liang Xue:
Benchmarking Multi-Scene Fire and Smoke Detection. 203-218
- Yijun Zhou, Zilu Ying, Haolin Lv, Xinru Li, Jie You, Yingwen Chen, Kanghong Tan:
Performance Evaluation of Anomaly Detection with a New Battery Surface Anomaly Dataset. 219-231
- Zhuheng Lu, Ting Wu, Yuewei Dai, Weiqing Li, Zhiyong Su:
Fine-Grained Metrics for Point Cloud Semantic Segmentation. 232-245
- Hongxia Gao, Zhenming Guan, Yaobin Huang, Xiaomeng Li, Hongyu Liao, Bin Huang, Hongzhen Zheng, Runze Lin, Litao Li, Haolin Tang, Guoyuan Lin, Zhanhong Chen:
114Xray: A Large-Scale X-Ray Security Detection Benchmark and Aware Enhance Network for Real-World Prohibited Item Inspection in Baggage. 246-260
Multimedia Analysis and Reasoning
- Zhiyong Chen, Xinnuo Li, Zhiqi Ai, Shugong Xu:
StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis. 263-277
- Bokang Li, Changsheng Chen:
Robust Document Presentation Attack Detection via Diffusion Models and Knowledge Distillation. 278-291
- Zhuo Tian, Xiaoyi Zhou, Fan Xing, Ruiyang Zhao:
Towards the Transferable Reversible Adversarial Example via Distribution-Relevant Attack. 292-305
- Chaofei Bu, Xueliang Liu, Zhen Huang, Yuling Su, Junfeng Tu, Richang Hong:
Fine-grained Feature Assisted Cross-modal Image-text Retrieval. 306-320
- Yuxian Wu, Chengji Wang, Jingzhe Li, Wenjing Zhang, Xingpeng Jiang:
Uncertainty-Aware Gradient Modulation and Feature Masking for Multimodal Sentiment Analysis. 321-335
- Zhihua Xie, Xionghui Ye:
Local and Global Features Interactive Fusion Network for Macro- and Micro-expression Spotting in Long Videos. 336-350
- Qianyi Zhao, Mengyin Wang, Qing Zhang, Fasheng Wang, Fuming Sun:
OmniStyleGAN for Style-Guided Image-to-Image Translation. 351-365
- Yangyang Wang, Changtao Miao, Qi Chu, Tao Gong, Dianmo Sheng, Jiazhen Wang, Bin Liu, Nenghai Yu:
Detect Text Forgery with Non-forged Image Features: A Framework for Detection and Grounding of Image-Text Manipulation. 366-380
Face Recognition and Pose Recognition
- Zhenyu Zhang, Wenhao Chai, Zhongyu Jiang, Tian Ye, Mingli Song, Jenq-Neng Hwang, Gaoang Wang:
MPM: A Unified 2D-3D Human Pose Representation via Masked Pose Modeling. 383-398
- Feixiang Zhang, Xiao Sun:
Joint Multi-cue Learning for Emotion Recognition in Human-Computer Interaction. 399-411
- Zhaokun Li, Qiong Liu:
Depth Decoupling for Bottom-Up Multi-Person 3D Pose Estimation. 412-428
- Wenkui Yang, Zhida Zhang, Xiaoqiang Zhou, Junxian Duan, Jie Cao:
TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection. 429-443
- Zhongguang Zhang, Wenzhu Xu, Min Tang, Yulin Zhou, Qifei Zhang, Chao Wu, Zhao Wang:
Walking is Matter: A Benchmark for Fine-Grained Gait Segmentation. 444-458
- Xinglong Mao, Shifeng Liu, Sirui Zhao, Yiming Zhang, Hao Wang, Tong Xu, Enhong Chen:
H2LMER: A Cross Frame-Rate Representation Alignment Framework for Micro-expression Recognition. 459-472
- Chunlei Peng, Tao Chen, Decheng Liu, Yu Zheng, Nannan Wang:
Spatial-Frequency Dual-Stream Reconstruction for Deepfake Detection. 473-487
- Ziqi Gao, Qiufu Li, Linlin Shen, Junpeng Yang:
3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition. 488-503
- Yuexuan Feng, Songchen Dai, Qifei Zhang, Zhao Wang, Xianmin Zhang, Yulin Zhou:
M3Pose: Multi-Person 3D Pose Estimation Using Sparse Millimeter-Wave Radar Point Clouds. 504-517
- Wenrui Zhu, Qiankun Li, Debin Liu, Zengfu Wang:
DeepSweep: Real-Time Multi-View 3D Pose Estimation Via Cross-View Deep Matching and Plane Sweeping. 518-532
- Yinghao Yang, Sanyi Zhang, Long Ye, Neng Rao, Xudong Luo:
PoseVR: Structure-Aware Hybrid Full-Body Pose Estimation in Virtual Reality. 533-548
- Xiaojia Wang, Mingliang Zhang, Bin Li:
Fusion Network Based on Motion Learning and Image Feature Representation for Micro-Expression Recognition. 549-562
- Yiben Jiang, Xiao Yang, Keren Fu, Hongyu Yang:
Depth-Aware Dual-Stream Interactive Transformer Network for Facial Expression Recognition. 563-577
- Xinnan Ma, Yaochen Li, Limeng Zhao, ChenXu Zhou, Yuncheng Xu:
SCALE-Pose: Skeletal Correction and Language Knowledge-assisted for 3D Human Pose Estimation. 578-592