default search action
Yuexian Zou
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j40]Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton:
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation. IEEE Trans. Pattern Anal. Mach. Intell. 46(8): 5712-5724 (2024) - [j39]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3339-3354 (2024) - [c219]Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou:
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning. AAAI 2024: 6458-6466 - [c218]Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yaowei Li, Xianwei Zhuang, Yuexian Zou:
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport. AAAI 2024: 17844-17852 - [c217]Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou:
Exploiting Auxiliary Caption for Video Grounding. AAAI 2024: 18508-18516 - [c216]Zhihong Zhu, Xuxin Cheng, Yaowei Li, Hongxiang Li, Yuexian Zou:
Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment. AAAI 2024: 19777-19785 - [c215]Xianwei Zhuang, Xuxin Cheng, Yuexian Zou:
Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling. AAAI 2024: 19786-19794 - [c214]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804 - [c213]Zhihong Zhu, Xuxin Cheng, Zhanpeng Chen, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou:
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment. ACL (Short Papers) 2024: 153-160 - [c212]Xuxin Cheng, Zhihong Zhu, Bang Yang, Xianwei Zhuang, Hongxiang Li, Yuexian Zou:
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding. ACL (Findings) 2024: 1806-1816 - [c211]Xianwei Zhuang, Xuxin Cheng, Liming Liang, Yuxin Xie, Zhichang Wang, Zhiqi Huang, Yuexian Zou:
PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling. ACL (1) 2024: 5235-5246 - [c210]Xuxin Cheng, Ziyu Yao, Yifei Xin, Hao An, Hongxiang Li, Yaowei Li, Yuexian Zou:
Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup. ACL (1) 2024: 11283-11294 - [c209]Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang, Zhanpeng Chen, Zhiqi Huang, Yuexian Zou:
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts. ACL (Findings) 2024: 14868-14879 - [c208]Zhihong Zhu, Xuxin Cheng, Zhanpeng Chen, Zhichang Wang, Zhiqi Huang, Yuexian Zou:
SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language Understanding. CIKM 2024: 3570-3580 - [c207]Xusheng Yang, Zhengyu Chen, Yuexian Zou:
Robust Heterophily Graph Learning via Uniformity Augmentation. CIKM 2024: 4193-4197 - [c206]Bang Yang, Yue Yu, Yuexian Zou, Tong Zhang:
PCLmed: Champion Solution for ImageCLEFmedical 2024 Caption Prediction Challenge via Medical Vision-Language Foundation Models. CLEF (Working Notes) 2024: 1763-1774 - [c205]Hao An, Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Yuexian Zou:
Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic. LREC/COLING 2024: 9822-9831 - [c204]Zhihong Zhu, Xuxin Cheng, Guimin Hu, Yaowei Li, Zhiqi Huang, Yuexian Zou:
Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling. LREC/COLING 2024: 16581-16591 - [c203]Xianwei Zhuang, Hongxiang Li, Xuxin Cheng, Zhihong Zhu, Yuxin Xie, Yuexian Zou:
KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval. ECCV (34) 2024: 313-331 - [c202]Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou:
Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System. EMNLP 2024: 5410-5420 - [c201]Wanshi Xu, Xuxin Cheng, Zhihong Zhu, Zhanpeng Chen, Yuexian Zou:
Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System. EMNLP (Findings) 2024: 10409-10419 - [c200]Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou:
What are the Generator Preferences for End-to-end Task-Oriented Dialog System? EMNLP 2024: 10992-11003 - [c199]Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou:
Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection. EMNLP 2024: 17554-17567 - [c198]Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou:
Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory. EMNLP 2024: 17984-18003 - [c197]Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi:
Retrieval is Accurate Generation. ICLR 2024 - [c196]Xuxin Cheng, Yuexian Zou:
Generating More Audios for End-to-End Spoken Language Understanding. IJCAI 2024: 6234-6242 - [c195]Xianwei Zhuang, Xuxin Cheng, Zhihong Zhu, Zhanpeng Chen, Hongxiang Li, Yuexian Zou:
Towards Multimodal-augmented Pre-trained Language Models via Self-balanced Expectation-Maximization Iteration. ACM Multimedia 2024: 4670-4679 - [c194]Xianwei Zhuang, Zhichang Wang, Xuxin Cheng, Yuxin Xie, Liming Liang, Yuexian Zou:
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration. NAACL-HLT 2024: 8077-8090 - [c193]Yuming Fan, Dongming Yang, Jiguang Zhang, Bang Yang, Yuexian Zou:
Fake-GPT: Detecting Fake Image via Large Language Model. PRCV (8) 2024: 122-136 - [c192]Zhihong Zhu, Xuxin Cheng, Hongxiang Li, Yaowei Li, Yuexian Zou:
Dance with Labels: Dual-Heterogeneous Label Graph Interaction for Multi-intent Spoken Language Understanding. WSDM 2024: 1022-1031 - [i106]Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou:
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning. CoRR abs/2401.17186 (2024) - [i105]Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi:
Retrieval is Accurate Generation. CoRR abs/2402.17532 (2024) - [i104]Chenchen Tao, Chong Wang, Yuexian Zou, Xiaohao Peng, Jiafei Wu, Jiangbo Qian:
Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection. CoRR abs/2403.01169 (2024) - [i103]Deshun Yang, Luhui Hu, Yu Tian, Zihao Li, Chris Kelly, Bang Yang, Cindy Yang, Yuexian Zou:
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs. CoRR abs/2403.07944 (2024) - [i102]Chris Kelly, Luhui Hu, Bang Yang, Yu Tian, Deshun Yang, Cindy Yang, Zaoshan Huang, Zihao Li, Jiayin Hu, Yuexian Zou:
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework. CoRR abs/2403.09027 (2024) - [i101]Chris Kelly, Luhui Hu, Jiayin Hu, Yu Tian, Deshun Yang, Bang Yang, Cindy Yang, Zihao Li, Zaoshan Huang, Yuexian Zou:
VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding. CoRR abs/2403.09530 (2024) - [i100]Xuxin Cheng, Wanshi Xu, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning. CoRR abs/2405.20852 (2024) - [i99]Bowen Cao, Deng Cai, Zhisong Zhang, Yuexian Zou, Wai Lam:
On the Worst Prompt Performance of Large Language Models. CoRR abs/2406.10248 (2024) - [i98]Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Yuexian Zou, Ying Shan:
Image Conductor: Precision Control for Interactive Video Synthesis. CoRR abs/2406.15339 (2024) - [i97]Yifei Xin, Zhihong Zhu, Xuxin Cheng, Xusheng Yang, Yuexian Zou:
Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation. CoRR abs/2409.09256 (2024) - [i96]Yifei Xin, Xuxin Cheng, Zhihong Zhu, Xusheng Yang, Yuexian Zou:
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval. CoRR abs/2409.10025 (2024) - [i95]Ziyu Yao, Jialin Li, Yifeng Zhou, Yong Liu, Xi Jiang, Chengjie Wang, Feng Zheng, Yuexian Zou, Lei Li:
CAR: Controllable Autoregressive Modeling for Visual Generation. CoRR abs/2410.04671 (2024) - 2023
- [j38]Liyu Wu, Can Zhang, Yuexian Zou:
SpatioTemporal focus for skeleton-based action recognition. Pattern Recognit. 136: 109231 (2023) - [j37]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Integrating Lattice-Free MMI Into End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 25-38 (2023) - [j36]Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 849-862 (2023) - [j35]Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-Sound Generation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1720-1733 (2023) - [j34]Bang Yang, Meng Cao, Yuexian Zou:
Concept-Aware Video Captioning: Describing Videos With Effective Prior Information. IEEE Trans. Image Process. 32: 5366-5378 (2023) - [c191]Bowen Cao, Qichen Ye, Weiyuan Xu, Yuexian Zou:
FTM: A Frame-Level Timeline Modeling Method for Temporal Graph Representation Learning. AAAI 2023: 6888-6896 - [c190]Qichen Ye, Bowen Cao, Nuo Chen, Weiyuan Xu, Yuexian Zou:
FiTs: Fine-Grained Two-Stage Training for Knowledge-Aware Question Answering. AAAI 2023: 13914-13922 - [c189]Bang Yang, Fenglin Liu, Zheng Li, Qingyu Yin, Chenyu You, Bing Yin, Yuexian Zou:
Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels. ACL (Findings) 2023: 2652-2665 - [c188]Xuxin Cheng, Bowen Cao, Qichen Ye, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding. ACL (Findings) 2023: 6492-6505 - [c187]Bang Yang, Fenglin Liu, Xian Wu, Yaowei Wang, Xu Sun, Yuexian Zou:
MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning. ACL (1) 2023: 11908-11922 - [c186]Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Dongsheng Chen, Yuexian Zou:
Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations. ACL (Findings) 2023: 12523-12531 - [c185]Wen Wang, Dongchao Yang, Qichen Ye, Bowen Cao, Yuexian Zou:
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement. APSIPA ASC 2023: 2416-2423 - [c184]Xuxin Cheng, Wanshi Xu, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning. CIKM 2023: 326-336 - [c183]Xuxin Cheng, Zhihong Zhu, Yaowei Li, Hongxiang Li, Yuexian Zou:
DAS-CL: Towards Multimodal Machine Translation via Dual-Level Asymmetric Contrastive Learning. CIKM 2023: 337-347 - [c182]Bang Yang, Asif Raza, Yuexian Zou, Tong Zhang:
PCLmed at ImageCLEFmedical 2023: Customizing General-Purpose Foundation Models for Medical Report Generation. CLEF (Working Notes) 2023: 1754-1766 - [c181]Hao An, Dongsheng Chen, Weiyuan Xu, Zhihong Zhu, Yuexian Zou:
TLAG: An Informative Trigger and Label-Aware Knowledge Guided Model for Dialogue-based Relation Extraction. CSCWD 2023: 59-64 - [c180]Meng Cao, Fangyun Wei, Can Xu, Xiubo Geng, Long Chen, Can Zhang, Yuexian Zou, Tao Shen, Daxin Jiang:
Iterative Proposal Refinement for Weakly-Supervised Video Grounding. CVPR 2023: 6524-6534 - [c179]Zhihong Zhu, Xuxin Cheng, Zhiqi Huang, Dongsheng Chen, Yuexian Zou:
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence. EMNLP 2023: 7849-7856 - [c178]Xuxin Cheng, Zhihong Zhu, Wanshi Xu, Yaowei Li, Hongxiang Li, Yuexian Zou:
Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation. EMNLP (Findings) 2023: 8900-8910 - [c177]Xuxin Cheng, Zhihong Zhu, Bowen Cao, Qichen Ye, Yuexian Zou:
MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling. EMNLP (Findings) 2023: 10495-10505 - [c176]Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou:
M3ST: Mix at Three Levels for Speech Translation. ICASSP 2023: 1-5 - [c175]Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yaowei Li, Yuexian Zou:
SSVMR: Saliency-Based Self-Training for Video-Music Retrieval. ICASSP 2023: 1-5 - [c174]Tengtao Song, Nuo Chen, Ji Jiang, Zhihong Zhu, Yuexian Zou:
Improving Retrieval-Based Dialogue System Via Syntax-Informed Attention. ICASSP 2023: 1-5 - [c173]Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou:
Improving Weakly Supervised Sound Event Detection with Causal Intervention. ICASSP 2023: 1-5 - [c172]Yifei Xin, Dongchao Yang, Yuexian Zou:
Improving Text-Audio Retrieval by Text-Aware Attention Pooling and Prior Matrix Revised Loss. ICASSP 2023: 1-5 - [c171]Zhihong Zhu, Weiyuan Xu, Xuxin Cheng, Tengtao Song, Yuexian Zou:
A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding. ICASSP 2023: 1-5 - [c170]Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation. ICCV 2023: 2851-2862 - [c169]Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou:
G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory. ICCV 2023: 11998-12008 - [c168]Yifei Xin, Yuexian Zou:
Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions. INTERSPEECH 2023: 341-345 - [c167]Xuxin Cheng, Wanshi Xu, Ziyu Yao, Zhihong Zhu, Yaowei Li, Hongxiang Li, Yuexian Zou:
FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding. INTERSPEECH 2023: 690-694 - [c166]Xuxin Cheng, Ziyu Yao, Zhihong Zhu, Yaowei Li, Hongxiang Li, Yuexian Zou:
C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding. INTERSPEECH 2023: 695-699 - [c165]Xuxin Cheng, Zhihong Zhu, Ziyu Yao, Hongxiang Li, Yaowei Li, Yuexian Zou:
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering. INTERSPEECH 2023: 1134-1138 - [c164]Yifei Xin, Dongchao Yang, Yuexian Zou:
Background-aware Modeling for Weakly Supervised Sound Event Detection. INTERSPEECH 2023: 1199-1203 - [c163]Zhihong Zhu, Xuxin Cheng, Dongsheng Chen, Zhiqi Huang, Hongxiang Li, Yuexian Zou:
Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning. INTERSPEECH 2023: 3969-3973 - [c162]Dongchao Yang, Songxiang Liu, Helin Wang, Jianwei Yu, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. INTERSPEECH 2023: 4798-4802 - [c161]Jiang Ji, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou:
Video Referring Expression Comprehension via Transformer with Content-conditioned Query. MMIR@MM 2023: 39-48 - [i94]Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou:
Generating Templated Caption for Video Grounding. CoRR abs/2301.05997 (2023) - [i93]Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yaowei Li, Yuexian Zou:
SSVMR: Saliency-based Self-training for Video-Music Retrieval. CoRR abs/2302.09328 (2023) - [i92]Qichen Ye, Bowen Cao, Nuo Chen, Weiyuan Xu, Yuexian Zou:
FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering. CoRR abs/2302.11799 (2023) - [i91]Bowen Cao, Qichen Ye, Weiyuan Xu, Yuexian Zou:
FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning. CoRR abs/2302.11814 (2023) - [i90]Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou:
Improving Weakly Supervised Sound Event Detection with Causal Intervention. CoRR abs/2303.05678 (2023) - [i89]Yifei Xin, Dongchao Yang, Yuexian Zou:
Improving Text-Audio Retrieval by Text-aware Attention Pooling and Prior Matrix Revised Loss. CoRR abs/2303.05681 (2023) - [i88]Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton:
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation. CoRR abs/2303.06458 (2023) - [i87]Tengtao Song, Nuo Chen, Ji Jiang, Zhihong Zhu, Yuexian Zou:
Improve Retrieval-based Dialogue System via Syntax-Informed Attention. CoRR abs/2303.06605 (2023) - [i86]Ziyu Yao, Xuxin Cheng, Yuexian Zou:
PoseRAC: Pose Saliency Transformer for Repetitive Action Counting. CoRR abs/2303.08450 (2023) - [i85]Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation. CoRR abs/2303.15932 (2023) - [i84]Hao An, Dongsheng Chen, Weiyuan Xu, Zhihong Zhu, Yuexian Zou:
TLAG: An Informative Trigger and Label-Aware Knowledge Guided Model for Dialogue-based Relation Extraction. CoRR abs/2303.17119 (2023) - [i83]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. CoRR abs/2303.17395 (2023) - [i82]Dongchao Yang, Songxiang Liu, Rongjie Huang, Jinchuan Tian, Chao Weng, Yuexian Zou:
HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec. CoRR abs/2305.02765 (2023) - [i81]Bang Yang, Asif Raza, Yuexian Zou, Tong Zhang:
Customizing General-Purpose Foundation Models for Medical Report Generation. CoRR abs/2306.05642 (2023) - [i80]Bang Yang, Fenglin Liu, Zheng Li, Qingyu Yin, Chenyu You, Bing Yin, Yuexian Zou:
Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels. CoRR abs/2307.01969 (2023) - [i79]Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou:
G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory. CoRR abs/2307.14277 (2023) - [i78]Yifei Xin, Yuexian Zou:
Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions. CoRR abs/2307.15344 (2023) - [i77]Bang Yang, Fenglin Liu, Xian Wu, Yaowei Wang, Xu Sun, Yuexian Zou:
MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning. CoRR abs/2308.13218 (2023) - [i76]Wen Wang, Dongchao Yang, Qichen Ye, Bowen Cao, Yuexian Zou:
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement. CoRR abs/2309.01212 (2023) - [i75]Ji Jiang, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou:
Video Referring Expression Comprehension via Transformer with Content-conditioned Query. CoRR abs/2310.16402 (2023) - [i74]Chris Kelly, Luhui Hu, Cindy Yang, Yu Tian, Deshun Yang, Bang Yang, Zaoshan Huang, Zihao Li, Yuexian Zou:
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework. CoRR abs/2311.10125 (2023) - [i73]Xuxin Cheng, Bowen Cao, Qichen Ye, Zhihong Zhu, Hongxiang Li, Yuexian Zou:
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding. CoRR abs/2311.11375 (2023) - [i72]Yongkang Yin, Xu Li, Ying Shan, Yuexian Zou:
AFL-Net: Integrating Audio, Facial, and Lip Modalities with Cross-Attention for Robust Speaker Diarization in the Wild. CoRR abs/2312.05730 (2023) - 2022
- [j33]Shanhao Li, Bang Yang, Yuexian Zou:
Adaptive Curriculum Learning for Video Captioning. IEEE Access 10: 31751-31759 (2022) - [j32]Fenglin Liu, Xian Wu, Chenyu You, Shen Ge, Yuexian Zou, Xu Sun:
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9255-9268 (2022) - [j31]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model. IEEE Signal Process. Lett. 29: 812-816 (2022) - [j30]Meng Cao, Can Zhang, Dongming Yang, Yuexian Zou:
All You Need Is a Second Look: Towards Arbitrary-Shaped Text Detection. IEEE Trans. Circuits Syst. Video Technol. 32(2): 758-767 (2022) - [j29]Dongming Yang, Yuexian Zou, Can Zhang, Meng Cao, Jie Chen:
RR-Net: Relation Reasoning for End-to-End Human-Object Interaction Detection. IEEE Trans. Circuits Syst. Video Technol. 32(6): 3853-3865 (2022) - [j28]Meng Cao, Can Zhang, Long Chen, Mike Zheng Shou, Yuexian Zou:
Deep Motion Prior for Weakly-Supervised Temporal Action Localization. IEEE Trans. Image Process. 31: 5203-5213 (2022) - [j27]Fenglin Liu, Xian Wu, Shen Ge, Xuancheng Ren, Wei Fan, Xu Sun, Yuexian Zou:
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention. ACM Trans. Knowl. Discov. Data 16(1): 1:1-1:19 (2022) - [c160]Lisung Chen, Nuo Chen, Yuexian Zou, Yong Wang, Xinzhong Sun:
A Transformer-based Threshold-Free Framework for Multi-Intent NLU. COLING 2022: 7187-7192 - [c159]Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang, Yuexian Zou:
Unsupervised Pre-training for Temporal Action Localization Tasks. CVPR 2022: 14011-14021 - [c158]Helin Wang, Dongchao Yang, Yuexian Zou, Fan Cui, Yujun Wang:
Detect What You Want: Target Sound Detection. DCASE 2022 - [c157]Dongchao Yang, Helin Wang, Wenwu Wang, Yuexian Zou:
A Mixed Supervised Learning Framework For Target Sound Detection. DCASE 2022 - [c156]Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou:
LocVTP: Video-Text Pre-training for Temporal Localization. ECCV (26) 2022: 38-56 - [c155]Puzhao Ji, Meng Cao, Yuexian Zou:
Visual Relation-Aware Unsupervised Video Captioning. ICANN (3) 2022: 495-507 - [c154]Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual Learning Framework for Few-Shot Sound Event Detection. ICASSP 2022: 811-815 - [c153]Xinmeng Xu, Rongzhi Gu, Yuexian Zou:
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention. ICASSP 2022: 6492-6496 - [c152]Dongsheng Chen, Zhiqi Huang, Yuexian Zou:
Leveraging Bilinear Attention to Improve Spoken Language Understanding. ICASSP 2022: 7142-7146 - [c151]Li Wang, Rongzhi Gu, Weiji Zhuang, Peng Gao, Yujun Wang, Yuexian Zou:
Learning Decoupling Features Through Orthogonality Regularization. ICASSP 2022: 7562-7566 - [c150]Lisong Chen, Peilin Zhou, Yuexian Zou:
Joint Multiple Intent Detection and Slot Filling Via Self-Distillation. ICASSP 2022: 7612-7616 - [c149]Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou:
Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI. ICASSP 2022: 7782-7786 - [c148]Dongsheng Chen, Zhiqi Huang, Xian Wu, Shen Ge, Yuexian Zou:
Towards Joint Intent Detection and Slot Filling via Higher-order Attention. IJCAI 2022: 4072-4078 - [c147]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. INTERSPEECH 2022: 1511-1515 - [c146]Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. INTERSPEECH 2022: 1526-1530 - [c145]Yifei Xin, Dongchao Yang, Yuexian Zou:
Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification. INTERSPEECH 2022: 1546-1550 - [c144]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. INTERSPEECH 2022: 3178-3182 - [c143]Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. INTERSPEECH 2022: 5318-5322 - [c142]Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. INTERSPEECH 2022: 5333-5337 - [c141]Meng Cao, Ji Jiang, Long Chen, Yuexian Zou:
Correspondence Matters for Video Referring Expression Comprehension. ACM Multimedia 2022: 4967-4976 - [c140]Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou:
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model. NAACL-HLT (Findings) 2022: 1219-1232 - [c139]Puzhao Ji, Bang Yang, Tong Zhang, Yuexian Zou:
Consensus-Guided Keyword Targeting for Video Captioning. PRCV (3) 2022: 270-281 - [c138]Bang Yang, Tong Zhang, Yuexian Zou:
CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter. PRCV (1) 2022: 368-381 - [i71]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model. CoRR abs/2201.01995 (2022) - [i70]Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang, Yuexian Zou:
Unsupervised Pre-training for Temporal Action Localization Tasks. CoRR abs/2203.13609 (2022) - [i69]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Integrate Lattice-Free MMI into End-to-End Speech Recognition. CoRR abs/2203.15614 (2022) - [i68]Liyu Wu, Can Zhang, Yuexian Zou:
SpatioTemporal Focus for Skeleton-based Action Recognition. CoRR abs/2203.16767 (2022) - [i67]Li Wang, Rongzhi Gu, Weiji Zhuang, Peng Gao, Yujun Wang, Yuexian Zou:
Learning Decoupling Features Through Orthogonality Regularization. CoRR abs/2203.16772 (2022) - [i66]Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. CoRR abs/2204.00821 (2022) - [i65]Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. CoRR abs/2204.01355 (2022) - [i64]Dongchao Yang, Helin Wang, Yuexian Zou, Wenwu Wang:
A Two-student Learning Framework for Mixed Supervised Target Sound Detection. CoRR abs/2204.02088 (2022) - [i63]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. CoRR abs/2204.02143 (2022) - [i62]Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. CoRR abs/2204.07375 (2022) - [i61]Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou:
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model. CoRR abs/2204.14272 (2022) - [i60]Xinmeng Xu, Rongzhi Gu, Yuexian Zou:
Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention. CoRR abs/2205.01280 (2022) - [i59]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Chao Weng, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. CoRR abs/2206.02093 (2022) - [i58]Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-sound Generation. CoRR abs/2207.09983 (2022) - [i57]Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou:
LocVTP: Video-Text Pre-training for Temporal Localization. CoRR abs/2207.10362 (2022) - [i56]Meng Cao, Ji Jiang, Long Chen, Yuexian Zou:
Correspondence Matters for Video Referring Expression Comprehension. CoRR abs/2207.10400 (2022) - [i55]Ji Jiang, Meng Cao, Tengtao Song, Yuexian Zou:
Video Referring Expression Comprehension via Transformer with Content-aware Query. CoRR abs/2210.02953 (2022) - [i54]Fenglin Liu, Xuewei Ma, Xuancheng Ren, Xian Wu, Wei Fan, Yuexian Zou, Xu Sun:
Prophet Attention: Predicting Attention with Future Attention for Improved Image Captioning. CoRR abs/2210.10914 (2022) - [i53]Fenglin Liu, Xian Wu, Shen Ge, Xuancheng Ren, Wei Fan, Xu Sun, Yuexian Zou:
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention. CoRR abs/2210.16431 (2022) - [i52]Dongchao Yang, Songxiang Liu, Jianwei Yu, Helin Wang, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. CoRR abs/2211.02448 (2022) - [i51]Zhihong Zhu, Weiyuan Xu, Xuxin Cheng, Tengtao Song, Yuexian Zou:
A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding. CoRR abs/2211.04023 (2022) - [i50]Fenglin Liu, Xian Wu, Chenyu You, Shen Ge, Yuexian Zou, Xu Sun:
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning. CoRR abs/2211.12148 (2022) - [i49]Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou:
M3ST: Mix at Three Levels for Speech Translation. CoRR abs/2212.03657 (2022) - [i48]Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation. CoRR abs/2212.08348 (2022) - 2021
- [j26]Yan Huang, Zihan Zhou, Xiangyu Sai, Yong Xu, Yuexian Zou:
Hierarchical hashing-based multi-source image retrieval method for image denoising. Appl. Soft Comput. 113(Part): 108028 (2021) - [j25]Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li:
GID-Net: Detecting human-object interaction with global and instance dependency. Neurocomputing 444: 366-377 (2021) - [j24]Can Zhang, Meng Cao, Dongming Yang, Ji Jiang, Yuexian Zou:
Synergic learning for noise-insensitive webly-supervised temporal action localization. Image Vis. Comput. 113: 104247 (2021) - [j23]Can Zhang, Yuexian Zou, Guang Chen, Lei Gan:
EAR: Efficient action recognition with local-global temporal aggregation. Image Vis. Comput. 116: 104329 (2021) - [j22]Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain. IEEE Signal Process. Lett. 28: 1370-1374 (2021) - [j21]Dongming Yang, Yuexian Zou, Zhu Li, Ge Li:
Learning Human-Object Interaction via Interactive Semantic Reasoning. IEEE Trans. Image Process. 30: 9294-9305 (2021) - [j20]Guang Chen, Can Zhang, Yuexian Zou:
AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection. IEEE Trans. Multim. 23: 2672-2682 (2021) - [c137]Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang:
Non-Autoregressive Coarse-to-Fine Video Captioning. AAAI 2021: 3119-3127 - [c136]Zhiqi Huang, Fenglin Liu, Xian Wu, Shen Ge, Helin Wang, Wei Fan, Yuexian Zou:
Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention. AAAI 2021: 13098-13106 - [c135]Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou:
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation. CVPR 2021: 13753-13762 - [c134]Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou:
CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning. CVPR 2021: 16010-16019 - [c133]Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou:
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information. DCASE 2021: 40-44 - [c132]Chenyu You, Nuo Chen, Yuexian Zou:
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering. EMNLP (Findings) 2021: 28-39 - [c131]Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou:
On Pursuit of Designing Multi-modal Transformer for Video Grounding. EMNLP (1) 2021: 9810-9823 - [c130]Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-Local Attention Framework for Weakly Labelled Audio Tagging. ICASSP 2021: 351-355 - [c129]Cong Wang, Yan Huang, Yuexian Zou, Yong Xu:
FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing Via Atmospheric Light Estimation. ICASSP 2021: 2040-2044 - [c128]Liyu Wu, Yuexian Zou, Can Zhang:
Long-Short Temporal Modeling for Efficient Action Recognition. ICASSP 2021: 2435-2439 - [c127]Ranyu Ning, Can Zhang, Yuexian Zou:
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection. ICASSP 2021: 2460-2464 - [c126]Haoran Zhang, Yuexian Zou, Helin Wang:
Contrastive Self-Supervised Learning for Text-Independent Speaker Verification. ICASSP 2021: 6713-6717 - [c125]Zhiqi Huang, Fenglin Liu, Peilin Zhou, Yuexian Zou:
Sentiment Injected Iteratively Co-Interactive Network for Spoken Language Understanding. ICASSP 2021: 7488-7492 - [c124]Chenyu You, Nuo Chen, Yuexian Zou:
Knowledge Distillation for Improved Accuracy in Spoken Question Answering. ICASSP 2021: 7793-7797 - [c123]Nuo Chen, Fenglin Liu, Chenyu You, Peilin Zhou, Yuexian Zou:
Adaptive Bi-Directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension. ICASSP 2021: 7833-7837 - [c122]Dongming Yang, Yuexian Zou, Can Zhang, Meng Cao, Jie Chen:
RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection. IJCAI 2021: 1224-1230 - [c121]Chenyu You, Nuo Chen, Yuexian Zou:
MRD-Net: Multi-Modal Residual Knowledge Distillation for Spoken Question Answering. IJCAI 2021: 3985-3991 - [c120]Nuo Chen, Chenyu You, Yuexian Zou:
Self-Supervised Dialogue Learning for Spoken Conversational Question Answering. Interspeech 2021: 231-235 - [c119]Weiyuan Xu, Peilin Zhou, Chenyu You, Yuexian Zou:
Semantic Transportation Prototypical Network for Few-Shot Intent Detection. Interspeech 2021: 251-255 - [c118]Helin Wang, Yuexian Zou, Wenwu Wang:
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification. Interspeech 2021: 551-555 - [c117]Dongchao Yang, Helin Wang, Yuexian Zou:
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification. Interspeech 2021: 1159-1163 - [c116]Chenyu You, Nuo Chen, Yuexian Zou:
Contextualized Attention-Based Knowledge Transfer for Spoken Conversational Question Answering. Interspeech 2021: 3211-3215 - [c115]Li Wang, Rongzhi Gu, Nuo Chen, Yuexian Zou:
Text Anchor Based Metric Learning for Small-Footprint Keyword Spotting. Interspeech 2021: 4219-4223 - [i47]Cong Wang, Yan Huang, Yuexian Zou, Yong Xu:
FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing via Atmospheric Light Estimation. CoRR abs/2101.08465 (2021) - [i46]Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-local Attention Framework for Weakly Labelled Audio Tagging. CoRR abs/2102.01931 (2021) - [i45]Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou:
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning. CoRR abs/2103.16392 (2021) - [i44]Helin Wang, Yuexian Zou, Wenwu Wang:
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification. CoRR abs/2103.16858 (2021) - [i43]Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu:
Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain. CoRR abs/2104.12359 (2021) - [i42]Dongming Yang, Yuexian Zou, Can Zhang, Meng Cao, Jie Chen:
RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection. CoRR abs/2104.15015 (2021) - [i41]Jinchuan Tian, Rongzhi Gu, Helin Wang, Yuexian Zou:
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency. CoRR abs/2105.00812 (2021) - [i40]Fenglin Liu, Xuancheng Ren, Zhiyuan Zhang, Xu Sun, Yuexian Zou:
Rethinking Skip Connection with Layer Normalization in Transformers and ResNets. CoRR abs/2105.07205 (2021) - [i39]Dongchao Yang, Helin Wang, Yuexian Zou:
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification. CoRR abs/2105.10340 (2021) - [i38]Nuo Chen, Chenyu You, Yuexian Zou:
Self-supervised Dialogue Learning for Spoken Conversational Question Answering. CoRR abs/2106.02182 (2021) - [i37]Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou:
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation. CoRR abs/2106.06963 (2021) - [i36]Fenglin Liu, Meng Gao, Tianhao Zhang, Yuexian Zou:
Exploring Semantic Relationships for Unpaired Image Captioning. CoRR abs/2106.10658 (2021) - [i35]Meng Cao, Can Zhang, Dongming Yang, Yuexian Zou:
All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection. CoRR abs/2106.12720 (2021) - [i34]Ranyu Ning, Can Zhang, Yuexian Zou:
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection. CoRR abs/2106.15258 (2021) - [i33]Liyu Wu, Yuexian Zou, Can Zhang:
Long-Short Temporal Modeling for Efficient Action Recognition. CoRR abs/2106.15787 (2021) - [i32]Zhiqi Huang, Fenglin Liu, Xian Wu, Shen Ge, Helin Wang, Wei Fan, Yuexian Zou:
Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model. CoRR abs/2107.01571 (2021) - [i31]Li Wang, Rongzhi Gu, Nuo Chen, Yuexian Zou:
Text Anchor Based Metric Learning for Small-footprint Keyword Spotting. CoRR abs/2108.05516 (2021) - [i30]Meng Cao, Can Zhang, Long Chen, Mike Zheng Shou, Yuexian Zou:
Deep Motion Prior for Weakly-Supervised Temporal Action Localization. CoRR abs/2108.05607 (2021) - [i29]Lisong Chen, Peilin Zhou, Yuexian Zou:
Joint Multiple Intent Detection and Slot Filling via Self-distillation. CoRR abs/2108.08042 (2021) - [i28]Cong Wang, Yan Huang, Yuexian Zou, Yong Xu:
Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing. CoRR abs/2108.11292 (2021) - [i27]Dongsheng Chen, Zhiqi Huang, Yuexian Zou:
HAN: Higher-order Attention Network for Spoken Language Understanding. CoRR abs/2108.11916 (2021) - [i26]Chenyu You, Nuo Chen, Yuexian Zou:
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering. CoRR abs/2109.03381 (2021) - [i25]Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou:
On Pursuit of Designing Multi-modal Transformer for Video Grounding. CoRR abs/2109.06085 (2021) - [i24]Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual learning framework for Few-shot Sound Event Detection. CoRR abs/2110.04474 (2021) - [i23]Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou:
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information. CoRR abs/2110.06100 (2021) - [i22]Bang Yang, Yuexian Zou:
CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning. CoRR abs/2111.15162 (2021) - [i21]Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou:
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI. CoRR abs/2112.02498 (2021) - [i20]Dongchao Yang, Helin Wang, Yuexian Zou, Chao Weng:
Detect what you want: Target Sound Detection. CoRR abs/2112.10153 (2021) - 2020
- [j19]Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-Modal Multi-Channel Target Speech Separation. IEEE J. Sel. Top. Signal Process. 14(3): 530-541 (2020) - [j18]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Modeling Label Dependencies for Audio Tagging With Graph Convolutional Network. IEEE Signal Process. Lett. 27: 1560-1564 (2020) - [c114]Fenglin Liu, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou:
Federated Learning for Vision-and-Language Grounding Problems. AAAI 2020: 11572-11579 - [c113]Junyi Peng, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Context-adaptive Gaussian Attention for Text-independent Speaker Verification. APSIPA 2020: 595-599 - [c112]Zhiqi Huang, Fenglin Liu, Yuexian Zou:
Federated Learning for Spoken Language Understanding. COLING 2020: 3467-3478 - [c111]Fenglin Liu, Xuancheng Ren, Zhiyuan Zhang, Xu Sun, Yuexian Zou:
Rethinking Skip Connection with Layer Normalization. COLING 2020: 3586-3598 - [c110]Helin Wang, Yuexian Zou, DaDing Chong:
Acoustic Scene Classification with Spectrogram Processing Strategies. DCASE 2020: 210-214 - [c109]Sixin Hong, Yuexian Zou, Wenwu Wang, Meng Cao:
Weakly Labelled Audio Tagging Via Convolutional Networks with Spatial and Channel-Wise Attention. ICASSP 2020: 296-300 - [c108]Meng Cao, Yuexian Zou:
All You Need is a Second Look: Towards Tighter Arbitrary Shape Text Detection. ICASSP 2020: 2228-2232 - [c107]Junling Liu, Yuexian Zou, Dongming Yang:
Semanticgan: Generative Adversarial Networks For Semantic Image To Photo-Realistic Image Translation. ICASSP 2020: 2528-2532 - [c106]Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning. ICASSP 2020: 7319-7323 - [c105]Cong Wang, Yuexian Zou, Zehan Chen:
ABC-NET: Avoiding Blocking Effect & Color Shift Network for Single Image Dehazing Via Restraining Transmission Bias. ICIP 2020: 1053-1057 - [c104]Bang Yang, Yuexian Zou:
Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning. ICPR 2020: 188-195 - [c103]Peilin Zhou, Zhiqi Huang, Fenglin Liu, Yuexian Zou:
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding. ICPR 2020: 2950-2957 - [c102]Dongming Yang, Yuexian Zou:
A Graph-based Interactive Reasoning for Human-Object Interaction Detection. IJCAI 2020: 1111-1117 - [c101]Sixin Hong, Yuexian Zou, Wenwu Wang:
Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging. INTERSPEECH 2020: 816-820 - [c100]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Environmental Sound Classification with Parallel Temporal-Spectral Attention. INTERSPEECH 2020: 821-825 - [c99]Junyi Peng, Rongzhi Gu, Yuexian Zou:
Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification. INTERSPEECH 2020: 3246-3250 - [c98]Ziming Wang, Yuexian Zou, Zeming Zhang:
Cluster Attention Contrast for Video Anomaly Detection. ACM Multimedia 2020: 2463-2471 - [c97]Fenglin Liu, Xian Wu, Shen Ge, Xiaoyu Zhang, Wei Fan, Yuexian Zou:
Bridging the Gap between Vision and Language Domains for Improved Image Captioning. ACM Multimedia 2020: 4153-4161 - [c96]Fenglin Liu, Xuancheng Ren, Xian Wu, Shen Ge, Wei Fan, Yuexian Zou, Xu Sun:
Prophet Attention: Predicting Attention with Future Attention. NeurIPS 2020 - [i19]Rongzhi Gu, Yuexian Zou:
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation. CoRR abs/2001.00391 (2020) - [i18]Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning. CoRR abs/2003.03927 (2020) - [i17]Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li:
GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency. CoRR abs/2003.05242 (2020) - [i16]Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-modal Multi-channel Target Speech Separation. CoRR abs/2003.07032 (2020) - [i15]Meng Cao, Yuexian Zou:
All you need is a second look: Towards Tighter Arbitrary shape text detection. CoRR abs/2004.12436 (2020) - [i14]Helin Wang, Yuexian Zou, Dading Chong:
Acoustic Scene Classification with Spectrogram Processing Strategies. CoRR abs/2007.03781 (2020) - [i13]Dongming Yang, Yuexian Zou:
A Graph-based Interactive Reasoning for Human-Object Interaction Detection. CoRR abs/2007.06925 (2020) - [i12]Can Zhang, Yuexian Zou, Guang Chen, Lei Gan:
PAN: Towards Fast Action Recognition via Learning Persistence of Appearance. CoRR abs/2008.03462 (2020) - [i11]Peilin Zhou, Zhiqi Huang, Fenglin Liu, Yuexian Zou:
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding. CoRR abs/2009.13431 (2020) - [i10]Chenyu You, Nuo Chen, Fenglin Liu, Dongchao Yang, Yuexian Zou:
Towards Data Distillation for End-to-end Spoken Conversational Question Answering. CoRR abs/2010.08923 (2020) - [i9]Chenyu You, Nuo Chen, Yuexian Zou:
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering. CoRR abs/2010.11066 (2020) - [i8]Chenyu You, Nuo Chen, Yuexian Zou:
Knowledge Distillation for Improved Accuracy in Spoken Question Answering. CoRR abs/2010.11067 (2020) - [i7]Nuo Chen, Fenglin Liu, Chenyu You, Peilin Zhou, Yuexian Zou:
Adaptive Bi-directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension. CoRR abs/2012.10877 (2020)
2010 – 2019
- 2019
- [j17]Meng Cao, Yuexian Zou, Dongming Yang, Chao Liu:
GISCA: Gradient-Inductive Segmentation Network With Contextual Attention for Scene Text Detection. IEEE Access 7: 62805-62816 (2019) - [j16]Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li:
C-RPNs: Promoting object detection in real world via a cascade structure of Region Proposal Networks. Neurocomputing 367: 20-30 (2019) - [j15]Sujuan Liu, Ning Lyu, Jiashuai Cui, Yuexian Zou:
Improved Blind Timing Skew Estimation Based on Spectrum Sparsity and ApFFT in Time-Interleaved ADCs. IEEE Trans. Instrum. Meas. 68(1): 73-86 (2019) - [c95]Wan Ding, Dong-Yan Huang, Danqing Luo, Yuexian Zou:
Speech Emotion Recognition using Spectral Normalized CycleGAN. ACII Workshops 2019: 93-99 - [c94]Helin Wang, Dading Chong, Dongyan Huang, Yuexian Zou:
What Affects the Performance of Convolutional Neural Networks for Audio Event Classification. ACII Workshops 2019: 140-146 - [c93]Junyi Peng, Rongzhi Gu, Yuexian Zou, Wenwu Wang:
Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification. APSIPA 2019: 314-319 - [c92]Rongzhi Gu, Junyi Peng, Yuexian Zou, Dong Yu:
Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation. APSIPA 2019: 325-331 - [c91]Zhaoyi Liu, Yuexian Zou:
Teacher-Student BLSTM Mask Model for Robust Acoustic Beamforming. APSIPA 2019: 638-643 - [c90]Junyi Peng, Yuexian Zou, Na Li, Deyi Tuo, Dan Su, Meng Yu, Chunlei Zhang, Dong Yu:
Syllable-Dependent Discriminative Learning for Small Footprint Text-Dependent Speaker Verification. ASRU 2019: 350-357 - [c89]Junyi Peng, Rongzhi Gu, Yuexian Zou:
Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification. ASRU 2019: 704-709 - [c88]Yuying Zhang, Yuexian Zou, Junyi Peng, Danqing Luo, Dongyan Huang:
Discriminative Feature Learning for Speech Emotion Recognition. ICANN (4) 2019: 198-210 - [c87]Lei Gan, Yuexian Zou, Can Zhang:
Discriminative Feature Learning Using Two-Stage Training Strategy for Facial Expression Recognition. ICANN (3) 2019: 397-408 - [c86]Yuexian Zou, Yi Wang, Wenjie Guan, Wenwu Wang:
Semantic Super-resolution for Extremely Low-resolution Vehicle License Plate. ICASSP 2019: 3772-3776 - [c85]Wenjie Guan, Xiaoqun Zhou, Ge Li, Yuexian Zou:
Selecting Optimal Proposal Number for Image-based Object Detection. ICASSP 2019: 3797-3801 - [c84]Yang Liu, Qinghua Hu, Yuexian Zou, Wenwu Wang:
Labelled Non-zero Particle Flow for SMC-PHD Filtering. ICASSP 2019: 5197-5201 - [c83]Fenglin Liu, Meng Gao, Tianhao Zhang, Yuexian Zou:
Exploring Semantic Relationships for Image Captioning without Parallel Data. ICDM 2019: 439-448 - [c82]Dongming Yang, Yuexian Zou:
Cascade Region Proposal Networks for Object Detection in the Wild. ICME 2019: 1744-1749 - [c81]Rongzhi Gu, Lianwu Chen, Shi-Xiong Zhang, Jimeng Zheng, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. INTERSPEECH 2019: 4290-4294 - [c80]Can Zhang, Yuexian Zou, Guang Chen, Lei Gan:
PAN: Persistent Appearance Network with an Efficient Motion Cue for Fast Action Recognition. ACM Multimedia 2019: 500-509 - [c79]Zhaoyi Liu, Yuexian Zou:
IKDMM: Iterative Knowledge Distillation Mask Model for Robust Acoustic Beamforming. MMAsia 2019: 53:1-53:6 - [c78]Guang Chen, Yuexian Zou, Can Zhang:
STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection. MMM (1) 2019: 29-41 - [c77]Dading Chong, Yuexian Zou, Wenwu Wang:
Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification. MMM (2) 2019: 157-168 - [c76]Chaohao Lu, Yuexian Zou:
Using Coarse Label Constraint for Fine-Grained Visual Classification. MMM (2) 2019: 266-277 - [c75]Can Zhang, Yuexian Zou, Guang Chen:
Hierarchical Temporal Pooling for Efficient Online Action Recognition. MMM (1) 2019: 471-482 - [c74]Chao Liu, Yuexian Zou, Dongming Yang:
Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention. MMM (1) 2019: 531-542 - [c73]Luwen Pu, Yuexian Zou, Jian Zhang, Shilei Huang, Lin Yao:
Using Dependency Information to Enhance Attention Mechanism for Aspect-Based Sentiment Analysis. NLPCC (1) 2019: 672-684 - [c72]Zirui Li, Yuexian Zou, Guoshuai Wang, Jian Zhang:
Scale-Informed Density Estimation for Dense Crowd Counting. VCIP 2019: 1-4 - [i6]Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
End-to-End Multi-Channel Speech Separation. CoRR abs/1905.06286 (2019) - [i5]Dongming Yang, Yuexian Zou, Jian Zhang, Ge Li:
C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks. CoRR abs/1908.06665 (2019) - [i4]Bang Yang, Fenglin Liu, Yuexian Zou:
Non-Autoregressive Video Captioning with Iterative Refinement. CoRR abs/1911.12018 (2019) - [i3]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Learning discriminative and robust time-frequency representations for environmental sound classification. CoRR abs/1912.06808 (2019) - 2018
- [j14]Disong Wang, Yuexian Zou, Wenwu Wang:
Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor. J. Frankl. Inst. 355(4): 1692-1709 (2018) - [j13]Yi Wang, Yuexian Zou, Wenwu Wang:
Manifold-Based Visual Object Counting. IEEE Trans. Image Process. 27(7): 3248-3263 (2018) - [c71]Haoyi Yuan, Yang Yuan, H. C. Wu, Yuexian Zou:
Economic Index Forecasting via Multi-scale Recursive Dynamic Factor Analysis. AIMS 2018: 83-91 - [c70]Xiaohu Zhang, Yuexian Zou, Yi Liu:
AICDS: An Infant Crying Detection System Based on Lightweight Convolutional Neural Network. AIMS 2018: 185-196 - [c69]Wenjie Guan, Yuexian Zou, Xiaoqun Zhou:
Multi-Scale Object Detection with Feature Fusion and Region Objectness Network. ICASSP 2018: 2596-2600 - [c68]Zehan Chen, Yi Wang, Yuexian Zou:
Inverse Atmoshperic Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing. ICASSP 2018: 2626-2630 - [c67]Chao Liu, Yuexian Zou, Wenjie Guan:
Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection. DSP 2018: 1-5 - [c66]Xiaohu Zhang, Yuexian Zou:
DCH-Net: Densely Connected Highway Convolution Neural Network for Environmental Sound Classification. DSP 2018: 1-5 - [c65]Xiaohu Zhang, Yuexian Zou, Wenwu Wang:
LD-CNN: A Lightweight Dilated Convolutional Neural Network for Environmental Sound Classification. ICPR 2018: 373-378 - [c64]Danqing Luo, Yuexian Zou, Dongyan Huang:
Investigation on Joint Representation Learning for Robust Feature Extraction in Speech Emotion Recognition. INTERSPEECH 2018: 152-156 - [c63]Disong Wang, Yuexian Zou:
Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with an Acoustic Vector Sensor. INTERSPEECH 2018: 821-825 - [e1]Marco Aiello, Yujiu Yang, Yuexian Zou, Liang-Jie Zhang:
Artificial Intelligence and Mobile Services - AIMS 2018 - 7th International Conference, Held as Part of the Services Conference Federation, SCF 2018, Seattle, WA, USA, June 25-30, 2018, Proceedings. Lecture Notes in Computer Science 10970, Springer 2018, ISBN 978-3-319-94360-2 [contents] - 2017
- [j12]Baoyan Wang, Jian Zhang, Yi Liu, Yuexian Zou:
Density peaks clustering based integrate framework for multi-document summarization. CAAI Trans. Intell. Technol. 2(1): 26-30 (2017) - [c62]Yichi Huang, Yuexian Zou, Yi Liu:
Investigating the Stacked Phonetic Bottleneck Feature for Speaker Verification with Short Voice Commands. ACPR 2017: 706-711 - [c61]Danqing Luo, Yuexian Zou, Dongyan Huang:
Speech emotion recognition via ensembling neural networks. APSIPA 2017: 1351-1355 - [c60]Yuexian Zou, Rongzhi Gu, Disong Wang, Aimin Jiang, Christian H. Ritz:
Learning a robust DOA estimation model with acoustic vector sensor cues. APSIPA 2017: 1688-1691 - [c59]Xiao Song, Yuexian Zou, Shilei Huang, Shaobin Chen, Yi Liu:
Investigating multi-task learning for automatic speech recognition with code-switching between mandarin and english. IALP 2017: 27-30 - [c58]Baoyan Wang, Jian Zhang, Fanggui Ding, Yuexian Zou:
Multi-document news summarization via paragraph embedding and density peak clustering. IALP 2017: 260-263 - [c57]X. L. Huang, Yuexian Zou, Y. Wang:
Example-based Visual Object Counting for complex background with a local low-rank constraint. ICASSP 2017: 1672-1676 - [c56]Yanhan Jin, Yuexian Zou, Christian H. Ritz:
Robust speaker DOA estimation based on the inter-sensor data ratio model and binary mask estimation in the bispectrum domain. ICASSP 2017: 3266-3270 - [c55]Jin Chen, Yi Wang, Zehan Chen, Yuexian Zou:
Sequence-guided siamese neural network for video summarization of unmanned aerial vehicles. DSP 2017: 1-5 - [c54]Yichi Huang, Yuexian Zou:
Enhancing speaker verification with short voice commands via autoencoder and phonetic bottleneck learning. DSP 2017: 1-5 - [c53]Disong Wang, Yuexian Zou, Wei Shi:
A deep convolutional encoder-decoder model for robust speech dereverberation. DSP 2017: 1-5 - [c52]Xiaohu Zhang, Yuexian Zou, Wei Shi:
Dilated convolution neural network with LeakyReLU for environmental sound classification. DSP 2017: 1-5 - [c51]X. Q. Zhou, Yuexian Zou, Y. Wang:
Accurate small object detection via density map aided saliency estimation. ICIP 2017: 425-429 - [c50]Xiao Song, Yi Liu, Daming Yang, Yuexian Zou:
A Multi-task Learning Approach for Mandarin-English Code-Switching Conversational Speech Recognition. ISICA (1) 2017: 102-111 - [c49]Xiao Song, Qiang Cheng, Jingping Xing, Yuexian Zou:
Data-Driven Phone Selection for Language Identification via Bidirectional Long Short-Term Memory Modeling. ISICA (1) 2017: 301-312 - [c48]Baoyan Wang, Yuexian Zou, Jian Zhang, Jun Jiang, Yi Liu:
Multi-document Summarization via LDA and Density Peaks Based Sentence-Level Clustering. ISICA (1) 2017: 313-323 - 2016
- [c47]Zhi-Qiang Xiang, X. L. Huang, Yuexian Zou:
An Effective and Robust Multi-view Vehicle Classification Method Based on Local and Structural Features. BigMM 2016: 68-73 - [c46]Chun Wang, Yuexian Zou, Shihan Liu, Wei Shi, Weiqiao Zheng:
An Efficient Learning Based Smartphone Playback Attack Detection Using GMM Supervector. BigMM 2016: 385-389 - [c45]Disong Wang, Yuexian Zou, Junhong Liu, Yichi Huang:
A robust DBN-vector based speaker verification system under channel mismatch conditions. DSP 2016: 94-98 - [c44]Disong Wang, Xiansheng Guo, Yuexian Zou:
Accurate and robust device-free localization approach via sparse representation in presence of noise and outliers. DSP 2016: 199-203 - [c43]Yi Wang, Yuexian Zou:
Fast visual object counting via example-based density estimation. ICIP 2016: 3653-3657 - [c42]Xiaolin Huang, Yuexian Zou, Yi Wang:
Cost-sensitive sparse linear regression for crowd counting with imbalanced training data. ICME 2016: 1-6 - [c41]Yi Wang, Yuexian Zou, Jin Chen, Xiaolin Huang, Cheng Cai:
Example-based visual object counting with a sparsity constraint. ICME 2016: 1-6 - [c40]Junhong Liu, Yuexian Zou, Yichi Huang:
An effective voiceprint based identity authentication system for Mandarin smartphone users. ICPR 2016: 1077-1082 - [c39]Jin Chen, Yuexian Zou, Yi Wang:
Wireless capsule endoscopy video summarization: A learning approach based on Siamese neural network and support vector machine. ICPR 2016: 1303-1308 - 2015
- [j11]Weiyang Liu, Zhiding Yu, Lijia Lu, Yandong Wen, Hui Li, Yuexian Zou:
KCRC-LCD: Discriminative kernel collaborative representation with locality constrained dictionary for visual categorization. Pattern Recognit. 48(10): 3076-3092 (2015) - [c38]D. S. Xia, Zhi-Qiang Xiang, Yuexian Zou:
Integrating Visual and Textual Features for Web Image Clustering. BigMM 2015: 116-123 - [c37]Junhong Liu, Weiqiao Zheng, Yuexian Zou:
A Robust Acoustic Feature Extraction Approach Based on Stacked Denoising Autoencoder. BigMM 2015: 124-127 - [c36]Shihan Liu, Yuexian Zou, Hongke Ning:
Nonnegative matrix factorization based noise robust speaker verification. ChinaSIP 2015: 35-39 - [c35]Yi Wang, Cheng Cai, Ji Liu, Yuexian Zou:
A parametric modeling approach for wireless capsule endoscopy hazy image restoration. ICASSP 2015: 922-926 - [c34]Yi Wang, Cheng Cai, Yuexian Zou:
Single image super-resolution via adaptive dictionary pair learning for wireless capsule endoscopy image. DSP 2015: 595-599 - [c33]J. Chen, Y. Wang, Yuexian Zou:
An adaptive redundant image elimination for Wireless Capsule Endoscopy review based on temporal correlation and color-texture feature similarity. DSP 2015: 735-739 - [c32]Xiansheng Guo, Lei Chu, Yiming Pi, Yuexian Zou:
Two stages signal strength difference localization algorithm using SDP relaxation. DSP 2015: 957-961 - [c31]Yuexian Zou, Lei Li, Yi Wang, Jiasheng Yu, Yi Li, W. J. Deng:
Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network. DSP 2015: 1274-1278 - [c30]Weiyang Liu, Zhiding Yu, Yandong Wen, Meng Yang, Yuexian Zou:
Multi-kernel collaborative representation for image classification. ICIP 2015: 21-25 - [c29]Weiyang Liu, Zhiding Yu, Meng Yang, Lijia Lu, Yuexian Zou:
Joint kernel dictionary and classifier learning for sparse coding via locality preserving K-SVD. ICME 2015: 1-6 - [c28]Jiasheng Yu, Jin Chen, Z. Q. Xiang, Yuexian Zou:
A hybrid convolutional neural networks with extreme learning machine for WCE image classification. ROBIO 2015: 1822-1827 - 2014
- [c27]Wei Shi, Yuexian Zou, Yi Liu:
Long-term auto-correlation statistics based voice activity detection for strong noisy speech. ChinaSIP 2014: 100-104 - [c26]Tao Ma, Yuexian Zou, Zhiqiang Xiang, Lei Li, Yi Li:
Wireless capsule endoscopy image classification based on vector sparse coding. ChinaSIP 2014: 582-586 - [c25]Weiyang Liu, Yandong Wen, Kai Pan, Hui Li, Yuexian Zou:
A kernel-based l2 norm regularized least square algorithm for vehicle logo recognition. DSP 2014: 631-635 - [c24]Weiyang Liu, Lijia Lu, Hui Li, Wei Wang, Yuexian Zou:
A novel kernel collaborative representation approach for image classification. ICIP 2014: 4241-4245 - [i2]Weiyang Liu, Zhiding Yu, Lijia Lu, Yandong Wen, Hui Li, Yuexian Zou:
KCRC-LCD: Discriminative Kernel Collaborative Representation with Locality Constrained Dictionary for Visual Categorization. CoRR abs/1410.4673 (2014) - 2013
- [j10]Weichao Xu, Yunhe Hou, Y. S. Hung, Yuexian Zou:
A comparative analysis of Spearman's rho and Kendall's tau in normal and contaminated normal models. Signal Process. 93(1): 261-276 (2013) - [c23]Lei Li, Yuexian Zou, Yi Li:
Wireless capsule endoscopy images enhancement based on adaptive anisotropic diffusion. ChinaSIP 2013: 273-277 - [c22]Tao Ma, Yuexian Zou, Qing Ding:
Urban vehicle classification based on linear SVM with efficient vector sparse coding. ICIA 2013: 527-532 - 2012
- [j9]Mengqi Ren, Yuexian Zou:
A Novel Multiple Sparse Source Localization Using Triangular Pyramid Microphone Array. IEEE Signal Process. Lett. 19(2): 83-86 (2012) - [c21]Wei He, Pengfei Wei, Liping Wang, Yuexian Zou:
A novel EMD-based Common Spatial Pattern for motor imagery brain-computer interface. BHI 2012: 216-219 - [c20]Jia Sen Huo, Yuexian Zou, Lei Li:
An advanced WCE video summary using relation matrix rank. BHI 2012: 675-678 - 2011
- [j8]Yuexian Zou, Guangyi Shi, Hang Shi, He Zhao:
Traffic incident classification at intersections based on image sequences by HMM/SVM classifiers. Multim. Tools Appl. 52(1): 133-145 (2011) - [j7]Yuexian Zou, Shang Liang Zhang, Yong Ching Lim, Xiao Chen:
Timing Mismatch Compensation in Time-Interleaved ADCs Based on Multichannel Lagrange Polynomial Interpolation. IEEE Trans. Instrum. Meas. 60(4): 1123-1131 (2011) - [c19]Yuexian Zou, Bo Li, Xiao Chen:
An efficient blind timing skews estimation for time-interleaved analog-to-digital converters. DSP 2011: 1-4 - 2010
- [c18]Yuexian Zou, He Zhao, Hang Shi, Yiyan Wang:
A moving vehicle segmentation method based on clustering of feature points for tracking at urban intersection. APCCAS 2010: 120-123 - [c17]Yuexian Zou, Yili Chen, Yali Zheng:
A stimulus pattern extraction algorithm based on saliency map for a 625-channel retinal prosthesis system. EUSIPCO 2010: 1632-1635 - [i1]Weichao Xu, Yunhe Hou, Y. S. Hung, Yuexian Zou:
Comparison of Spearman's rho and Kendall's tau in Normal and Contaminated Normal Models. CoRR abs/1011.2009 (2010)
2000 – 2009
- 2009
- [j6]Hong Liu, Ze Yu, Hongbin Zha, Yuexian Zou, Lin Zhang:
Robust human tracking based on multi-cue integration and mean-shift. Pattern Recognit. Lett. 30(9): 827-837 (2009) - [j5]Yong Ching Lim, Yuexian Zou, Jun Wei Lee, Shing-Chow Chan:
Time-Interleaved Analog-to-Digital-Converter Compensation Using Multichannel Filters. IEEE Trans. Circuits Syst. I Regul. Pap. 56-I(10): 2234-2247 (2009) - [j4]Shing-Chow Chan, Yuexian Zou, Yi Zhou:
Reply to "Comments on 'A Recursive Least M-Estimate Algorithm for Robust Adaptive Filtering in Impulsive Noise: Fast Algorithm and Convergence Performance Analysis'". IEEE Trans. Signal Process. 57(1): 389 (2009) - [c16]Yiyan Wang, Yuexian Zou, Hang Shi, He Zhao:
Video Image Vehicle Detection System for Signaled Traffic Intersection. HIS (1) 2009: 222-227 - [c15]Yuexian Zou, Guangyi Shi, Hang Shi, Yiyan Wang:
Image Sequences Based Traffic Incident Detection for Signaled Intersections Using HMM. HIS (1) 2009: 257-261 - [c14]Guangyi Shi, Yuexian Zou, Yufeng Jin, Yali Zheng, Wen Jung Li:
Multi-category human motion recognition based on MEMS inertial sensing data. NEMS 2009: 489-493 - [c13]Yuexian Zou, Guangyi Shi, Yufeng Jin, Yali Zheng:
Extraocular image processing for retinal prosthesis based on DSP. NEMS 2009: 563-566 - [c12]Yuexian Zou, Yali Zheng, Yufeng Jin, Guangyi Shi:
Signal modulation schemes comparison in the telemetry unit for retinal prosthesis system. NEMS 2009: 632-636 - [c11]Hong Liu, Haitao Yu, Yuexian Zou, Zhenhua Huo:
A Slope K method for image based localization. ROBIO 2009: 535-538 - [c10]Hong Liu, Xiaodong Duan, Yuexian Zou, Dengke Gao:
Detection of hands-raising gestures using shape and edge features. ROBIO 2009: 1480-1483 - 2008
- [c9]Guangyi Shi, Yuexian Zou, Yufeng Jin, Wen Jung Li:
PCA/ICA-based SVM for fall recognition using MEMS motion sensing data. APCCAS 2008: 69-72 - [c8]Yuexian Zou, Shing-Chow Chan, Wan Bo, Zhao Jing:
Recursive robust variable loading mvdr beamforming in impulsive noise environment. APCCAS 2008: 988-991 - [c7]Guangyi Shi, Yuexian Zou, Yufeng Jin, Xiaole Cui, Wen Jung Li:
Towards HMM based human motion recognition using MEMS inertial sensors. ROBIO 2008: 1762-1766 - 2005
- [j3]Yong Ching Lim, Yuexian Zou, N. Zheng:
A piloted adaptive notch filter. IEEE Trans. Signal Process. 53(4): 1310-1323 (2005) - 2004
- [j2]Shing-Chow Chan, Yuexian Zou:
A recursive least M-estimate algorithm for robust adaptive filtering in impulsive noise: fast algorithm and convergence performance analysis. IEEE Trans. Signal Process. 52(4): 975-991 (2004) - 2003
- [c6]Yong Ching Lim, Yuexian Zou, N. Zheng:
A piloted adaptive notch filter. ICASSP (6) 2003: 193-196 - 2001
- [c5]Yuexian Zou, Shing-Chow Chan:
A Huber recursive least squares adaptive lattice filter for impulse noise suppression. ICASSP 2001: 3769-3772 - [c4]Yuexian Zou, Shing-Chow Chan:
A robust quasi-Newton adaptive filtering algorithm for impulse noise suppression. ISCAS (2) 2001: 677-680 - 2000
- [j1]Yuexian Zou, S. C. Chan, Tung-Sang Ng:
A recursive least M-estimate (RLM) adaptive filter for robust filtering in impulse noise. IEEE Signal Process. Lett. 7(11): 324-326 (2000) - [c3]Yuexian Zou, Shing-Chow Chan, Tung-Sang Ng:
Fast least mean M-estimate algorithms for robust adaptive filtering in impulse noise. EUSIPCO 2000: 1-4
1990 – 1999
- 1999
- [c2]Yuexian Zou, S. C. Chan, Tung-Sang Ng:
A robust M-estimate adaptive filter for impulse noise suppression. ICASSP 1999: 1765-1768 - [c1]Yuexian Zou, Shing-Chow Chan, Tung-Sang Ng:
Transform domain adaptive Volterra filter algorithm based on constrained optimization. ISCAS (3) 1999: 219-222
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 22:29 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint