default search action
Jiaen Liang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c37]Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang:
AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts. CVPR Workshops 2024: 4814-4821 - [c36]Jun Yu, Wangyuan Zhu, Jichao Zhu, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Guochen Xie, Zhihong Wei, Qingsong Liu, Jiaen Liang:
Efficient Feature Extraction and Late Fusion Strategy for Audiovisual Emotional Mimicry Intensity Estimation. CVPR Workshops 2024: 4866-4872 - [c35]Jun Yu, Jichao Zhu, Wangyuan Zhu, Zhongpeng Cai, Gongpeng Zhao, Zhihong Wei, Guochen Xie, Zerui Zhang, Qingsong Liu, Jiaen Liang:
Multi Model Ensemble for Compound Expression Recognition. CVPR Workshops 2024: 4873-4879 - [c34]Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Qingsong Liu, Jiaen Liang:
Exploring Facial Expression Recognition through Semi-Supervised Pre-training and Temporal Modeling. CVPR Workshops 2024: 4880-4887 - [c33]Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu, Shuoping Yang, Yang Zheng, Qingsong Liu, Jiaen Liang:
Improving Valence-Arousal Estimation with Spatiotemporal Relationship Learning and Multimodal Fusion. CVPR Workshops 2024: 7878-7885 - [c32]Jun Yu, Mohan Jing, Guopeng Zhao, Keda Lu, Yifan Wang, Feng Zhao, Jiaqing Sun, Qingsong Liu, Jiaen Liang:
End-to-end Spatio-Temporal Information Aggregation For Micro-Action Detection. ACM Multimedia 2024: 11306-11312 - [c31]Yifan Wang, Xuecheng Wu, Jia Zhang, Mohan Jing, Keda Lu, Jun Yu, Wen Su, Fang Gao, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Building Robust Video-Level Deepfake Detection via Audio-Visual Local-Global Interactions. ACM Multimedia 2024: 11370-11376 - [c30]Jun Yu, Yunxiang Zhang, Zerui Zhang, Zhao Yang, Gongpeng Zhao, Fengzhao Sun, Fanrui Zhang, Qingsong Liu, Jianqing Sun, Jiaen Liang, Yaohui Zhang:
RAG-Guided Large Language Models for Visual Spatial Description with Adaptive Hallucination Corrector. ACM Multimedia 2024: 11407-11413 - [c29]Jun Yu, Gongpeng Zhao, Yaohui Zhang, Peng He, Zerui Zhang, Zhao Yang, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Temporal-Informative Adapters in VideoMAE V2 and Multi-Scale Feature Fusion for Micro-Expression Spotting-then-Recognize. ACM Multimedia 2024: 11484-11489 - [c28]Jun Yu, Yaohui Zhang, Gongpeng Zhao, Peng He, Zerui Zhang, Zhongpeng Cai, Qingsong Liu, Jianqing Sun, Jiaen Liang:
Micro-Expression Spotting Based on Optical Flow Feature with Boundary Calibration. ACM Multimedia 2024: 11490-11496 - 2023
- [j6]Yibo Duan, Yanhua Long, Jiaen Liang:
Dual-model self-regularization and fusion for domain adaptation of robust speaker verification. Speech Commun. 155: 103001 (2023) - [c27]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis. ICASSP 2023: 1-5 - [c26]Jun Yu, Mohan Jing, Weihao Liu, Tongxu Luo, Bingyuan Zhang, Keda Lu, Fangyu Lei, Jianqing Sun, Jiaen Liang:
Answer-Based Entity Extraction and Alignment for Visual Text Question Answering. ACM Multimedia 2023: 9487-9491 - [c25]Jun Yu, Keda Lu, Mohan Jing, Ziqi Liang, Bingyuan Zhang, Jianqing Sun, Jiaen Liang:
Sliding Window Seq2seq Modeling for Engagement Estimation. ACM Multimedia 2023: 9496-9500 - [c24]Jun Yu, Wangyuan Zhu, Jichao Zhu, Xiaxin Shen, Jianqing Sun, Jiaen Liang:
MMT-GD: Multi-Modal Transformer with Graph Distillation for Cross-Cultural Humor Detection. MuSe@ACM Multimedia 2023: 43-49 - [i8]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis. CoRR abs/2305.02269 (2023) - [i7]Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin:
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis. CoRR abs/2306.02593 (2023) - 2022
- [j5]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang, Yuping Wang:
Joint framework with deep feature distillation and adaptive focal loss for weakly supervised audio tagging and acoustic event detection. Digit. Signal Process. 123: 103446 (2022) - [j4]Tiantian Tang, Yanhua Long, Yijie Li, Jiaen Liang:
Acoustic domain mismatch compensation in bird audio detection. Int. J. Speech Technol. 25(1): 251-260 (2022) - [j3]Jiangyu Han, Yan Shi, Yanhua Long, Jiaen Liang:
Exploring single channel speech separation for short-time text-dependent speaker verification. Int. J. Speech Technol. 25(1): 261-268 (2022) - [c23]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. INTERSPEECH 2022: 1496-1500 - [c22]Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin:
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis. ISCSLP 2022: 220-224 - [c21]Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang:
ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis. ISCSLP 2022: 230-234 - [i6]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. CoRR abs/2203.02191 (2022) - [i5]Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang:
ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis. CoRR abs/2203.10473 (2022) - 2021
- [c20]Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang:
CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier. APSIPA ASC 2021: 939-944 - [c19]Jiangyu Han, Wei Rao, Yanhua Long, Jiaen Liang:
Attention-Based Scaling Adaptation for Target Speech Extraction. ASRU 2021: 658-662 - [i4]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss. CoRR abs/2103.12388 (2021) - [i3]Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang:
CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier. CoRR abs/2103.14297 (2021) - 2020
- [j2]Renke He, Yanhua Long, Yijie Li, Jiaen Liang:
Mask-based blind source separation and MVDR beamforming in ASR. Int. J. Speech Technol. 23(1): 133-140 (2020) - [c18]Laipeng He, Qiang Shi, Lang Wu, Jianqing Sun, Renke He, Yanhua Long, Jiaen Liang:
The SHNU System for Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c17]Wentao Wang, Yan Wang, Jianqing Sun, Qingsong Liu, Jiaen Liang, Teng Li:
Speech Driven Talking Head Generation via Attentional Landmarks Based Representation. INTERSPEECH 2020: 1326-1330 - [c16]Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR. INTERSPEECH 2020: 5016-5020 - [i2]Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR. CoRR abs/2006.10407 (2020) - [i1]Jiangyu Han, Yanhua Long, Jiaen Liang:
Attention-based scaling adaptation for target speech extraction. CoRR abs/2010.10923 (2020)
2010 – 2019
- 2019
- [j1]Feng Guo, Yuhang Cao, Zhaoqiong Huang, Xing You, Haixing Guan, Jiaen Liang, Baoqing Li:
Speaker Direction-of-Arrival Estimation Based on Orthogonal Dipoles. Circuits Syst. Signal Process. 38(5): 2320-2334 (2019) - 2018
- [c15]Yanhua Long, Hong Ye, Yijie Li, Jiaen Liang:
Active Learning for LF-MMI Trained Neural Networks in ASR. INTERSPEECH 2018: 2898-2902 - 2017
- [c14]Zhong-Hua Fu, Lei Xie, Peng Li, Jiaen Liang:
Frequency-invariant differential microphone array design in the STFT domain. APSIPA 2017: 1692-1695 - [c13]Feng Guo, Yuhang Cao, Zheng Liu, Jiaen Liang, Baoqing Li, Xiaobing Yuan:
Speaker Direction-of-Arrival Estimation Based on Frequency-Independent Beampattern. INTERSPEECH 2017: 1899-1903 - 2011
- [c12]Hongyan Li, Shen Huang, Shijin Wang, Jiaen Liang, Bo Xu:
Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method. ICASSP 2011: 5668-5671 - 2010
- [c11]Shen Huang, Hongyan Li, Shijin Wang, Jiaen Liang, Bo Xu:
Automatic reference independent evaluation of prosody quality using multiple knowledge fusions. INTERSPEECH 2010: 610-613 - [c10]Shen Huang, Hongyan Li, Shijin Wang, Jiaen Liang, Bo Xu:
Exploring goodness of prosody by diverse matching templates. INTERSPEECH 2010: 1145-1148
2000 – 2009
- 2009
- [c9]Shen Huang, Hongyan Li, Shijin Wang, Jiaen Liang, Bo Xu:
Context Dependent Feature Based Bottom-up Rescoring SVM Classifier in Children's English Stress Mis-pronunciation Detection. ICALT 2009: 236-238 - [c8]Hongyan Li, Jiaen Liang, Shijin Wang, Bo Xu:
An efficient mispronounciation detction method using GLDS-SVM and formant enhanced features. ICASSP 2009: 4845-4848 - [c7]Hongyan Li, Shijin Wang, Jiaen Liang, Shen Huang, Bo Xu:
High performance automatic mispronunciation detection method based on neural network and TRAP features. INTERSPEECH 2009: 1911-1914 - 2008
- [c6]Xiaorui Wang, Shijin Wang, Jiaen Liang, Bo Xu:
Improved phonotactic language identification using random forest language models. ICASSP 2008: 4237-4240 - [c5]Lei Wang, Shen Huang, Shijin Wang, Jiaen Liang, Bo Xu:
Music Genre Classification Based on Multiple Classifier Fusion. ICNC (5) 2008: 580-583 - [c4]Lei Wang, Shen Huang, Sheng Hu, Jiaen Liang, Bo Xu:
Improving searching speed and accuracy of query by humming system based on three methods: feature fusion, candidates set reduction and multiple similarity measurement rescoring. INTERSPEECH 2008: 2024-2027 - 2007
- [c3]Peng Gao, Jiaen Liang, Peng Ding, Bo Xu:
A Novel Phone-State Matrix Based Vocabulary-Indenendent Keyword Spotting Method for Spontaneous Speech. ICASSP (4) 2007: 425-428 - 2006
- [c2]Jiaen Liang, Meng Meng, Xiaorui Wang, Peng Ding, Bo Xu:
An Improved Mandarin Keyword Spotting System Using MCE Training and Context-Enhanced Verification. ICASSP (1) 2006: 1145-1148 - [c1]Meng Meng, Shijin Wang, Jiaen Liang, Peng Ding, Bo Xu:
Full Utilization of Closed-captions in Broadcast News Recognition. ISCSLP 2006
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-11 22:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint