default search action
Kaisheng Yao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c56]Jianfeng He, Julian Salazar, Kaisheng Yao, Haoqi Li, Jason Cai:
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training. EACL (1) 2024: 2239-2256 - [i14]Elliot L. Epstein, Kaisheng Yao, Jing Li, Xinyi Bai, Hamid Palangi:
MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark. CoRR abs/2409.18216 (2024) - 2023
- [c55]Hwanjun Song, Igor Shalyminov, Hang Su, Siffi Singh, Kaisheng Yao, Saab Mansour:
Enhancing Abstractiveness of Summarization Models through Calibrated Distillation. EMNLP (Findings) 2023: 7026-7036 - [i13]Jianfeng He, Julian Salazar, Kaisheng Yao, Haoqi Li, Jinglun Cai:
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training. CoRR abs/2305.12793 (2023) - [i12]Qingpei Guo, Kaisheng Yao, Wei Chu:
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input. CoRR abs/2306.14182 (2023) - [i11]Hwanjun Song, Igor Shalyminov, Hang Su, Siffi Singh, Kaisheng Yao, Saab Mansour:
Enhancing Abstractiveness of Summarization Models through Calibrated Distillation. CoRR abs/2310.13760 (2023) - 2022
- [c54]Qingpei Guo, Kaisheng Yao, Wei Chu:
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input. ECCV (36) 2022: 330-346 - 2021
- [c53]Yangming Li, Kaisheng Yao:
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines. AAAI 2021: 13306-13314 - [c52]Yangming Li, Kaisheng Yao:
Rewriter-Evaluator Architecture for Neural Machine Translation. ACL/IJCNLP (1) 2021: 5701-5710 - [c51]Zhiming Wang, Furong Xu, Kaisheng Yao, Yuan Cheng, Tao Xiong, Huijia Zhu:
AntVoice Neural Speaker Embedding System for FFSVC 2020. Interspeech 2021: 1069-1073 - [c50]Yangming Li, Lemao Liu, Kaisheng Yao:
Neural Sequence Segmentation as Determining the Leftmost Segments. NAACL-HLT 2021: 1476-1486 - [i10]Yangming Li, Lemao Liu, Kaisheng Yao:
Neural Sequence Segmentation as Determining the Leftmost Segments. CoRR abs/2104.07217 (2021) - 2020
- [c49]Yangming Li, Kaisheng Yao, Libo Qin, Shuang Peng, Yijia Liu, Xiaolong Li:
Span-Based Neural Buffer: Towards Efficient and Effective Utilization of Long-Distance Context for Neural Sequence Models. AAAI 2020: 8277-8284 - [c48]Yangming Li, Kaisheng Yao, Libo Qin, Wanxiang Che, Xiaolong Li, Ting Liu:
Slot-consistent NLG for Task-oriented Dialogue Systems with Iterative Rectification Network. ACL 2020: 97-106 - [c47]Yangming Li, Han Li, Kaisheng Yao, Xiaolong Li:
Handling Rare Entities for Neural Sequence Labeling. ACL 2020: 6441-6451 - [c46]Zhiming Wang, Kaisheng Yao, Xiaolong Li, Shuo Fang:
Multi-Resolution Multi-Head Attention in Deep Speaker Embedding. ICASSP 2020: 6464-6468 - [i9]Yangming Li, Kaisheng Yao:
Rewriter-Evaluator Framework for Neural Machine Translation. CoRR abs/2012.05414 (2020) - [i8]Yangming Li, Kaisheng Yao:
Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines. CoRR abs/2012.14645 (2020)
2010 – 2019
- 2019
- [c45]Zhiming Wang, Kaisheng Yao, Shuo Fang, Xiaolong Li:
Joint Optimization of Classification and Clustering for Deep Speaker Embedding. ASRU 2019: 284-290 - [c44]Ming He, Kaisheng Yao, Peng Yang, Yuan Yao:
Tag2Vec: Tag Embedding for Top-N Recommendation. ICNC-FSKD 2019: 168-175 - 2018
- [j9]Ming He, Kaisheng Yao, Peng Yang, Jiuling Zhang:
基于标签信息特征相似性的协同过滤个性化推荐 (Collaborative Filtering Personalized Recommendation Based on Similarity of Tag Information Feature). 计算机科学 45(6A): 415-422 (2018) - [j8]Ming He, Peng Yang, Kaisheng Yao, Jiuling Zhang:
TEFRCF: 标签熵特征表示的协同过滤个性化推荐算法 (TEFRCF: Collaborative Filtering Personalized Recommendation Algorithm Based on TagEntropy Feature Representation). 计算机科学 45(6A): 465-470 (2018) - [c43]Ming He, Jiuling Zhang, Peng Yang, Kaisheng Yao:
Robust Transfer Learning for Cross-domain Collaborative Filtering Using Multiple Rating Patterns Approximation. WSDM 2018: 225-233 - 2016
- [c42]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway long short-term memory RNNS for distant speech recognition. ICASSP 2016: 5755-5759 - [c41]Changhao Shan, Lei Xie, Kaisheng Yao:
A bi-directional LSTM approach for polyphone disambiguation in Mandarin Chinese. ISCSLP 2016: 1-5 - [c40]Kaituo Xu, Lei Xie, Kaisheng Yao:
Investigating LSTM for punctuation prediction. ISCSLP 2016: 1-5 - [c39]Yangyang Shi, Kaisheng Yao, Hu Chen, Dong Yu, Yi-Cheng Pan, Mei-Yuh Hwang:
Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding. HLT-NAACL 2016: 393-399 - [c38]Trevor Cohn, Cong Duy Vu Hoang, Ekaterina Vymolova, Kaisheng Yao, Chris Dyer, Gholamreza Haffari:
Incorporating Structural Alignment Biases into an Attentional Neural Translation Model. HLT-NAACL 2016: 876-885 - [c37]Yangyang Shi, Kaisheng Yao, Le Tian, Daxin Jiang:
Deep LSTM based Feature Mapping for Query Classification. HLT-NAACL 2016: 1501-1511 - [i7]Trevor Cohn, Cong Duy Vu Hoang, Ekaterina Vymolova, Kaisheng Yao, Chris Dyer, Gholamreza Haffari:
Incorporating Structural Alignment Biases into an Attentional Neural Translation Model. CoRR abs/1601.01085 (2016) - [i6]Kaisheng Yao, Baolin Peng, Geoffrey Zweig, Kam-Fai Wong:
An Attentional Neural Conversation Model with Improved Specificity. CoRR abs/1606.01292 (2016) - 2015
- [j7]Dong Yu, Kaisheng Yao, Yu Zhang:
The Computational Network Toolkit [Best of the Web]. IEEE Signal Process. Mag. 32(6): 123-126 (2015) - [j6]Grégoire Mesnil, Yann N. Dauphin, Kaisheng Yao, Yoshua Bengio, Li Deng, Dilek Hakkani-Tür, Xiaodong He, Larry P. Heck, Gökhan Tür, Dong Yu, Geoffrey Zweig:
Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding. IEEE ACM Trans. Audio Speech Lang. Process. 23(3): 530-539 (2015) - [c36]Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang:
Semi-supervised slot tagging in spoken language understanding using recurrent transductive support vector machines. ASRU 2015: 353-360 - [c35]Yujia Li, Kaisheng Yao, Geoffrey Zweig:
Feedback-based handwriting recognition from inertial sensor data for wearable devices. ICASSP 2015: 2269-2273 - [c34]Shi-Xiong Zhang, Chaojun Liu, Kaisheng Yao, Yifan Gong:
Deep neural support vector machines for speech recognition. ICASSP 2015: 4275-4279 - [c33]Kaustubh Kalgaonkar, Chaojun Liu, Yifan Gong, Kaisheng Yao:
Estimating confidence scores on ASR results using recurrent neural networks. ICASSP 2015: 4999-5003 - [c32]Yangyang Shi, Kaisheng Yao, Hu Chen, Yi-Cheng Pan, Mei-Yuh Hwang, Baolin Peng:
Contextual spoken language understanding using recurrent neural networks. ICASSP 2015: 5271-5275 - [c31]Yangyang Shi, Yi-Cheng Pan, Mei-Yuh Hwang, Kaisheng Yao, Hu Chen, Yuanhang Zou, Baolin Peng:
A factorization network based method for multi-lingual domain classification. ICASSP 2015: 5276-5280 - [c30]Kshitiz Kumar, Chaojun Liu, Kaisheng Yao, Yifan Gong:
Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation. INTERSPEECH 2015: 1091-1095 - [c29]Kaisheng Yao, Geoffrey Zweig:
Sequence-to-sequence neural net models for grapheme-to-phoneme conversion. INTERSPEECH 2015: 3330-3334 - [c28]Baolin Peng, Kaisheng Yao, Jing Li, Kam-Fai Wong:
Recurrent Neural Networks with External Memory for Spoken Language Understanding. NLPCC 2015: 25-35 - [i5]Baolin Peng, Kaisheng Yao:
Recurrent Neural Networks with External Memory for Language Understanding. CoRR abs/1506.00195 (2015) - [i4]Kaisheng Yao, Geoffrey Zweig:
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion. CoRR abs/1506.00196 (2015) - [i3]Kaisheng Yao, Trevor Cohn, Katerina Vylomova, Kevin Duh, Chris Dyer:
Depth-Gated LSTM. CoRR abs/1508.03790 (2015) - [i2]Kaisheng Yao, Geoffrey Zweig, Baolin Peng:
Attention with Intention for a Neural Network Conversation Model. CoRR abs/1510.08565 (2015) - [i1]Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James R. Glass:
Highway Long Short-Term Memory RNNs for Distant Speech Recognition. CoRR abs/1510.08983 (2015) - 2014
- [j5]Kaisheng Yao, Dong Yu, Li Deng, Yifan Gong:
A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation. Neurocomputing 128: 145-152 (2014) - [c27]Kaisheng Yao, Baolin Peng, Geoffrey Zweig, Dong Yu, Xiaolong Li, Feng Gao:
Recurrent conditional random field for language understanding. ICASSP 2014: 4077-4081 - [c26]Dong Yu, Adam Eversole, Michael L. Seltzer, Kaisheng Yao, Brian Guenter, Oleksii Kuchaiev, Frank Seide, Huaming Wang, Jasha Droppo, Zhiheng Huang, Geoffrey Zweig, Christopher J. Rossbach, Jon Currey:
An introduction to computational networks and the computational network toolkit (invited talk). INTERSPEECH 2014 - [c25]Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, Yangyang Shi:
Spoken language understanding using long short-term memory neural networks. SLT 2014: 189-194 - 2013
- [c24]Dong Yu, Kaisheng Yao, Hang Su, Gang Li, Frank Seide:
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition. ICASSP 2013: 7893-7897 - [c23]Li Deng, Jinyu Li, Jui-Ting Huang, Kaisheng Yao, Dong Yu, Frank Seide, Michael L. Seltzer, Geoffrey Zweig, Xiaodong He, Jason D. Williams, Yifan Gong, Alex Acero:
Recent advances in deep learning for speech research at Microsoft. ICASSP 2013: 8604-8608 - [c22]Yangyang Shi, Mei-Yuh Hwang, Kaisheng Yao, Martha A. Larson:
Speed up of recurrent neural network language models with sentence independent subsampling stochastic gradient descent. INTERSPEECH 2013: 1203-1207 - [c21]Kaisheng Yao, Geoffrey Zweig, Mei-Yuh Hwang, Yangyang Shi, Dong Yu:
Recurrent neural networks for language understanding. INTERSPEECH 2013: 2524-2528 - 2012
- [j4]Daniel Povey, Kaisheng Yao:
A basis representation of constrained MLLR transforms for robust adaptation. Comput. Speech Lang. 26(1): 35-51 (2012) - [c20]Kaisheng Yao, Yifan Gong, Chaojun Liu:
A Feature Space Transformation Method for Personalization using Generalized I-Vector Clustering. INTERSPEECH 2012: 1352-1355 - [c19]Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, Yifan Gong:
Adaptation of context-dependent deep neural networks for automatic speech recognition. SLT 2012: 366-369 - 2011
- [c18]Daniel Povey, Kaisheng Yao:
A basis method for robust estimation of constrained MLLR. ICASSP 2011: 4460-4463
2000 – 2009
- 2007
- [c17]Kaisheng Yao, Lorin Netsch:
An Approach to Low Footprint Pronunciation Models for Embedded Speaker Independent Name Recognition. ICASSP (4) 2007: 965-968 - 2006
- [c16]Kaisheng Yao, Lorin Netsch, Vishu Viswanathan:
Speaker-Independent Name Recognition Using Improved Compensation and Acoustic Modeling Methods for Mobile Applications. ICASSP (1) 2006: 173-176 - 2005
- [j3]Kaisheng Yao, Kuldip K. Paliwal, Te-Won Lee:
Generative factor analyzed HMM for automatic speech recognition. Speech Commun. 45(4): 435-454 (2005) - 2004
- [j2]Kaisheng Yao, Te-Won Lee:
Time-Varying Noise Estimation for Speech Enhancement and Recognition Using Sequential Monte Carlo Method. EURASIP J. Adv. Signal Process. 2004(15): 2366-2384 (2004) - [j1]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition based on sequential noise parameter estimation. Speech Commun. 42(1): 5-23 (2004) - [c15]Te-Won Lee, Kaisheng Yao:
Speech enhancement by perceptual filter with sequential noise parameter estimation. ICASSP (1) 2004: 693-696 - [c14]Hoon-Young Cho, Kaisheng Yao, Te-Won Lee:
Emotion verification for emotion detection and unknown emotion rejection. INTERSPEECH 2004: 1345-1348 - 2003
- [c13]Kaisheng Yao, Erik M. Visser, Oh-Wook Kwon, Te-Won Lee:
A speech processing front-end with eigenspace normalization for robust speech recognition in noisy automobile environments. INTERSPEECH 2003: 9-12 - [c12]Kaisheng Yao, Kuldip K. Paliwal, Te-Won Lee:
Speech recognition with a generative factor analyzed hidden Markov model. INTERSPEECH 2003: 849-852 - [c11]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Model based noisy speech recognition with environment parameters estimated by noise adaptive speech recognition with prior. INTERSPEECH 2003: 1273-1276 - 2002
- [c10]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition in time-varying noise based on sequential kullback proximal algorithm. ICASSP 2002: 189-192 - [c9]Kaisheng Yao, Donglai Zhu, Satoshi Nakamura:
Evaluation of a noise adaptive speech recognition system on the Aurora 3 database. INTERSPEECH 2002: 457-460 - [c8]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database. INTERSPEECH 2002: 2437-2440 - 2001
- [c7]Kaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura:
Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task. INTERSPEECH 2001: 233-236 - [c6]Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Sequential noise compensation by a sequential kullback proximal algorithm. INTERSPEECH 2001: 1139-1142 - [c5]Kaisheng Yao, Satoshi Nakamura:
Sequential Noise Compensation by Sequential Monte Carlo Method. NIPS 2001: 1205-1212 - 2000
- [c4]Kaisheng Yao, Bertram E. Shi, Pascale Fung, Zhigang Cao:
Residual noise compensation for robust speech recognition in nonstationary noise. ICASSP 2000: 1125-1128 - [c3]Bertram E. Shi, Kaisheng Yao, Zhigang Cao:
Soft GPD for minimum classification error rate training. ICASSP 2000: 1253-1256 - [c2]Kaisheng Yao, Bertram E. Shi, Satoshi Nakamura, Zhigang Cao:
Residual noise compensation by a sequential EM algorithm for robust speech recognition in nonstationary noise. INTERSPEECH 2000: 770-773
1990 – 1999
- 1999
- [c1]Kaisheng Yao, Bertram E. Shi, Pascale Fung, Zhigang Cao:
Liftered forward masking procedure for robust digits recognition. EUROSPEECH 1999: 2873-2876
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 21:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint