default search action
Xiaodong Cui
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j32]Michael Anokye, Xiaodong Cui, Fanlin Yang, Ping Wang, Yuewen Sun, Hadong Ma, Emmanuel Oduro Amoako:
Optimizing multi-classifier fusion for seabed sediment classification using machine learning. Int. J. Digit. Earth 17(1) (2024) - [j31]Yu Cao, Xiaodong Cui, Mingyi Gan, Yaxue Wang, Fanlin Yang, Yi Huang:
MAL-YOLO: a lightweight algorithm for target detection in side-scan sonar images based on multi-scale feature fusion and attention mechanism. Int. J. Digit. Earth 17(1) (2024) - [j30]Peican Zhu, Botao Wang, Keke Tang, Haifeng Zhang, Xiaodong Cui, Zhen Wang:
A knowledge-guided graph attention network for emotion-cause pair extraction. Knowl. Based Syst. 286: 111342 (2024) - [j29]Zijun Pu, Qunfei Zhang, Yangtao Xue, Peican Zhu, Xiaodong Cui:
A Novel Multi-Feature Fusion Model Based on Pre-Trained Wav2vec 2.0 for Underwater Acoustic Target Recognition. Remote. Sens. 16(13): 2442 (2024) - [j28]Xiaodong Cui, Bingtao Chang, Shuhang Zhang, Jinchen He, Zhiyang Zhi, Wuming Zhang:
Anomaly Detection in Multibeam Bathymetric Point Clouds Integrating Prior Constraints With Geostatistical Prediction. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 17: 17903-17916 (2024) - [j27]Peican Zhu, Zechen Pan, Keke Tang, Xiaodong Cui, Jinhuan Wang, Qi Xuan:
Node Injection Attack Based on Label Propagation Against Graph Neural Network. IEEE Trans. Comput. Soc. Syst. 11(5): 5858-5870 (2024) - [j26]Jiaxin Wan, Zhiliang Qin, Xiaodong Cui, Fanlin Yang, Benjun Ma:
Application of Sample Enhancement Method Combining Superpixel Segmentation and Active Learning in MBES Seafloor Sediment Classification. IEEE Trans. Geosci. Remote. Sens. 62: 1-11 (2024) - [c93]Keke Tang, Wenyu Zhao, Weilong Peng, Xiang Fang, Xiaodong Cui, Peican Zhu, Zhihong Tian:
Reparameterization Head for Efficient Multi-Input Networks. ICASSP 2024: 6190-6194 - [c92]Hui Wan, Hongkang Li, Songtao Lu, Xiaodong Cui, Marina Danilevsky:
How Can Personalized Context Help? Exploring Joint Retrieval of Passage and Personalized Context. ICASSP 2024: 9991-9995 - [c91]A F. M. Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen:
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization. ICASSP 2024: 10931-10935 - [c90]Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen:
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning? ICML 2024 - [i32]A F. M. Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen:
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization. CoRR abs/2401.06980 (2024) - [i31]Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen:
Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis. CoRR abs/2402.15607 (2024) - [i30]Peican Zhu, Zechen Pan, Keke Tang, Xiaodong Cui, Jinhuan Wang, Qi Xuan:
Node Injection Attack Based on Label Propagation Against Graph Neural Network. CoRR abs/2405.18824 (2024) - [i29]Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen:
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis. CoRR abs/2410.02167 (2024) - 2023
- [j25]Jiaheng Hua, Xiaodong Cui, Xianghua Li, Keke Tang, Peican Zhu:
Multimodal fake news detection through data augmentation-based contrastive learning. Appl. Soft Comput. 136: 110125 (2023) - [j24]Jiaxin Wan, Zhiliang Qin, Xiaodong Cui, Muhammad Yasir, Benjun Ma:
Seafloor Habitat Mapping by Combining Multiple Features From Optic and Acoustic Data: A Case Study From Ganquan Island, South China Sea. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 16: 7248-7263 (2023) - [j23]Haiyang Hu, Chengkai Feng, Xiaodong Cui, Kai Zhang, Xianhai Bu, Fanlin Yang:
A Sample Enhancement Method Based on Simple Linear Iterative Clustering Superpixel Segmentation Applied to Multibeam Seabed Classification. IEEE Trans. Geosci. Remote. Sens. 61: 1-15 (2023) - [c89]Han Shen, Songtao Lu, Xiaodong Cui, Tianyi Chen:
Distributed Offline Policy Optimization Over Batch Data. AISTATS 2023: 4443-4472 - [c88]Qi Li, Xingyu Li, Xiaodong Cui, Keke Tang, Peican Zhu:
HEPT Attack: Heuristic Perpendicular Trial for Hard-label Attacks under Limited Query Budgets. CIKM 2023: 4064-4068 - [c87]George Saon, Ankit Gupta, Xiaodong Cui:
Diagonal State Space Augmented Transformers for Speech Recognition. ICASSP 2023: 1-5 - [c86]Yonggui Yan, Jie Chen, Pin-Yu Chen, Xiaodong Cui, Songtao Lu, Yangyang Xu:
Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data. ICML 2023: 39035-39061 - [c85]Jianan Shi, Xiaodong Cui, Zeyu Zhu, Lingling Zhang, Jing Han:
A Transformer-Based OFDM Receiver for Underwater Acoustic Communication. ICSPCC 2023: 1-5 - [c84]Xiaodong Cui, George Saon, Brian Kingsbury:
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition. INTERSPEECH 2023: 1299-1303 - [i28]George Saon, Ankit Gupta, Xiaodong Cui:
Diagonal State Space Augmented Transformers for Speech Recognition. CoRR abs/2302.14120 (2023) - [i27]Hui Wan, Hongkang Li, Songtao Lu, Xiaodong Cui, Marina Danilevsky:
How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context. CoRR abs/2308.13760 (2023) - [i26]Xiaodong Cui, Ashish R. Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury:
Soft Random Sampling: A Theoretical and Empirical Analysis. CoRR abs/2311.12727 (2023) - 2022
- [j22]Jiaxin Wan, Zhiliang Qin, Xiaodong Cui, Fanlin Yang, Muhammad Yasir, Benjun Ma, Xueqin Liu:
MBES Seabed Sediment Classification Based on a Decision Fusion Method Using Deep Learning Model. Remote. Sens. 14(15): 3708 (2022) - [j21]Hui Wang, Haoran Ke, Yizhe Chen, Jinhuo Wang, Fei Yan, Xiaodong Cui:
Promotion of Interface Fusion of Solid Polymer Electrolyte and Cathode by Ultrasonic Vibration. Sensors 22(5): 1814 (2022) - [j20]Xiaodong Cui, Fanlin Yang, Ziyin Wu, Kai Zhang, Miao Fan, Bo Ai:
Deep-Sea Sediment Mixed Pixel Decomposition Based on Multibeam Backscatter Intensity Segmentation. IEEE Trans. Geosci. Remote. Sens. 60: 1-15 (2022) - [c83]Songtao Lu, Xiaodong Cui, Mark S. Squillante, Brian Kingsbury, Lior Horesh:
Decentralized Bilevel Optimization for Personalized Client Learning. ICASSP 2022: 5543-5547 - [c82]Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan:
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization. INTERSPEECH 2022: 2038-2042 - [c81]Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. INTERSPEECH 2022: 2638-2642 - [c80]Songtao Lu, Siliang Zeng, Xiaodong Cui, Mark S. Squillante, Lior Horesh, Brian Kingsbury, Jia Liu, Mingyi Hong:
A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization. NeurIPS 2022 - [c79]Xiaoming Zhang, Fan Zhang, Xiaodong Cui, Wei Zhang:
Speech Emotion Recognition with Complementary Acoustic Representations. SLT 2022: 846-852 - [i25]Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata:
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. CoRR abs/2203.15176 (2022) - [i24]Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan:
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization. CoRR abs/2206.07882 (2022) - 2021
- [j19]Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung:
Asynchronous Decentralized Distributed Training of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3565-3576 (2021) - [c78]Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang:
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation. ACL/IJCNLP (1) 2021: 5399-5411 - [c77]Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang:
Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation. ICASSP 2021: 6319-6323 - [c76]Xiaodong Cui, Songtao Lu, Brian Kingsbury:
Federated Acoustic Modeling for Automatic Speech Recognition. ICASSP 2021: 6748-6752 - [c75]Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. Interspeech 2021: 1802-1806 - [c74]Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-Bit Quantization of LSTM-Based Speech Recognition Models. Interspeech 2021: 2586-2590 - [i23]Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang:
Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation. CoRR abs/2102.01813 (2021) - [i22]Xiaodong Cui, Songtao Lu, Brian Kingsbury:
Federated Acoustic Modeling For Automatic Speech Recognition. CoRR abs/2102.04429 (2021) - [i21]Chia-Yu Chen, Jiamin Ni, Songtao Lu, Xiaodong Cui, Pin-Yu Chen, Xiao Sun, Naigang Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei Zhang, Kailash Gopalakrishnan:
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training. CoRR abs/2104.11125 (2021) - [i20]Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang:
On Sample Based Explanation Methods for NLP: Efficiency, Faithfulness, and Semantic Evaluation. CoRR abs/2106.04753 (2021) - [i19]Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. CoRR abs/2108.10803 (2021) - [i18]Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-bit Quantization of LSTM-based Speech Recognition Models. CoRR abs/2108.12074 (2021) - [i17]Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung:
Asynchronous Decentralized Distributed Training of Acoustic Models. CoRR abs/2110.11199 (2021) - [i16]Wei Zhang, Mingrui Liu, Yu Feng, Xiaodong Cui, Brian Kingsbury, Yuhai Tu:
Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent. CoRR abs/2112.01433 (2021) - 2020
- [j18]Tao You, Peng Wang, Danyang Jia, Fei Yang, Xiaodong Cui, Chen Liu:
The effects of heterogeneity of updating rules on cooperation in spatial network. Appl. Math. Comput. 372 (2020) - [j17]Conrad M. Albrecht, Rui Zhang, Xiaodong Cui, Marcus Freitag, Hendrik F. Hamann, Levente J. Klein, Ulrich Finkler, Fernando J. Marianno, Johannes Schmude, Norman Bobroff, Wei Zhang, Carlo Siebenschuh, Siyuan Lu:
Change Detection from Remote Sensing to Guide OpenStreetMap Labeling. ISPRS Int. J. Geo Inf. 9(7): 427 (2020) - [j16]Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David S. Kung:
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies. IEEE Signal Process. Mag. 37(3): 39-49 (2020) - [c73]Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David S. Kung, Michael Picheny:
Improving Efficiency in Large-Scale Decentralized Distributed Training. ICASSP 2020: 3022-3026 - [c72]Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei Zhang, Xiaodong Cui, Payel Das, Tianbao Yang:
Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets. ICLR 2020 - [c71]Di Chen, Yada Zhu, Xiaodong Cui, Carla P. Gomes:
Task-Based Learning via Task-Oriented Prediction Network with Applications in Finance. IJCAI 2020: 4476-4482 - [c70]Rui Zhang, Conrad M. Albrecht, Wei Zhang, Xiaodong Cui, Ulrich Finkler, David S. Kung, Siyuan Lu:
Map Generation from Large Scale Incomplete and Inaccurate Data Labels. KDD 2020: 2514-2522 - [c69]Chia-Yu Chen, Jiamin Ni, Songtao Lu, Xiaodong Cui, Pin-Yu Chen, Xiao Sun, Naigang Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei Zhang, Kailash Gopalakrishnan:
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training. NeurIPS 2020 - [c68]Mingrui Liu, Wei Zhang, Youssef Mroueh, Xiaodong Cui, Jarret Ross, Tianbao Yang, Payel Das:
A Decentralized Parallel Algorithm for Training Generative Adversarial Nets. NeurIPS 2020 - [c67]Xiao Sun, Naigang Wang, Chia-Yu Chen, Jiamin Ni, Ankur Agrawal, Xiaodong Cui, Swagath Venkataramani, Kaoutar El Maghraoui, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan:
Ultra-Low Precision 4-bit Training of Deep Neural Networks. NeurIPS 2020 - [i15]Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David S. Kung, Michael Picheny:
Improving Efficiency in Large-Scale Decentralized Distributed Training. CoRR abs/2002.01119 (2020) - [i14]Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David S. Kung:
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition. CoRR abs/2002.10502 (2020) - [i13]Rui Zhang, Conrad M. Albrecht, Wei Zhang, Xiaodong Cui, Ulrich Finkler, David S. Kung, Siyuan Lu:
Map Generation from Large Scale Incomplete and Inaccurate Data Labels. CoRR abs/2005.10053 (2020)
2010 – 2019
- 2019
- [c66]Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung, Michael Picheny:
Distributed Deep Learning Strategies for Automatic Speech Recognition. ICASSP 2019: 5706-5710 - [c65]David Haws, Xiaodong Cui:
Cyclegan Bandwidth Extension Acoustic Modeling for Automatic Speech Recognition. ICASSP 2019: 6780-6784 - [c64]Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny:
Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition. INTERSPEECH 2019: 251-255 - [c63]Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. INTERSPEECH 2019: 326-330 - [c62]Xiaodong Cui, Michael Picheny:
Acoustic Model Optimization Based on Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition. INTERSPEECH 2019: 1581-1585 - [c61]Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David S. Kung, Michael Picheny:
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition. INTERSPEECH 2019: 2628-2632 - [c60]Xiao Sun, Jungwook Choi, Chia-Yu Chen, Naigang Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Xiaodong Cui, Wei Zhang, Kailash Gopalakrishnan:
Hybrid 8-bit Floating Point (HFP8) Training and Inference for Deep Neural Networks. NeurIPS 2019: 4901-4910 - [i12]Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David S. Kung, Michael Picheny:
Distributed Deep Learning Strategies For Automatic Speech Recognition. CoRR abs/1904.04956 (2019) - [i11]Xiaodong Cui, Michael Picheny:
Acoustic Model Optimization Based On Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition. CoRR abs/1907.04882 (2019) - [i10]Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny:
Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition. CoRR abs/1907.04887 (2019) - [i9]Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David S. Kung, Michael Picheny:
A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition. CoRR abs/1907.05701 (2019) - [i8]Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. CoRR abs/1908.03455 (2019) - [i7]Di Chen, Yada Zhu, Xiaodong Cui, Carla P. Gomes:
Task-Based Learning via Task-Oriented Prediction Network. CoRR abs/1910.09357 (2019) - [i6]Mingrui Liu, Youssef Mroueh, Wei Zhang, Xiaodong Cui, Jerret Ross, Tianbao Yang, Payel Das:
Decentralized Parallel Algorithm for Training Generative Adversarial Nets. CoRR abs/1910.12999 (2019) - [i5]Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei Zhang, Xiaodong Cui, Payel Das, Tianbao Yang:
Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets. CoRR abs/1912.11940 (2019) - 2018
- [j15]Xiaodong Cui, Lin Zhang, Jia Meng, Manjeet K. Rao, Yidong Chen, Yufei Huang:
MeTDiff: A Novel Differential RNA Methylation Analysis for MeRIP-Seq Data. IEEE ACM Trans. Comput. Biol. Bioinform. 15(2): 526-534 (2018) - [c59]Xiaodong Cui, Wei Zhang, Zoltán Tüske, Michael Picheny:
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks. NeurIPS 2018: 6051-6061 - [i4]Xiaodong Cui, Wei Zhang, Zoltán Tüske, Michael Picheny:
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks. CoRR abs/1810.06773 (2018) - 2017
- [c58]Tom Sercu, George Saon, Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Brian Kingsbury, Abhinav Sethy:
Network architectures for multilingual speech representation learning. ICASSP 2017: 5295-5299 - [c57]Xiaodong Cui, Vaibhava Goel, George Saon:
Embedding-Based Speaker Adaptive Training of Deep Neural Networks. INTERSPEECH 2017: 122-126 - [c56]George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. INTERSPEECH 2017: 132-136 - [c55]Shiyu Chang, Yang Zhang, Wei Han, Mo Yu, Xiaoxiao Guo, Wei Tan, Xiaodong Cui, Michael Witbrock, Mark A. Hasegawa-Johnson, Thomas S. Huang:
Dilated Recurrent Neural Networks. NIPS 2017: 77-87 - [i3]George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. CoRR abs/1703.02136 (2017) - [i2]Shiyu Chang, Yang Zhang, Wei Han, Mo Yu, Xiaoxiao Guo, Wei Tan, Xiaodong Cui, Michael Witbrock, Mark Hasegawa-Johnson, Thomas S. Huang:
Dilated Recurrent Neural Networks. CoRR abs/1710.02224 (2017) - [i1]Xiaodong Cui, Vaibhava Goel, George Saon:
Embedding-Based Speaker Adaptive Training of Deep Neural Networks. CoRR abs/1710.06937 (2017) - 2016
- [j14]Xiaodong Cui, Jia Meng, Shaowu Zhang, Yidong Chen, Yufei Huang:
A novel algorithm for calling mRNA m6A peaks by modeling biological variances in MeRIP-seq data. Bioinform. 32(12): 378-385 (2016) - [j13]Xiaodong Cui, Vaibhava Goel:
Maximum Likelihood Nonlinear Transformations Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2023-2031 (2016) - [c54]Steven J. Rennie, Xiaodong Cui, Vaibhava Goel:
Efficient non-linear feature adaptation using Maxout networks. ICASSP 2016: 5310-5314 - 2015
- [j12]Xiaodong Cui, Vaibhava Goel, Brian Kingsbury:
Data Augmentation for Deep Neural Network Acoustic Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 23(9): 1469-1477 (2015) - [c53]Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266 - [c52]Xiaodong Cui, Zhen Wei, Lin Zhang, Hui Liu, Lei Sun, Shaowu Zhang, Yufei Huang, Jia Meng:
Sketching the distribution of transcriptomic features on RNA transcripts with Travis coordinates. BIBM 2015: 1536-1542 - [c51]Xiaodong Cui, Jia Meng, Shaowu Zhang, Yufei Huang:
Modeling of replicates variances for detecting RNA methylation site in MERIP-SEQ data. ChinaSIP 2015: 802-806 - [c50]Xiaodong Cui, Vaibhava Goel:
Maximum likelihood nonlinear transformations based on deep neural networks. ICASSP 2015: 4320-4324 - [c49]Xiaodong Cui, Vaibhava Goel, Brian Kingsbury:
Data augmentation for deep convolutional neural network acoustic modeling. ICASSP 2015: 4545-4549 - [c48]Steven J. Rennie, Pierre L. Dognin, Xiaodong Cui, Vaibhava Goel:
Annealed dropout trained maxout networks for improved LVCSR. ICASSP 2015: 5181-5185 - 2014
- [c47]Lin Zhang, Jia Meng, Hui Liu, Xiaodong Cui, Shao-Wu Zhang, Yidong Chen, Yufei Huang:
Detecting differentially methylated mRNA from MeRIP-Seq with likelihood ratio test. GlobalSIP 2014: 1368-1371 - [c46]Yu-Chen Zhang, Shao-Wu Zhang, Lian Liu, Lin Zhang, Hui Liu, Xiaodong Cui, Yufei Huang, Jia Meng:
Differential analysis of RNA methylome with improved spatial resolution. GlobalSIP 2014: 1372-1375 - [c45]Xiaodong Cui, Vaibhava Goel, Brian Kingsbury:
Data Augmentation for deep neural network acoustic modeling. ICASSP 2014: 5582-5586 - [c44]Markus Nußbaum-Thom, Xiaodong Cui, Ralf Schlüter, Vaibhava Goel, Hermann Ney:
A family of discriminative training criteria based on the F-divergence for deep neural networks. ICASSP 2014: 5612-5616 - [c43]Raul Fernandez, Jia Cui, Andrew Rosenberg, Bhuvana Ramabhadran, Xiaodong Cui:
Exploiting vocal-source features to improve ASR accuracy for low-resource languages. INTERSPEECH 2014: 805-809 - [c42]Jia Cui, Bhuvana Ramabhadran, Xiaodong Cui, Andrew Rosenberg, Brian Kingsbury, Abhinav Sethy:
Recent improvements in neural network acoustic modeling for LVCSR in low resource languages. INTERSPEECH 2014: 840-844 - [c41]Xiaodong Cui, Brian Kingsbury, Jia Cui, Bhuvana Ramabhadran, Andrew Rosenberg, Mohammad Sadegh Rasooli, Owen Rambow, Nizar Habash, Vaibhava Goel:
Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA babel program. INTERSPEECH 2014: 2103-2107 - 2013
- [j11]Jia Meng, Xiaodong Cui, Manjeet K. Rao, Yidong Chen, Yufei Huang:
Exome-based analysis for RNA epigenome sequencing data. Bioinform. 29(12): 1565-1567 (2013) - [j10]Xiaodong Cui, Mohamed Afify, Yuqing Gao, Bowen Zhou:
Stereo hidden Markov modeling for noise robust speech recognition. Comput. Speech Lang. 27(2): 407-419 (2013) - [j9]Bowen Zhou, Xiaodong Cui, Songfang Huang, Martin Cmejrek, Wei Zhang, Jian Xue, Jia Cui, Bing Xiang, Gregg Daggett, Upendra V. Chaudhari, Sameer Maskey, Etienne Marcheret:
The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks. Comput. Speech Lang. 27(2): 592-618 (2013) - [c40]Murat Saraclar, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Jia Cui, Xiaodong Cui, Brian Kingsbury, Jonathan Mamou:
An empirical study of confusion modeling in keyword search for low resource languages. ASRU 2013: 464-469 - [c39]Jia Meng, Xiaodong Cui, Hui Liu, Lin Zhang, Shaowu Zhang, Manjeet K. Rao, Yidong Chen, Yufei Huang:
Unveiling the dynamics in RNA epigenetic regulations. BIBM 2013: 139-144 - [c38]Xiaodong Cui, Jia Meng, Manjeet K. Rao, Yidong Chen, Yufei Huang:
An HMM-based Exome Peak-finding package for RNA epigenome sequencing data. GENSiPS 2013: 85 - [c37]Xiaodong Cui, Jia Meng, Manjeet K. Rao, Yidong Chen, Yufei Huang:
Differential analysis of rna methylation sequencing data. GlobalSIP 2013: 41-42 - [c36]Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Janice Kim, Brian Kingsbury, Jonathan Mamou, Lidia Mangu, Michael Picheny, Tara N. Sainath, Abhinav Sethy:
Developing speech recognition systems for corpus indexing under the IARPA Babel program. ICASSP 2013: 6753-6757 - [c35]Jonathan Mamou, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Brian Kingsbury, Kate M. Knill, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
System combination and score normalization for spoken term detection. ICASSP 2013: 8272-8276 - [c34]Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Kate M. Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
A high-performance Cantonese keyword search system. ICASSP 2013: 8277-8281 - [c33]Xiaodong Cui, Vaibhava Goel, Brian Kingsbury:
Mixtures of Bayesian joint factor analyzers for noise robust automatic speech recognition. INTERSPEECH 2013: 3012-3016 - [c32]Shay Maymon, Pierre L. Dognin, Xiaodong Cui, Vaibhava Goel:
Adaptive stereo-based stochastic mapping. INTERSPEECH 2013: 3517-3521 - 2012
- [j8]Xiaodong Cui, Jing Huang, Jen-Tzung Chien:
Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 20(7): 1923-1935 (2012) - [j7]Xiaodong Cui, Jian Xue, Xin Chen, Peder A. Olsen, Pierre L. Dognin, Upendra V. Chaudhari, John R. Hershey, Bowen Zhou:
Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages. IEEE Trans. Speech Audio Process. 20(8): 2252-2264 (2012) - [c31]Xiaodong Cui, Mohamed Afify, Bowen Zhou:
Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition. ICASSP 2012: 4705-4708 - [c30]Xiaodong Cui, Mohamed Afify, George Saon, Vaibhava Goel:
Sparse Bayesian Factor Analysis for Stereo-based Stochastic Mapping. INTERSPEECH 2012: 795-798 - 2011
- [c29]Upendra V. Chaudhari, Xiaodong Cui, Bowen Zhou, Rong Zhang:
An investigation of heuristic, manual and statistical pronunciation derivation for Pashto. ASRU 2011: 249-253 - [c28]Xin Chen, Xiaodong Cui, Jian Xue, Peder A. Olsen, John R. Hershey, Bowen Zhou, Yunxin Zhao:
Clustering of bootstrapped acoustic model with full covariance. ICASSP 2011: 4496-4499 - [c27]Xiaodong Cui, Jing Huang, Jen-Tzung Chien:
Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition. ICASSP 2011: 4668-4671 - [c26]Xiaodong Cui, Xin Chen, Jian Xue, Peder A. Olsen, John R. Hershey, Bowen Zhou:
Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance. INTERSPEECH 2011: 1697-1700 - [c25]Jian Xue, Xiaodong Cui, Gregg Daggett, Etienne Marcheret, Bowen Zhou:
Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones. INTERSPEECH 2011: 2861-2864 - 2010
- [c24]Chengyuan Ma, Hong-Kwang Jeff Kuo, Hagen Soltau, Xiaodong Cui, Upendra V. Chaudhari, Lidia Mangu, Chin-Hui Lee:
A comparative study on system combination schemes for LVCSR. ICASSP 2010: 4394-4397 - [c23]Wei Zhang, Xiaodong Cui:
Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech. INTERSPEECH 2010: 154-157 - [c22]Xiaodong Cui, Jian Xue, Pierre L. Dognin, Upendra V. Chaudhari, Bowen Zhou:
Acoustic modeling with bootstrap and restructuring for low-resourced languages. INTERSPEECH 2010: 2974-2977
2000 – 2009
- 2009
- [j6]Mohamed Afify, Xiaodong Cui, Yuqing Gao:
Stereo-Based Stochastic Mapping for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 17(7): 1325-1334 (2009) - [c21]Xiaodong Cui, Jian Xue, Bowen Zhou:
Improving online incremental speaker adaptation with eigen feature space MLLR. ASRU 2009: 136-140 - [c20]Xiaodong Cui, Mohamed Afify, Yuqing Gao:
Stereo-based stochastic mapping with discriminative training for noise robust speech recognition. ICASSP 2009: 3933-3936 - [c19]Xiaodong Cui, Jian Xue, Bing Xiang, Bowen Zhou:
A study of bootstrapping with multiple acoustic features for improved automatic speech recognition. INTERSPEECH 2009: 240-243 - 2008
- [c18]Xiaodong Cui, Mohamed Afify, Yuqing Gao:
MMSE-based stereo feature stochastic mapping for noise robust speech recognition. ICASSP 2008: 4077-4080 - [c17]Xiaodong Cui, Liang Gu, Bing Xiang, Wei Zhang, Yuqing Gao:
Developing high performance asr in the IBM multilingual speech-to-speech translation system. ICASSP 2008: 5121-5124 - [c16]Xiaodong Cui, Mohamed Afify, Yuqing Gao:
N-best based stochastic mapping on stereo HMM for noise robust speech recognition. INTERSPEECH 2008: 1261-1264 - [c15]Liang Gu, Jian Xue, Xiaodong Cui, Yuqing Gao:
High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation. INTERSPEECH 2008: 2098-2101 - 2007
- [j5]Xiaodong Cui, Abeer Alwan:
Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion. IEEE Trans. Speech Audio Process. 15(2): 652-660 (2007) - [j4]Xiaodong Cui, Yifan Gong:
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1366-1376 (2007) - [j3]Shizhen Wang, Xiaodong Cui, Abeer Alwan:
Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment. IEEE Trans. Speech Audio Process. 15(8): 2454-2464 (2007) - [c14]Mohamed Afify, Xiaodong Cui, Yuqing Gao:
Stereo-Based Stochastic Mapping for Robust Speech Recognition. ICASSP (4) 2007: 377-380 - 2006
- [j2]Xiaodong Cui, Abeer Alwan:
Adaptation of children's speech with limited data based on formant-like peak alignment. Comput. Speech Lang. 20(4): 400-419 (2006) - [c13]Li Deng, Xiaodong Cui, Robert Pruvenok, Yanyi Chen, Safiyy Momen, Abeer Alwan:
A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing. ICASSP (1) 2006: 369-372 - [c12]Xiaodong Cui, Yifan Gong:
Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition. ICASSP (1) 2006: 1117-1120 - [c11]Shizhen Wang, Xiaodong Cui, Abeer Alwan:
Rapid speaker adaptation using regression-tree based spectral peak alignment. INTERSPEECH 2006 - 2005
- [j1]Xiaodong Cui, Abeer Alwan:
Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR. IEEE Trans. Speech Audio Process. 13(6): 1161-1172 (2005) - [c10]Xiaodong Cui, Abeer Alwan:
MLLR-like speaker adaptation based on linearization of VTLN with MFCC features. INTERSPEECH 2005: 273-276 - [c9]Abe Kazemzadeh, Hong You, Markus Iseli, Barbara Jones, Xiaodong Cui, Margaret Heritage, Patti Price, Elaine Andersen, Shrikanth S. Narayanan, Abeer Alwan:
TBALL data collection: the making of a young children's speech corpus. INTERSPEECH 2005: 1581-1584 - 2004
- [c8]Xiaodong Cui, Abeer Alwan:
Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation data. ICASSP (1) 2004: 969-972 - [c7]Alexis Bernard, Yifan Gong, Xiaodong Cui:
Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database. ICASSP (1) 2004: 1025-1028 - 2003
- [c6]Xiaodong Cui, Yifan Gong:
Variable parameter Gaussian mixture hidden Markov modeling for speech recognition. ICASSP (1) 2003: 12-15 - [c5]Xiaodong Cui, Alexis Bernard, Abeer Alwan:
A noise-robust ASR back-end technique based on weighted viterbi recognition. INTERSPEECH 2003: 2169-2172 - 2002
- [c4]Xiaodong Cui, Abeer Alwan:
Efficient adaptation text design based on the Kullback-Leibler measure. ICASSP 2002: 613-616 - [c3]Xiaodong Cui, Markus Iseli, Qifeng Zhu, Abeer Alwan:
Evaluation of noise robust features on the Aurora databases. INTERSPEECH 2002: 481-484 - 2001
- [c2]Qifeng Zhu, Markus Iseli, Xiaodong Cui, Abeer Alwan:
Noise robust feature extraction for ASR using the Aurora 2 database. INTERSPEECH 2001: 185-188 - 2000
- [c1]Jiasong Sun, Xiaodong Cui, Zuoying Wang, Yang Liu:
A language model adaptation approach based on text classification. INTERSPEECH 2000: 516-519
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 21:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint