default search action
Yu Hu 0003
Person information
- affiliation: IFLYTEK Research, Hefei, China
- affiliation (PhD 2009): University of Science and Technology of China, National Engineering Laboratory of Speech and Language Information Processing, Hefei, China
Other persons with the same name
- Yu Hu — disambiguation page
- Yu Hu 0001 — Chinese Academy of Sciences, State Key Laboratory of Computer Architecture, Institute of Computing Technology, Beijing, China
- Yu Hu 0002 — Huazhong University of Science and Technology, School of Optical and Electronic Information, Department of Electronic Science and Technology, Wuhan, China (and 3 more)
- Yu Hu 0004 — South China University of Technology, Department of Computer Science and Engineering, Guangdong, China
- Yu Hu 0005 — Zhejiang University, College of Computer Science and Technology, State Key Laboratory of CAD & CG, Hangzhou, China
- Yu Hu 0006 — Beijing University of Chemical Technology, College of Information Science and Technology, Beijing, China
- Yu Hu 0007 — Beihang University, School of Automation Science and Electrical Engineering, Beijing, China (and 1 more)
- Yu Hu 0008 — Hong Kong University of Science and Technology, Department of Mathematics and Division of Life Science, Hong Kong
- Yu Hu 0009 — Beijing Information Science and Technology University, School of Automation, Beijing, China (and 2 more)
- Yu Hu 0010 — Jiangsu University, Automotive Engineering Research Institute, Zhenjiang, China
- Yu Hu 0011 — Minnan Normal University, Zhangzhou, China
- Yu Hu 0012 — Tsinghua University, Beijing, China
- Yu Hu 0013 — Liaoning Technical University, Faculty of Electrical and Control Engineering, Huludao, China
- Yu Hu 0014 — University of Pennsylvania, Perelman School of Medicine, Department of Biostatistics, Epidemiology and Informatics, Philadelphia, PA, USA
- Yu Hu 0015 — George Washington University, Department of Biochemistry and Molecular Medicine, Washington, DC, USA
- Yu Hu 0016 — University of Southern California, Integrated Media Systems Center and Department of Electrical Engineering, Los Angeles, CA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c46]Zhenrong Zhang, Shuhang Liu, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Yu Hu:
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition. EMNLP (Findings) 2024: 6131-6143 - [i9]Jiefeng Ma, Yan Wang, Chenyu Liu, Jun Du, Yu Hu, Zhenrong Zhang, Pengfei Hu, Qing Wang, Jianshu Zhang:
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding. CoRR abs/2406.08757 (2024) - [i8]Zhenrong Zhang, Shuhang Liu, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Yu Hu:
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition. CoRR abs/2409.13148 (2024) - 2023
- [j12]Mobai Xue, Jun Du, Bin Wang, Bo Ren, Yu Hu:
Joint optimization for attention-based generation and recognition of chinese characters using tree position embedding. Pattern Recognit. 140: 109538 (2023) - [j11]Shutong Niu, Jun Du, Lei Sun, Yu Hu, Chin-Hui Lee:
QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1037-1049 (2023) - 2021
- [j10]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021) - [j9]Runze Wang, Zhen-Hua Ling, Jing-Bo Zhou, Yu Hu:
A Multiple-Integration Encoder for Multi-Turn Text-to-SQL Semantic Parsing. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1503-1513 (2021) - [j8]Yi-Yang Ding, Hao-Jian Lin, Li-Juan Liu, Zhen-Hua Ling, Yu Hu:
Robustness of Speech Spoofing Detectors Against Adversarial Post-Processing of Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3415-3426 (2021) - [c45]Runze Wang, Zhen-Hua Ling, Jingbo Zhou, Yu Hu:
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing. AAAI 2021: 13979-13987 - [c44]Yi-Yang Ding, Li-Juan Liu, Yu Hu, Zhen-Hua Ling:
Adversarial Voice Conversion Against Neural Spoofing Detectors. Interspeech 2021: 816-820 - [c43]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005 - 2020
- [c42]Yi-Yang Ding, Jing-Xuan Zhang, Li-Juan Liu, Yuan Jiang, Yu Hu, Zhen-Hua Ling:
Adversarial Post-Processing of Voice Conversion against Spoofing Detection. APSIPA 2020: 556-560 - [i7]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020) - [i6]Runze Wang, Zhen-Hua Ling, Jingbo Zhou, Yu Hu:
Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing. CoRR abs/2012.04995 (2020) - [i5]Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)
2010 – 2019
- 2019
- [j7]Runze Wang, Zhen-Hua Ling, Yu Hu:
Knowledge Base Question Answering With Attentive Pooling for Question Representation. IEEE Access 7: 46773-46784 (2019) - 2017
- [j6]Yonghong Tian, Xilin Chen, Hongkai Xiong, Hong-Liang Li, Li-Rong Dai, Jing Chen, Junliang Xing, Jing Chen, Xihong Wu, Weiming Hu, Yu Hu, Tiejun Huang, Wen Gao:
Towards human-like and transhuman perception in AI 2.0: a review. Frontiers Inf. Technol. Electron. Eng. 18(1): 58-67 (2017) - [j5]Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu:
Nonrecurrent Neural Structure for Long-Term Dependence. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 871-884 (2017) - [c41]Quan Liu, Hui Jiang, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems. AAAI Spring Symposia 2017 - [c40]Quan Liu, Hui Jiang, Andrew Evdokimov, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems. IJCAI 2017: 2344-2350 - 2016
- [c39]Yu-Ping Ruan, Zhen-Hua Ling, Yu Hu:
Exploring Semantic Representation in Brain Activity Using Word Embeddings. EMNLP 2016: 669-679 - [c38]Zhen-Hua Ling, Xiao-Hui Sun, Li-Rong Dai, Yu Hu:
Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs. ICASSP 2016: 5595-5599 - [c37]Quan Liu, Wu Guo, Zhen-Hua Ling, Hui Jiang, Yu Hu:
Intra-Topic Variability Normalization based on Linear Projection for Topic Classification. HLT-NAACL 2016: 441-446 - [i4]Quan Liu, Zhen-Hua Ling, Hui Jiang, Yu Hu:
Part-of-Speech Relevance Weights for Learning Word Embeddings. CoRR abs/1603.07695 (2016) - [i3]Quan Liu, Hui Jiang, Zhen-Hua Ling, Si Wei, Yu Hu:
Probabilistic Reasoning via Deep Learning: Neural Association Models. CoRR abs/1603.07704 (2016) - [i2]Quan Liu, Hui Jiang, Zhen-Hua Ling, Xiaodan Zhu, Si Wei, Yu Hu:
Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems. CoRR abs/1611.04146 (2016) - 2015
- [j4]Pan Zhou, Hui Jiang, Li-Rong Dai, Yu Hu, Qingfeng Liu:
State-Clustering Based Multiple Deep Neural Networks Modeling Approach for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 631-642 (2015) - [c36]Quan Liu, Hui Jiang, Si Wei, Zhen-Hua Ling, Yu Hu:
Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints. ACL (1) 2015: 1501-1511 - [c35]Yu Hu:
Keynote speech 1: Artificial intelligence needs a language cognitive revolution. O-COCOSDA/CASLRE 2015: 1 - [i1]Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu:
Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency. CoRR abs/1512.08301 (2015) - 2012
- [c34]Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, Hui Jiang:
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling. ISCSLP 2012: 301-305 - 2011
- [j3]Jun Du, Yu Hu, Hui Jiang:
Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition. IEEE Trans. Speech Audio Process. 19(7): 2091-2100 (2011) - [j2]Cong Liu, Yu Hu, Li-Rong Dai, Hui Jiang:
Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2474-2485 (2011) - [c33]Ling-Hui Chen, Chen-Yu Yang, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai, Yu Hu, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2011. Blizzard Challenge 2011 - 2010
- [c32]Yuan Jiang, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Heng Lu, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2010. Blizzard Challenge 2010 - [c31]Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
HMM-based pseudo-clean speech synthesis for splice algorithm. ICASSP 2010: 4570-4573 - [c30]Cong Liu, Yu Hu, Hui Jiang, Li-Rong Dai:
A bounded trust region optimization for discriminative training of HMMS in speech recognition. ICASSP 2010: 4914-4917 - [c29]Zhen-Hua Ling, Yu Hu, Li-Rong Dai:
Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis. INTERSPEECH 2010: 825-828 - [c28]Jun Du, Yu Hu, Hui Jiang:
Boosted mixture learning of Gaussian mixture HMMs for speech recognition. INTERSPEECH 2010: 2942-2945 - [c27]Zhiguo Wang, Cong Liu, Hai-Kun Wang, Yu Hu, Li-Rong Dai:
Phonetic clustering based confidence measure for embedded speech recognition. ISCSLP 2010: 186-189 - [c26]Si Wei, Qianyong Gao, Guoping Hu, Yu Hu:
Robust pronunciation evaluation in adverse environments. ISCSLP 2010: 412-415
2000 – 2009
- 2009
- [j1]Si Wei, Guoping Hu, Yu Hu, Ren-Hua Wang:
A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models. Speech Commun. 51(10): 896-905 (2009) - [c25]Heng Lu, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Huan-huan Zhao, Ling-Hui Chen, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2009. Blizzard Challenge 2009 - [c24]Zhi-Jie Yan, Cong Liu, Yu Hu, Hui Jiang:
A trust region based optimization for maximum mutual information estimation of HMMS in speech recognition. ICASSP 2009: 3757-3760 - 2008
- [c23]Zhi-Jie Yan, Bo Zhu, Yu Hu, Ren-Hua Wang:
Minimum word classification error training of HMMS for automatic speech recognition. ICASSP 2008: 4521-4524 - [c22]Sibao Chen, Yu Hu, Bin Luo, Ren-Hua Wang:
Heteroscedastic discriminant analysis with two-dimensional constraints. ICASSP 2008: 4701-4704 - [c21]Si Wei, Yi-Qian Pan, Guoping Hu, Yu Hu, Ren-Hua Wang:
Pronunciation Space Models for Pronunciation Evaluation. ISCSLP 2008: 21-24 - [c20]Jun Du, Qiang Huo, Yu Hu:
Evaluation of a Feature Compensation Approach Using High-Order Vector Taylor Series Approximation of an Explicit Distortion Modelon Aurora2, Aurora3, and Aurora4 Tasks. ISCSLP 2008: 81-84 - [c19]Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and Map. ISCSLP 2008: 93-96 - [c18]Heng Lu, Zhen-Hua Ling, Si Wei, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
Heteronym Verification for Mandarin Speech Synthesis. ISCSLP 2008: 137-140 - [c17]Sibao Chen, Yu Hu, Bin Luo, Ren-Hua Wang:
An Improvement for Training Efficiency of Semi-Tied Covariance. ISCSLP 2008: 201-204 - [c16]Cong Liu, Yu Hu, Xiong-Guo Lei, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion. ISCSLP 2008: 229-232 - 2006
- [c15]Si Wei, Qing-Sheng Liu, Yu Hu, Ren-Hua Wang:
Automatic Mandarin pronunciation scoring for native learners with dialect accent. INTERSPEECH 2006 - [c14]Cong Liu, Zhijie Yan, Yu Hu, Renhua Wang:
A Comparative Study on Confidence Measure in Mandarin Command Word Recognition. ISCSLP 2006 - [c13]Yu Hu, Qiang Huo:
An HMM Compensation Approach Using Unscented Transformation for Noisy Speech Recognition. ISCSLP (Selected Papers) 2006: 346-357 - [c12]Qing-Sheng Liu, Si Wei, Yu Hu, Wu Guo, Ren-Hua Wang:
The Application of Phone Weight in Putonghua Pronunciation Quality Assessment. ISCSLP 2006 - 2005
- [c11]Zhen-Hua Ling, Yu Hu, Ren-Hua Wang:
A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum. ACII 2005: 441-448 - 2004
- [c10]Yu Hu, Ren-Hua Wang, Lu Sun:
Polynomial regression model for duration prediction in Mandarin. INTERSPEECH 2004: 769-772 - [c9]Zhen-Hua Ling, Yu Hu, Zhiwei Shuang, Ren-Hua Wang:
Compression of speech database by feature separation and pattern clustering using STRAIGHT. INTERSPEECH 2004: 1201-1204 - [c8]Zhen-Hua Ling, Yu-Ping Wang, Yu Hu, Ren-Hua Wang:
Modeling glottal effect on the spectral envelop of STRAIGHT using mixture of Gaussians. ISCSLP 2004: 73-76 - [c7]Guoping Hu, Qingfeng Liu, Yu Hu, Ren-Hua Wang:
Hearer model based stress prediction for Chinese TTS system. ISCSLP 2004: 161-164 - 2002
- [c6]Yi-Jian Wu, Yu Hu, Xiaoru Wu, Ren-Hua Wang:
A new method of building decision tree based on target information. INTERSPEECH 2002: 129-132 - [c5]Zhiwei Shuang, Yu Hu, Zhen-Hua Ling, Ren-Hua Wang:
A miniature Chinese TTS system based on tailored corpus. INTERSPEECH 2002: 2389-2392 - [c4]Zhen-Hua Ling, Yu Hu, Zhiwei Shuang, Ren-Hua Wang:
Decision tree based unit pre-selection in Mandarin Chinese synthesis. ISCSLP 2002 - 2000
- [c3]Ren-Hua Wang, Qingfeng Liu, Yu Hu, Bo Yin, Xiaoru Wu:
KD2000 Chinese Text-To-Speech System. ICMI 2000: 300-307 - [c2]Yu Hu, Qingfeng Liu, Ren-Hua Wang:
Prosody generation in Chinese synthesis using the template of quantified prosodic unit and base intonation contour. INTERSPEECH 2000: 55-58 - [c1]Donglai Zhu, Yu Hu, Ren-Hua Wang:
Automatic Segmentation and Labeling of Speech Corpus Based on HMM With Adaptation. ISCSLP 2000
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 21:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint