Hao Huang 0009
Person information
- affiliation: Xinjiang University, School of Information Science and Engineering, Xinjiang Provincial Key Laboratory of Multilingual Information Technology, Urumqi, China
- affiliation (PhD 2008): Shanghai Jiao Tong University, Department of Electronic Engineering, Shanghai, China
Other persons with the same name
- Hao Huang — disambiguation page
- Hao Huang 0001 — Wuhan University, School of Computer Science, China
- Hao Huang 0002 — University of Southern California, Department of Electrical Engineering, Los Angeles, CA, USA
- Hao Huang 0003 — New York University, Brooklyn, NY, USA
- Hao Huang 0004 — Southwest Jiaotong University, School of Transportation and Logistics, China
- Hao Huang 0005 — National University of Singapore (and 2 more)
- Hao Huang 0006 — Princeton University, Department of Electrical and Computer Engineering, Princeton, NJ, USA (and 1 more)
- Hao Huang 0007 — GE Global Research, San Ramon, CA, USA (and 1 more)
- Hao Huang 0008 — Nanjing University of Posts and Telecommunications, College of Telecommunications and Information Engineering, Nanjing, China
- Hao Huang 0010 — Huazhong University of Science and Technology, School of Software Engineering, Wuhan, China (and 1 more)
- Hao Huang 0011 — Nanjing University, Department of Computer Science and Technology, State Key Laboratory for Novel Software Technology, Nanjing, China
- Hao Huang 0012 — Yuan Ze University, Department of Industrial Engineering and Management, Taoyuan, Taiwan (and 1 more)
- Hao Huang 0013 — Nanjing University, School of Atmospheric Sciences, Frontiers Science Center for Critical Earth Material Cycling, Key Laboratory for Mesoscale Severe Weather, Nanjing, China (and 1 more)
- Hao Huang 0014 — Technische Informationsbibliothek (TIB), Hannover, Germany
2020 – today
- 2024
- [j16] Mengzhen Ma, Ying Hu, Liang He, Hao Huang: GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration. EURASIP J. Audio Speech Music. Process. 2024(1): 34 (2024)
- [j15] Xiaoqing Zhao, Miaomiao Xu, Yanbing Li, Hao Huang, Wushour Silamu: Scene text recognition with context-aware autonomous bidirectional iterative models. J. Intell. Fuzzy Syst. 46(4): 8605-8616 (2024)
- [j14] Di Wu, Liting Jiang, Lili Yin, Zhe Li, Hao Huang: CEA-Net: a co-interactive external attention network for joint intent detection and slot filling. Neural Comput. Appl. 36(22): 13513-13525 (2024)
- [j13] Wenbing Wei, Ying Hu, Hao Huang, Liang He: IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration. IEEE Signal Process. Lett. 31: 196-200 (2024)
- [c38] Ying Hu, Haitao Xu, Zhongcun Guo, Hao Huang, Liang He: SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention. ICASSP 2024: 1496-1500
- [c37] Zhida Song, Liang He, Penghao Wang, Ying Hu, Hao Huang: Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification. ICASSP 2024: 10091-10095
- [c36] Minjie Tang, Hao Huang, Wenbo Zhang, Liang He: Phase Continuity-Aware Self-Attentive Recurrent Network with Adaptive Feature Selection for Robust VAD. ICASSP 2024: 11506-11510
- [c35] Sijie Feng, Haoxiang Su, Hongyan Xie, Di Wu, Hao Huang, Wushour Silamu: Fact-Aware Summarization with Contrastive Learning for Few-Shot Dialogue State Tracking. ICASSP 2024: 12211-12215
- [c34] Haoxiang Su, Sijie Feng, Hongyan Xie, Di Wu, Hao Huang, Zhongjiang He, Shuangyong Song, Ruiyu Fang, Xiaomeng Huang, Wushour Silamu: Domain-Slot Aware Contrastive Learning for Improved Dialogue State Tracking. ICASSP 2024: 12521-12525
- [c33] Shuangyong Song, Hongyan Xie, Haoxiang Su, Hao Huang, Mengxiang Li, Zhongjiang He, Yongxiang Li, Ruiyu Fang: Improving Pointer Network based Dialogue State Tracking via Dual Hierarchical Selective Augmentation. IJCNN 2024: 1-8
- [c32] Shuangyong Song, Jinwei Zhang, Hao Huang, Hongyan Xie, Haoxiang Su, Mengxiang Li, Zhongjiang He, Yongxiang Li, Ruiyu Fang: Graph-based Dynamic Domain Selection for Dialogue State Tracking. IJCNN 2024: 1-7
- [c31] Hongli Yang, Xinyi Chen, Junjie Li, Hao Huang, Siqi Cai, Haizhou Li: Listen to the Speaker in Your Gaze. CIS-RAM 2024: 380-385
- 2023
- [j12] Hao Huang, Lin Wang, Jichen Yang, Ying Hu, Liang He: W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision. EURASIP J. Audio Speech Music. Process. 2023(1): 45 (2023)
- [j11] Kai Wang, Jingjing Liu, Yizhou Peng, Hao Huang: Neural RAPT: deep learning-based pitch tracking with prior algorithmic knowledge instillation. Int. J. Speech Technol. 26(4): 999-1015 (2023)
- [c30] Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi: Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech Recognition. ICASSP 2023: 1-5
- [c29] Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang: SRTNET: Time Domain Speech Enhancement via Stochastic Refinement. ICASSP 2023: 1-5
- [c28] Kai Wang, Yuhang Yang, Hao Huang, Ying Hu, Sheng Li: Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation. ICASSP 2023: 1-5
- [c27] Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li: Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition. ICASSP 2023: 1-5
- [c26] Zhibin Qiu, Yachao Guo, Mengfan Fu, Hao Huang, Ying Hu, Liang He, Fuchun Sun: CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising. ICME 2023: 1709-1714
- [c25] Ying Hu, Shijing Hou, Huamin Yang, Hao Huang, Liang He: A Joint Network Based on Interactive Attention for Speech Emotion Recognition. ICME 2023: 1715-1720
- [c24] Fangjing Niu, Tengfei Cao, Ying Hu, Hao Huang, Liang He: Speech Topic Classification Based on Pre-trained and Graph Networks. ICME 2023: 1721-1726
- [c23] Yachao Guo, Zhibin Qiu, Hao Huang, Chng Eng Siong: Improved Keyword Recognition Based on Aho-Corasick Automaton. IJCNN 2023: 1-7
- [c22] Rui Li, Zhiwei Xie, Haihua Xu, Yizhou Peng, Hexin Liu, Hao Huang, Eng Siong Chng: Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory. INTERSPEECH 2023: 1968-1972
- [c21] Yuan Gao, Ying Hu, Liusong Wang, Hao Huang, Liang He: MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music. INTERSPEECH 2023: 5396-5400
- [c20] Xiaojiao Chen, Sheng Li, Jiyi Li, Hao Huang, Yang Cao, Liang He: Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization. MMAsia 2023: 93:1-93:5
- [c19] Xiaojiao Chen, Sheng Li, Jiyi Li, Yang Cao, Hao Huang, Liang He: GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System. MMAsia 2023: 94:1-94:5
- [i13] Zhibin Qiu, Mengfan Fu, Fuchun Sun, Gulila Altenbek, Hao Huang: SE-Bridge: Speech Enhancement with Consistent Brownian Bridge. CoRR abs/2305.13796 (2023)
- 2022
- [j10] Yadong Chen, Ying Hu, Liang He, Hao Huang: Multi-stage music separation network with dual-branch attention and hybrid convolution. J. Intell. Inf. Syst. 59(3): 635-656 (2022)
- [j9] Yuwu Tang, Ying Hu, Liang He, Hao Huang: A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition. Speech Commun. 143: 21-32 (2022)
- [j8] Ying Hu, Yadong Chen, Wenzhong Yang, Liang He, Hao Huang: Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation. IEEE Signal Process. Lett. 29: 1517-1521 (2022)
- [c18] Zhida Song, Liang He, Zhihua Fang, Ying Hu, Hao Huang: Virtual Fully-Connected Layer for a Large-Scale Speaker Verification Dataset. CCBR 2022: 382-390
- [c17] Kai Wang, Yizhou Peng, Hao Huang, Ying Hu, Sheng Li: Mining Hard Samples Locally And Globally For Improved Speech Separation. ICASSP 2022: 6037-6041
- [c16] Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng: Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR. ICASSP 2022: 7807-7811
- [c15] Xiaojiao Chen, Sheng Li, Hao Huang: GhostVec: Directly Extracting Speaker Embedding from End-to-End Speech Recognition Model Using Adversarial Examples. ICONIP (6) 2022: 482-492
- [c14] Guangxing Li, Wangjin Zhou, Sheng Li, Yi Zhao, Jichen Yang, Hao Huang: Investigating Effective Domain Adaptation Method for Speaker Verification Task. ICONIP (6) 2022: 517-527
- [c13] Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He: A Multi-grained based Attention Network for Semi-supervised Sound Event Detection. INTERSPEECH 2022: 1531-1535
- [c12] Ying Hu, Yuwu Tang, Hao Huang, Liang He: A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. INTERSPEECH 2022: 4705-4709
- [i12] Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Sadao Kurohashi: Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition. CoRR abs/2204.03855 (2022)
- [i11] Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He: A Multi-grained based Attention Network for Semi-supervised Sound Event Detection. CoRR abs/2206.10175 (2022)
- [i10] Ying Hu, Yuwu Tang, Hao Huang, Liang He: A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. CoRR abs/2207.00940 (2022)
- [i9] Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng: Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition. CoRR abs/2207.04176 (2022)
- [i8] Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang: Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder. CoRR abs/2207.04177 (2022)
- [i7] Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang: SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement. CoRR abs/2210.16805 (2022)
- [i6] Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li: Speech-text based multi-modal training with bidirectional attention for improved speech recognition. CoRR abs/2211.00325 (2022)
- 2021
- [j7] Xiao Kang, Hao Huang, Ying Hu, Zhihua Huang: Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion. Digit. Signal Process. 116: 103110 (2021)
- [j6] Weiqi Gao, Hao Huang: A gating context-aware text classification model with BERT and graph convolutional networks. J. Intell. Fuzzy Syst. 40(3): 4331-4343 (2021)
- [j5] Wenfang Ma, Ying Hu, Hao Huang: Dual Attention Network for Pitch Estimation of Monophonic Music. Symmetry 13(7): 1296 (2021)
- [c11] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng: Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025
- [c10] Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng: Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048
- [c9] Hao Huang, Kai Wang, Ying Hu, Sheng Li: Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification. ICASSP 2021: 6943-6947
- [c8] Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng: E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523
- [c7] Kai Wang, Hao Huang, Ying Hu, Zhihua Huang, Sheng Li: End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain. Interspeech 2021: 3046-3050
- [c6] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng: Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5
- 2020
- [j4] Zhen Zhang, Hao Huang, Kai Wang: Using Deep Time Delay Neural Network for Slot Filling in Spoken Language Understanding. Symmetry 12(6): 993 (2020)
- [j3] Haibo Geng, Ying Hu, Hao Huang: Monaural Singing Voice and Accompaniment Separation Based on Gated Nested U-Net Architecture. Symmetry 12(6): 1051 (2020)
- [c5] Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng: Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396
- [c4] Ying Zhong, Ying Hu, Hao Huang, Wushour Silamu: A Lightweight Model Based on Separable Convolution for Speech Emotion Recognition. INTERSPEECH 2020: 3331-3335
- [i5] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng: Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020)
- [i4] Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng: A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020)
- [i3] Haobo Zhang, Tingzhi Mao, Haihua Xu, Hao Huang: The NTU-AISG Text-to-speech System for Blizzard Challenge 2020. CoRR abs/2010.11489 (2020)
- [i2] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng: Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020)
2010 – 2019
- 2017
- [i1] Hao Huang, Ying Hu, Haihua Xu: Mandarin tone modeling using recurrent neural networks. CoRR abs/1711.01946 (2017)
- 2016
- [c3] Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li: I-vector based deep neural network acoustic model adaptation using multilingual language resource. APSIPA 2016: 1-5
- [c2] Ying Hu, Liejun Wang, Hao Huang, Gang Zhou: Monaural Singing Voice Separation by Non-negative Matrix Partial Co-Factorization with Temporal Continuity and Sparsity Criteria. ICIC (3) 2016: 33-43
- [c1] Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li: Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. INTERSPEECH 2016: 1315-1319
- 2015
- [j2] Hao Huang, Haihua Xu, Xianhui Wang, Wushour Silamu: Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 787-797 (2015)
2000 – 2009
- 2009
- [j1] Ying Xiong, Jie Zhu, Hao Huang, Haihua Xu: Minimum tag error for discriminative training of conditional random fields. Inf. Sci. 179(1-2): 169-179 (2009)
last updated on 2024-12-19 23:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license