default search action

combined dblp search
author search
venue search
publication search

ask others

Hao Huang 0009

> Home > Persons

Person information

affiliation: Xinjiang Univerity, School of Information Science and Engineering, Xinjiang Provincial Key Laboratory of Multilingual Information Technology, Urumqi, China
affiliation (PhD 2008): Shanghai Jiao Tong University, Department of Electronic Engineering, Shanghai, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MaHHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MaHHH24
Mengzhen Ma, Ying Hu, Liang He, Hao Huang:
GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration. EURASIP J. Audio Speech Music. Process. 2024(1): 34 (2024)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/jifs/ZhaoXLHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jifs/ZhaoXLHS24
Xiaoqing Zhao, Miaomiao Xu, Yanbing Li, Hao Huang, Wushour Silamu:
Scene text recognition with context-aware autonomous bidirectional iterative models. J. Intell. Fuzzy Syst. 46(4): 8605-8616 (2024)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/nca/WuJYLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nca/WuJYLH24
Di Wu, Liting Jiang, Lili Yin, Zhe Li, Hao Huang:
CEA-Net: a co-interactive external attention network for joint intent detection and slot filling. Neural Comput. Appl. 36(22): 13513-13525 (2024)
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WeiHHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WeiHHH24
Wenbing Wei, Ying Hu, Hao Huang, Liang He:
IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration. IEEE Signal Process. Lett. 31: 196-200 (2024)
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuXGHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuXGHH24
Ying Hu, Haitao Xu, Zhongcun Guo, Hao Huang, Liang He:
SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention. ICASSP 2024: 1496-1500
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Song0W0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Song0W0024
Zhida Song, Liang He, Penghao Wang, Ying Hu, Hao Huang:
Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification. ICASSP 2024: 10091-10095
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TangHZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TangHZH24
Minjie Tang, Hao Huang, Wenbo Zhang, Liang He:
Phase Continuity-Aware Self-Attentive Recurrent Network with Adaptive Feature Selection for Robust VAD. ICASSP 2024: 11506-11510
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FengSXWHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FengSXWHS24
Sijie Feng, Haoxiang Su, Hongyan Xie, Di Wu, Hao Huang, Wushour Silamu:
Fact-Aware Summarization with Contrastive Learning for Few-Shot Dialogue State Tracking. ICASSP 2024: 12211-12215
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SuFXWHHSFHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SuFXWHHSFHS24
Haoxiang Su, Sijie Feng, Hongyan Xie, Di Wu, Hao Huang, Zhongjiang He, Shuangyong Song, Ruiyu Fang, Xiaomeng Huang, Wushour Silamu:
Domain-Slot Aware Contrastive Learning for Improved Dialogue State Tracking. ICASSP 2024: 12521-12525
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/SongXSHLHLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/SongXSHLHLF24
Shuangyong Song, Hongyan Xie, Haoxiang Su, Hao Huang, Mengxiang Li, Zhongjiang He, Yongxiang Li, Ruiyu Fang:
Improving Pointer Network based Dialogue State Tracking via Dual Hierarchical Selective Augmentation. IJCNN 2024: 1-8
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/SongZHXSLHLF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/SongZHXSLHLF24
Shuangyong Song, Jinwei Zhang, Hao Huang, Hongyan Xie, Haoxiang Su, Mengxiang Li, Zhongjiang He, Yongxiang Li, Ruiyu Fang:
Graph-based Dynamic Domain Selection for Dialogue State Tracking. IJCNN 2024: 1-7
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/ram/YangCLHC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ram/YangCLHC024
Hongli Yang, Xinyi Chen, Junjie Li, Hao Huang, Siqi Cai, Haizhou Li:
Listen to the Speaker in Your Gaze. CIS-RAM 2024: 380-385
2023
[j12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HuangWYHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HuangWYHH23
Hao Huang, Lin Wang, Jichen Yang, Ying Hu, Liang He:
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision. EURASIP J. Audio Speech Music. Process. 2023(1): 45 (2023)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/WangLPH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/WangLPH23
Kai Wang, Jingjing Liu, Yizhou Peng, Hao Huang:
Neural RAPT: deep learning-based pitch tracking with prior algorithmic knowledge instillation. Int. J. Speech Technol. 26(4): 999-1015 (2023)
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuGYYLDMHCCK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuGYYLDMHCCK23
Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi:
Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech Recognition. ICASSP 2023: 1-5
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QiuFYYSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QiuFYYSH23
Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang:
SRTNET: Time Domain Speech Enhancement via Stochastic Refinement. ICASSP 2023: 1-5
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangYHHL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangYHHL23
Kai Wang, Yuhang Yang, Hao Huang, Ying Hu, Sheng Li:
Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation. ICASSP 2023: 1-5
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangXHCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangXHCL23
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li:
Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition. ICASSP 2023: 1-5
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/QiuGFHHHS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/QiuGFHHHS23
Zhibin Qiu, Yachao Guo, Mengfan Fu, Hao Huang, Ying Hu, Liang He, Fuchun Sun:
CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising. ICME 2023: 1709-1714
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HuHYHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HuHYHH23
Ying Hu, Shijing Hou, Huamin Yang, Hao Huang, Liang He:
A Joint Network Based on Interactive Attention for Speech Emotion Recognition. ICME 2023: 1715-1720
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/NiuCHHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/NiuCHHH23
Fangjing Niu, Tengfei Cao, Ying Hu, Hao Huang, Liang He:
Speech Topic Classification Based on Pre-trained and Graph Networks. ICME 2023: 1721-1726
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/GuoQHS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/GuoQHS23
Yachao Guo, Zhibin Qiu, Hao Huang, Chng Eng Siong:
Improved Keyword Recognition Based on Aho-Corasick Automaton. IJCNN 2023: 1-7
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiXXPLHC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiXXPLHC23
Rui Li, Zhiwei Xie, Haihua Xu, Yizhou Peng, Hexin Liu, Hao Huang, Eng Siong Chng:
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory. INTERSPEECH 2023: 1968-1972
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoHWHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoHWHH23
Yuan Gao, Ying Hu, Liusong Wang, Hao Huang, Liang He:
MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music. INTERSPEECH 2023: 5396-5400
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/mmasia/Chen0LHCH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmasia/Chen0LHCH23
Xiaojiao Chen, Sheng Li, Jiyi Li, Hao Huang, Yang Cao, Liang He:
Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization. MMAsia 2023: 93:1-93:5
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/mmasia/ChenLLCHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmasia/ChenLLCHH23
Xiaojiao Chen, Sheng Li, Jiyi Li, Yang Cao, Hao Huang, Liang He:
GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System. MMAsia 2023: 94:1-94:5
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13796
Zhibin Qiu, Mengfan Fu, Fuchun Sun, Gulila Altenbek, Hao Huang:
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge. CoRR abs/2305.13796 (2023)
2022
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/jiis/ChenHHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jiis/ChenHHH22
Yadong Chen, Ying Hu, Liang He, Hao Huang:
Multi-stage music separation network with dual-branch attention and hybrid convolution. J. Intell. Inf. Syst. 59(3): 635-656 (2022)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/TangHHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/TangHHH22
Yuwu Tang, Ying Hu, Liang He, Hao Huang:
A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition. Speech Commun. 143: 21-32 (2022)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/HuCYHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/HuCYHH22
Ying Hu, Yadong Chen, Wenzhong Yang, Liang He, Hao Huang:
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation. IEEE Signal Process. Lett. 29: 1517-1521 (2022)
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/ccbr/SongHFHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ccbr/SongHFHH22
Zhida Song, Liang He, Zhihua Fang, Ying Hu, Hao Huang:
Virtual Fully-Connected Layer for a Large-Scale Speaker Verification Dataset. CCBR 2022: 382-390
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangPHHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangPHHL22
Kai Wang, Yizhou Peng, Hao Huang, Ying Hu, Sheng Li:
Mining Hard Samples Locally And Globally For Improved Speech Separation. ICASSP 2022: 6037-6041
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PengZXHC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PengZXHC22
Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR. ICASSP 2022: 7807-7811
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/Chen0H22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/Chen0H22
Xiaojiao Chen, Sheng Li, Hao Huang:
GhostVec: Directly Extracting Speaker Embedding from End-to-End Speech Recognition Model Using Adversarial Examples. ICONIP (6) 2022: 482-492
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/LiZLZYH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/LiZLZYH22
Guangxing Li, Wangjin Zhou, Sheng Li, Yi Zhao, Jichen Yang, Hao Huang:
Investigating Effective Domain Adaptation Method for Speaker Verification Task. ICONIP (6) 2022: 517-527
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuZLHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuZLHH22
Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He:
A Multi-grained based Attention Network for Semi-supervised Sound Event Detection. INTERSPEECH 2022: 1531-1535
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuTHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuTHH22
Ying Hu, Yuwu Tang, Hao Huang, Liang He:
A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. INTERSPEECH 2022: 4705-4709
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-03855
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-03855
Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Sadao Kurohashi:
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition. CoRR abs/2204.03855 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-10175
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-10175
Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He:
A Multi-grained based Attention Network for Semi-supervised Sound Event Detection. CoRR abs/2206.10175 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-00940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00940
Ying Hu, Yuwu Tang, Hao Huang, Liang He:
A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. CoRR abs/2207.00940 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04176
Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng:
Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition. CoRR abs/2207.04176 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04177
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04177
Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang:
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder. CoRR abs/2207.04177 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16805
Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang:
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement. CoRR abs/2210.16805 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00325
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li:
Speech-text based multi-modal training with bidirectional attention for improved speech recognition. CoRR abs/2211.00325 (2022)
2021
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/KangHHH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/KangHHH21
Xiao Kang, Hao Huang, Ying Hu, Zhihua Huang:
Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion. Digit. Signal Process. 116: 103110 (2021)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/jifs/GaoH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jifs/GaoH21
Weiqi Gao, Hao Huang:
A gating context-aware text classification model with BERT and graph convolutional networks. J. Intell. Fuzzy Syst. 40(3): 4331-4343 (2021)
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/symmetry/MaHH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/symmetry/MaHH21
Wenfang Ma, Ying Hu, Hao Huang:
Dual Attention Network for Pitch Estimation of Monophonic Music. Symmetry 13(7): 1296 (2021)
[c11]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/MaoKPXHWC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/MaoKPXHWC21
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025
[c10]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/PengZZXHLC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/PengZZXHLC21
Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng:
Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangWHL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangWHL21
Hao Huang, Kai Wang, Ying Hu, Sheng Li:
Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification. ICASSP 2021: 6943-6947
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangPPXHC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangPPXHC21
Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangHHH021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangHHH021
Kai Wang, Hao Huang, Ying Hu, Zhihua Huang, Sheng Li:
End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain. Interspeech 2021: 3046-3050
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/MaoKPXHC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/MaoKPXHC21
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5
2020
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/symmetry/ZhangHW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/symmetry/ZhangHW20
Zhen Zhang, Hao Huang, Kai Wang:
Using Deep Time Delay Neural Network for Slot Filling in Spoken Language Understanding. Symmetry 12(6): 993 (2020)
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/symmetry/GengHH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/symmetry/GengHH20
Haibo Geng, Ying Hu, Hao Huang:
Monaural Singing Voice and Accompaniment Separation Based on Gated Nested U-Net Architecture. Symmetry 12(6): 1051 (2020)
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangXPHC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangXPHC20
Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhongHHS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhongHHS20
Ying Zhong, Ying Hu, Hao Huang, Wushour Silamu:
A Lightweight Model Based on Separable Convolution for Speech Emotion Recognition. INTERSPEECH 2020: 3331-3335
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08742
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08742
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11483
Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11489
Haobo Zhang, Tingzhi Mao, Haihua Xu, Hao Huang:
The NTU-AISG Text-to-speech System for Blizzard Challenge 2020. CoRR abs/2010.11489 (2020)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12143
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2017
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01946
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01946
Hao Huang, Ying Hu, Haihua Xu:
Mandarin tone modeling using recurrent neural networks. CoRR abs/1711.01946 (2017)
2016
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuRXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuRXHCL16
Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
I-vector based deep neural network acoustic model adaptation using multilingual language resource. APSIPA 2016: 1-5
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icic/HuWHZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icic/HuWHZ16
Ying Hu, Liejun Wang, Hao Huang, Gang Zhou:
Monaural Singing Voice Separation by Non-negative Matrix Partial Co-Factorization with Temporal Continuity and Sparsity Criteria. ICIC (3) 2016: 33-43
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuSNXHCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuSNXHCL16
Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li:
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. INTERSPEECH 2016: 1315-1319
2015
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HuangXWS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HuangXWS15
Hao Huang, Haihua Xu, Xianhui Wang, Wushour Silamu:
Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 787-797 (2015)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/isci/XiongZHX09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/isci/XiongZHX09
Ying Xiong, Jie Zhu, Hao Huang, Haihua Xu:
Minimum tag error for discriminative training of conditional random fields. Inf. Sci. 179(1-2): 169-179 (2009)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.