default search action
Zheng-Hua Tan
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j83]Andreas Jonas Fuglsig, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jesper Jensen, Jens Christian Lindof, Jan Østergaard:
Joint Far- and Near-End Speech and Listening Enhancement With Minimum Processing. IEEE Access 12: 119983-120004 (2024) - [j82]Georg Ørnskov Rønsch, Iván López-Espejo, Daniel Michelsanti, Yuying Xie, Petar Popovski, Zheng-Hua Tan:
Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds. Int. J. Comput. Integr. Manuf. 37(4): 438-453 (2024) - [j81]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems. IEEE Signal Process. Lett. 31: 2225-2229 (2024) - [j80]Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework. IEEE Signal Process. Lett. 31: 2520-2524 (2024) - [j79]Mathias Bach Pedersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen:
Data-Driven Non-Intrusive Speech Intelligibility Prediction Using Speech Presence Probability. IEEE ACM Trans. Audio Speech Lang. Process. 32: 55-67 (2024) - [j78]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Investigating the Design Space of Diffusion Models for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4486-4500 (2024) - [c126]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems including Stable RNNs. AAAI 2024: 11901-11909 - [c125]Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach:
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. EUSIPCO 2024: 436-440 - [c124]M. Asjid Tanveer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard:
Envelope Based Deep Source Separation and EEG Auditory Attention Decoding for Speech and Music. EUSIPCO 2024: 872-876 - [c123]Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-End Listening Enhancement. ICASSP Workshops 2024: 485-489 - [c122]Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-Supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. ICASSP 2024: 10126-10130 - [c121]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. ICASSP 2024: 10431-10435 - [c120]Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners. ICLR 2024 - [c119]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models. ICML 2024 - [c118]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Complex Recurrent Variational Autoencoder for Speech Resynthesis and Enhancement. IJCNN 2024: 1-7 - [c117]Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen:
Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter. IWAENC 2024: 444-448 - [i70]Jacob Mørk, Holger Severin Bovbjerg, Gergely Kiss, Zheng-Hua Tan:
Noise-Robust Keyword Spotting through Self-supervised Pretraining. CoRR abs/2403.18560 (2024) - [i69]Sarthak Yadav, Zheng-Hua Tan:
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations. CoRR abs/2406.02178 (2024) - [i68]Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Zero-Shot Audio Captioning Using Soft and Hard Prompts. CoRR abs/2406.06295 (2024) - [i67]Sarthak Yadav, Sergios Theodoridis, Zheng-Hua Tan:
Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs. CoRR abs/2408.16568 (2024) - [i66]Gustav Wagner Zakarias, Lars Kai Hansen, Zheng-Hua Tan:
BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning. CoRR abs/2410.02387 (2024) - 2023
- [j77]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Explicit construction of the minimum error variance estimator for stochastic LTI-ss systems. Autom. 153: 111018 (2023) - [j76]Iván López-Espejo, Amin Edraki, Wai-Yip Chan, Zheng-Hua Tan, Jesper Jensen:
On the deficiency of intelligibility metrics as proxies for subjective intelligibility. Speech Commun. 150: 9-22 (2023) - [j75]Sharon Gannot, Zheng-Hua Tan, Martin Haardt, Nancy F. Chen, Hoi-To Wai, Ivan Tashev, Walter Kellermann, Justin Dauwels:
Data Science Education: The Signal Processing Perspective [SP Education]. IEEE Signal Process. Mag. 40(7): 89-93 (2023) - [j74]Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Minimum Processing Near-End Listening Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2233-2245 (2023) - [j73]Yiming Zhang, Hong Yu, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma, Yuan Dong:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023) - [j72]Zhanyu Ma, Xiaoou Lu, Jiyang Xie, Zhen Yang, Jing-Hao Xue, Zheng-Hua Tan, Bo Xiao, Jun Guo:
On the Comparisons of Decorrelation Approaches for Non-Gaussian Neutral Vector Variables. IEEE Trans. Neural Networks Learn. Syst. 34(4): 1823-1837 (2023) - [c116]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Improved Disentangled Speech Representations Using Contrastive Learning in Factorized Hierarchical Variational Autoencoder. EUSIPCO 2023: 1330-1334 - [c115]Holger Severin Bovbjerg, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining. ICASSP Workshops 2023: 1-5 - [c114]Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting. ICASSP 2023: 1-5 - [c113]Daniel Michelsanti, Zheng-Hua Tan, Sergi Rotger-Griful, Jesper Jensen:
A Vision-Assisted Hearing Aid System Based on Deep Learning. ICASSP Workshops 2023: 1-4 - [c112]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho:
Radio Sensing with Large Intelligent Surface for 6G. ICASSP 2023: 1-5 - [c111]Juan Felipe Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. INTERSPEECH 2023: 4459-4463 - [i65]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihály Petreczky:
PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss. CoRR abs/2303.16816 (2023) - [i64]Juan F. Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen:
Speech inpainting: Context-based speech synthesis guided by video. CoRR abs/2306.00489 (2023) - [i63]Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan:
Masked Autoencoders with Multi-Window Attention Are Better Audio Learners. CoRR abs/2306.00561 (2023) - [i62]Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Joint Minimum Processing Beamforming and Near-end Listening Enhancement. CoRR abs/2309.11243 (2023) - [i61]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler. CoRR abs/2312.02683 (2023) - [i60]Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May:
Investigating the Design Space of Diffusion Models for Speech Enhancement. CoRR abs/2312.04370 (2023) - [i59]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs. CoRR abs/2312.09793 (2023) - [i58]Holger Severin Bovbjerg, Jesper Jensen, Jan Østergaard, Zheng-Hua Tan:
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions. CoRR abs/2312.16613 (2023) - 2022
- [j71]Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. IEEE Access 10: 4169-4199 (2022) - [j70]Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
The Minimum Overlap-Gap Algorithm for Speech Enhancement. IEEE Access 10: 14698-14716 (2022) - [j69]Bjørn Uttrup Dideriksen, Kristoffer Derosche, Zheng-Hua Tan:
iVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning. IEEE Access 10: 48405-48418 (2022) - [j68]Mathias Bach Pedersen, Asger Heidemann Andersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen:
Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data. IEEE Access 10: 66175-66189 (2022) - [j67]Jiyang Xie, Zhanyu Ma, Jianjun Lei, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Advanced Dropout: A Model-Free Methodology for Bayesian Dropout Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4605-4625 (2022) - [j66]Poul Hoang, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 30: 706-720 (2022) - [c110]Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. EUSIPCO 2022: 1916-1920 - [c109]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting. IberSPEECH 2022: 131-135 - [c108]Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index. ICASSP 2022: 7752-7756 - [c107]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c106]Claus M. Larsen, Peter Koch, Zheng-Hua Tan:
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay. INTERSPEECH 2022: 3759-3763 - [c105]Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Floor Map Reconstruction Through Radio Sensing and Learning by a Large Intelligent Surface. MLSP 2022: 1-6 - [c104]Chien-Cheng Wu, Zheng-Hua Tan, Cedomir Stefanovic:
AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning. VTC Spring 2022: 1-6 - [i57]Achintya Kumar Sarkar, Zheng-Hua Tan:
On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification. CoRR abs/2201.06426 (2022) - [i56]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i55]Cristian J. Vaca-Rubio, Dariush Salami, Petar Popovski, Elisabeth de Carvalho, Zheng-Hua Tan, Stephan Sigg:
User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars. CoRR abs/2205.10321 (2022) - [i54]Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface. CoRR abs/2206.10750 (2022) - [i53]Holger Severin Bovbjerg, Zheng-Hua Tan:
Improving Label-Deficient Keyword Spotting Using Self-Supervised Pretraining. CoRR abs/2210.01703 (2022) - [i52]Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard:
Minimum Processing Near-end Listening Enhancement. CoRR abs/2210.17154 (2022) - [i51]Christian Heider Nielsen, Zheng-Hua Tan:
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise. CoRR abs/2211.01621 (2022) - [i50]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder. CoRR abs/2211.08191 (2022) - [i49]Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise. CoRR abs/2211.10565 (2022) - [i48]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models. CoRR abs/2212.14838 (2022) - 2021
- [j65]Achintya Kumar Sarkar, Zheng-Hua Tan:
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification. Comput. Speech Lang. 70: 101229 (2021) - [j64]Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jun Guo:
Deep InterBoost networks for small-sample image classification. Neurocomputing 456: 492-503 (2021) - [j63]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Assessing Wireless Sensing Potential With Large Intelligent Surfaces. IEEE Open J. Commun. Soc. 2: 934-947 (2021) - [j62]Achintya Kumar Sarkar, Zheng-Hua Tan:
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification With Autoregressive Prediction Coding. IEEE Signal Process. Lett. 28: 364-368 (2021) - [j61]Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1368-1396 (2021) - [j60]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2254-2266 (2021) - [c103]Chien-Cheng Wu, Petar Popovski, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler Using Reinforcement Learning. 5GWF 2021: 176-181 - [c102]Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing. ASRU 2021: 679-686 - [c101]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CDC 2021: 6626-6633 - [c100]Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Jesper Jensen:
Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming. ICASSP 2021: 6119-6123 - [c99]Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. ICASSP 2021: 6653-6657 - [c98]Morten Østergaard Nielsen, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan:
Compression of DNNs Using Magnitude Pruning and Nonlinear Information Bottleneck Training. MLSP 2021: 1-6 - [c97]Yuying Xie, Thomas Arildsen, Zheng-Hua Tan:
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective. MLSP 2021: 1-6 - [c96]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021: 323-329 - [c95]Anders E. Kalør, Daniel Michelsanti, Federico Chiariotti, Zheng-Hua Tan, Petar Popovski:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. SPAWC 2021: 251-255 - [i47]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan:
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification. CoRR abs/2102.02074 (2021) - [i46]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihály Petreczky:
PAC-Bayesian theory for stochastic LTI systems. CoRR abs/2103.12866 (2021) - [i45]Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On TasNet for Low-Latency Single-Speaker Speech Enhancement. CoRR abs/2103.14882 (2021) - [i44]Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang:
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing. CoRR abs/2104.00960 (2021) - [i43]Max Væhrens, Andreas Jonas Fuglsig, Anders Post Jacobsen, Nicolai Almskou Rasmussen, Victor Mølbach Nissen, Joachim Roland Hejslet, Zheng-Hua Tan:
Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing. CoRR abs/2104.05481 (2021) - [i42]Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky:
Optimal Prediction of Unmeasured Output from Measurable Outputs In LTI Systems. CoRR abs/2109.02384 (2021) - [i41]Anders E. Kalør, Daniel Michelsanti, Federico Chiariotti, Zheng-Hua Tan, Petar Popovski:
Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices. CoRR abs/2110.05757 (2021) - [i40]Chien-Cheng Wu, Petar Popovski, Zheng-Hua Tan, Cedomir Stefanovic:
Design of AoI-Aware 5G Uplink Scheduler UsingReinforcement Learning. CoRR abs/2110.09995 (2021) - [i39]Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, Lars Søndergaard Bertelsen, Peter Mariager, Zheng-Hua Tan:
Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index. CoRR abs/2111.07759 (2021) - [i38]Iván López-Espejo, Zheng-Hua Tan, John H. L. Hansen, Jesper Jensen:
Deep Spoken Keyword Spotting: An Overview. CoRR abs/2111.10592 (2021) - 2020
- [j59]Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020) - [j58]Bhaskar D. Rao, Zheng-Hua Tan:
Highlights From the Machine Learning for Signal Processing Technical Committee [In the Spotlight]. IEEE Signal Process. Mag. 37(6): 200-202 (2020) - [j57]Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 825-838 (2020) - [j56]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1233-1247 (2020) - [j55]Juan M. Martín-Doñas, Jesper Jensen, Zheng-Hua Tan, Angel M. Gomez, Antonio M. Peinado:
Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3080-3094 (2020) - [j54]Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo:
OSLNet: Deep Small-Sample Classification With an Orthogonal Softmax Layer. IEEE Trans. Image Process. 29: 6482-6495 (2020) - [j53]Miklas Strøm Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
The Importance of Context When Recommending TV Content: Dataset and Algorithms. IEEE Trans. Multim. 22(6): 1531-1541 (2020) - [c94]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Robin Jess Williams, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CrownCom 2020: 126-138 - [c93]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. EUSIPCO 2020: 331-335 - [c92]Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan:
Adversarial Example Detection by Classification for Deep Speech Recognition. ICASSP 2020: 3102-3106 - [c91]Poul Hoang, Zheng-Hua Tan, Thomas Lunner, Jan Mark de Haan, Jesper Jensen:
Maximum Likelihood Estimation of the Interference-Plus-Noise Cross Power Spectral Density Matrix for Own Voice Retrieval. ICASSP 2020: 6939-6943 - [c90]Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan:
CC-Loss: Channel Correlation Loss for Image Classification. ICPR 2020: 7601-7608 - [c89]Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. INTERSPEECH 2020: 3530-3534 - [i37]Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Context-Aware Recommendations for Televisions Using Deep Embeddings with Relaxed N-Pairs Loss Objective. CoRR abs/2002.01554 (2020) - [i36]Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen:
Vocoder-Based Speech Synthesis from Silent Videos. CoRR abs/2004.02541 (2020) - [i35]Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo:
OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer. CoRR abs/2004.09033 (2020) - [i34]Achintya Kumar Sarkar, Zheng-Hua Tan:
On Bottleneck Features for Text-Dependent Speaker Verification Using X-vectors. CoRR abs/2005.07383 (2020) - [i33]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Exploring Filterbank Learning for Keyword Spotting. CoRR abs/2006.00217 (2020) - [i32]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Robin Jess Williams, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting. CoRR abs/2006.06563 (2020) - [i31]Achintya Kumar Sarkar, Himangshu Sarma, Priyanka Dwivedi, Zheng-Hua Tan:
Data augmentation enhanced speaker enrollment for text-dependent speaker verification. CoRR abs/2007.08004 (2020) - [i30]Md. Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent:
UIAI System for Short-Duration Speaker Verification Challenge 2020. CoRR abs/2007.13118 (2020) - [i29]Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. CoRR abs/2008.09586 (2020) - [i28]Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen:
Audio-Visual Speech Inpainting with Deep Learning. CoRR abs/2010.04556 (2020) - [i27]Jiyang Xie, Zhanyu Ma, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization. CoRR abs/2010.05244 (2020) - [i26]Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan:
CC-Loss: Channel Correlation Loss For Image Classification. CoRR abs/2010.05469 (2020) - [i25]Cristian J. Vaca-Rubio, Pablo Ramirez-Espinosa, Kimmo Kansanen, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski:
Assessing Wireless Sensing Potential with Large Intelligent Surfaces. CoRR abs/2011.08465 (2020) - [i24]Achintya Kumar Sarkar, Zheng-Hua Tan:
Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding. CoRR abs/2011.12536 (2020)
2010 – 2019
- 2019
- [j52]Yonggang Qi, Zheng-Hua Tan:
SketchSegNet+: An End-to-End Learning of RNN for Multi-Class Sketch Semantic Segmentation. IEEE Access 7: 102717-102726 (2019) - [j51]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Deep-learning-based audio-visual speech enhancement in presence of Lombard effect. Speech Commun. 115: 38-50 (2019) - [j50]Morten Kolbaek, Zheng-Hua Tan, Jesper Jensen:
On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 27(2): 283-295 (2019) - [j49]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1267-1279 (2019) - [c88]Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Thomas Lunner, Jesper Jensen:
Robust Bayesian and Maximum a Posteriori Beamforming for Hearing Assistive Devices. GlobalSIP 2019: 1-5 - [c87]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Effects of Lombard Reflex on the Performance of Deep-learning-based Audio-visual Speech Enhancement Systems. ICASSP 2019: 6615-6619 - [c86]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
On Training Targets and Objective Functions for Deep-learning-based Audio-visual Speech Enhancement. ICASSP 2019: 8077-8081 - [c85]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers. INTERSPEECH 2019: 3223-3227 - [c84]Jiyang Xie, Zhanyu Ma, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo:
Soft Dropout And Its Variational Bayes Approximation. MLSP 2019: 1-6 - [c83]Andrea Coifman, Péter Rohoska, Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Subjective Annotations for Vision-based Attention Level Estimation. VISIGRAPP (5: VISAPP) 2019: 249-256 - [i23]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. CoRR abs/1905.04554 (2019) - [i22]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect. CoRR abs/1905.12605 (2019) - [i21]Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method. CoRR abs/1906.03588 (2019) - [i20]Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
Keyword Spotting for Hearing Assistive Devices Robust to External Speakers. CoRR abs/1906.09417 (2019) - [i19]Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen, Jesper Jensen:
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement. CoRR abs/1909.01019 (2019) - [i18]Miklas S. Kristoffersen, Jacob L. Wieland, Sven Ewan Shepstone, Zheng-Hua Tan, Vinoba Vinayagamoorthy:
Deep Joint Embeddings of Context and Content for Recommendation. CoRR abs/1909.06076 (2019) - [i17]Saeid Samizade, Zheng-Hua Tan, Chao Shen, Xiaohong Guan:
Adversarial Example Detection by Classification for Deep Speech Recognition. CoRR abs/1910.10013 (2019) - 2018
- [j48]Achintya Kumar Sarkar, Zheng-Hua Tan:
Incorporating pass-phrase dependent background models for text-dependent speaker verification. Comput. Speech Lang. 47: 259-271 (2018) - [j47]Zhanyu Ma, Jen-Tzung Chien, Zheng-Hua Tan, Yi-Zhe Song, Jalil Taghia, Ming Xiao:
Recent advances in machine learning for non-Gaussian data processing. Neurocomputing 278: 1-3 (2018) - [j46]Jen-Tzung Chien, Chao-Hsi Lee, Zheng-Hua Tan:
Latent Dirichlet mixture model. Neurocomputing 278: 12-22 (2018) - [j45]Zheng-Hua Tan, Nicolai Bæk Thomsen, Xiaodong Duan, Evgenios Vlachos, Sven Ewan Shepstone, Morten Højfeldt Rasmussen, Jesper Lisby Højvang:
iSocioBot: A Multimodal Interactive Social Robot. Int. J. Soc. Robotics 10(1): 5-19 (2018) - [j44]Xiaodong Duan, Zheng-Hua Tan:
A spatial self-similarity based feature learning method for face recognition under varying poses. Pattern Recognit. Lett. 111: 109-116 (2018) - [j43]Renhua Peng, Zheng-Hua Tan, Xiaodong Li, Chengshi Zheng:
A perceptually motivated LP residual estimator in noisy and reverberant environments. Speech Commun. 96: 129-141 (2018) - [j42]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions. Speech Commun. 102: 1-13 (2018) - [j41]Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Audio-Based Granularity-Adapted Emotion Classification. IEEE Trans. Affect. Comput. 9(2): 176-190 (2018) - [j40]Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkänen:
Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 44-56 (2018) - [j39]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Bias-Compensated Informed Sound Source Localization Using Relative Transfer Functions. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1271-1285 (2018) - [j38]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1925-1939 (2018) - [j37]Sven Ewan Shepstone, Zheng-Hua Tan, Miklas S. Kristoffersen:
Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users. IEEE Trans. Consumer Electron. 64(1): 11-18 (2018) - [j36]Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, Jun Guo:
Decorrelation of Neutral Vector Variables: Theory and Applications. IEEE Trans. Neural Networks Learn. Syst. 29(1): 129-143 (2018) - [j35]Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Rainer Martin, Jun Guo:
Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features. IEEE Trans. Neural Networks Learn. Syst. 29(10): 4633-4644 (2018) - [j34]Jun Guo, Zheng-Hua Tan, Sung Ho Cho, Guoqiang Zhang:
Wireless Personal Communications: Machine Learning for Big Data Processing in Mobile Internet. Wirel. Pers. Commun. 102(3): 2093-2098 (2018) - [c82]Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Monaural Speech Enhancement Using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure. ICASSP 2018: 5059-5063 - [c81]Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. INTERSPEECH 2018: 1823-1827 - [c80]Evgenios Vlachos, Zheng-Hua Tan:
Public perception of android robots: Indications from an analysis of YouTube comments. IROS 2018: 1255-1260 - [c79]Gabriele Trovato, Renato Paredes, Javier Balvin, Francisco Cuéllar, Nicolai Bæk Thomsen, Søren Bech, Zheng-Hua Tan:
The Sound or Silence: Investigating the Influence of Robot Noise on Proxemics. RO-MAN 2018: 713-718 - [c78]Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
A Dataset for Inferring Contextual Preferences of Users Watching TV. UMAP 2018: 367-368 - [i16]Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure. CoRR abs/1802.00604 (2018) - [i15]Ioannis T. Christou, Emmanouil Amolochitis, Zheng-Hua Tan:
A Parallel/Distributed Algorithmic Framework for Mining All Quantitative Association Rules. CoRR abs/1804.06764 (2018) - [i14]Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
On the Equivalence between Objective Intelligibility and Mean-Squared Error for Deep Neural Network based Speech Enhancement. CoRR abs/1806.08404 (2018) - [i13]Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
The Importance of Context When Recommending TV Content: Dataset and Algorithms. CoRR abs/1808.00337 (2018) - [i12]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
On Training Targets and Objective Functions for Deep-Learning-Based Audio-Visual Speech Enhancement. CoRR abs/1811.06234 (2018) - [i11]Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, Jesper Jensen:
Effects of Lombard Reflex on the Performance of Deep-Learning-Based Audio-Visual Speech Enhancement Systems. CoRR abs/1811.06250 (2018) - [i10]Andrea Coifman, Péter Rohoska, Miklas S. Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan:
Subjective Annotations for Vision-Based Attention Level Estimation. CoRR abs/1812.04949 (2018) - 2017
- [j33]Hong Yu, Zheng-Hua Tan, Yiming Zhang, Zhanyu Ma, Jun Guo:
DNN Filter Bank Cepstral Coefficients for Spoofing Detection. IEEE Access 5: 4779-4787 (2017) - [j32]Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 149-163 (2017) - [j31]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed Sound Source Localization Using Relative Transfer Functions for Hearing Aid Applications. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 611-623 (2017) - [j30]Morten Kolbaek, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1901-1913 (2017) - [j29]Swati Prasad, Zheng-Hua Tan, Ramjee Prasad:
Frame Selection for Robust Speaker Identification: A Hybrid Approach. Wirel. Pers. Commun. 97(1): 933-950 (2017) - [j28]Stefanos Astaras, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Visual Detection of Events of Interest from Urban Activity. Wirel. Pers. Commun. 97(2): 1877-1888 (2017) - [c77]Dong Yu, Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Permutation invariant training of deep models for speaker-independent multi-talker speech separation. ICASSP 2017: 241-245 - [c76]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A non-intrusive Short-Time Objective Intelligibility measure. ICASSP 2017: 5085-5089 - [c75]Tomi Kinnunen, Md. Sahidullah, Mauro Falcone, Luca Costantini, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Achintya Kumar Sarkar, Zheng-Hua Tan, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Ville Hautamäki, Kong-Aik Lee:
RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research. ICASSP 2017: 5395-5399 - [c74]Xiaodong Duan, Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Weighted Score Based Fast Converging CO-training with Application to Audio-Visual Person Identification. ICTAI 2017: 610-617 - [c73]Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332 - [c72]Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification. INTERSPEECH 2017: 1492-1496 - [c71]Daniel Michelsanti, Zheng-Hua Tan:
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification. INTERSPEECH 2017: 2008-2012 - [c70]Achintya Kumar Sarkar, Md. Sahidullah, Zheng-Hua Tan, Tomi Kinnunen:
Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data. INTERSPEECH 2017: 2611-2615 - [c69]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
On the Use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure. INTERSPEECH 2017: 2963-2967 - [c68]Morten Kolbaek, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Joint separation and denoising of noisy multi-talker speech using recurrent neural networks and permutation invariant training. MLSP 2017: 1-6 - [i9]Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
DNN Filter Bank Cepstral Coefficients for Spoofing Detection. CoRR abs/1702.03791 (2017) - [i8]Morten Kolbæk, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Multi-talker Speech Separation and Tracing with Permutation Invariant Training of Deep Recurrent Neural Networks. CoRR abs/1703.06284 (2017) - [i7]Achintya Kumar Sarkar, Zheng-Hua Tan:
Time-Contrastive Learning Based Unsupervised DNN Feature Extraction for Speaker Verification. CoRR abs/1704.02373 (2017) - [i6]Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, Jun Guo:
Decorrelation of Neutral Vector Variables: Theory and Applications. CoRR abs/1705.10524 (2017) - [i5]Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification. CoRR abs/1706.03397 (2017) - [i4]Morten Kolbæk, Dong Yu, Zheng-Hua Tan, Jesper Jensen:
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training. CoRR abs/1708.09588 (2017) - [i3]Daniel Michelsanti, Zheng-Hua Tan:
Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification. CoRR abs/1709.01703 (2017) - 2016
- [j27]Zhanyu Ma, Hong Yu, Zheng-Hua Tan, Jun Guo:
Text-Independent Speaker Identification Using the Histogram Transform Model. IEEE Access 4: 9733-9739 (2016) - [j26]Zhanyu Ma, Zheng-Hua Tan, Jun Guo:
Feature selection for neutral vector in EEG signal classification. Neurocomputing 174: 937-945 (2016) - [j25]Elizabeth Ann Jochum, Evgenios Vlachos, Anja Christoffersen, Sally Grindsted Nielsen, Ibrahim A. Hameed, Zheng-Hua Tan:
Using Theatre to Study Interaction with Care Robots. Int. J. Soc. Robotics 8(4): 457-470 (2016) - [j24]Ioannis T. Christou, Emmanouil Amolochitis, Zheng-Hua Tan:
AMORE: design and implementation of a commercial-strength parallel hybrid movie recommendation engine. Knowl. Inf. Syst. 47(3): 671-696 (2016) - [j23]Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Total Variability Modeling Using Source-Specific Priors. IEEE ACM Trans. Audio Speech Lang. Process. 24(3): 504-517 (2016) - [j22]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 1908-1920 (2016) - [j21]Nikolaos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan, Ramjee Prasad:
Improved Gaussian Mixture Models for Adaptive Foreground Segmentation. Wirel. Pers. Commun. 87(3): 629-643 (2016) - [c67]Mojtaba Farmani, Richard Heusdens, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Concurrent localization of sound sources and dual-microphone sub-arrays using TOFs. FUSION 2016: 1931-1936 - [c66]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed Direction of Arrival estimation using a spherical-head model for Hearing Aid applications. ICASSP 2016: 360-364 - [c65]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A method for predicting the intelligibility of noisy and non-linearly enhanced binaural speech. ICASSP 2016: 4995-4999 - [c64]Hengwei Lin, Josep M. Guerrero, Chenxi Jia, Zheng-Hua Tan, Juan C. Vasquez, Chengxi Liu:
Adaptive overcurrent protection for microgrids in extensive distribution systems. IECON 2016: 4042-4047 - [c63]Achintya Kumar Sarkar, Zheng-Hua Tan:
Text Dependent Speaker Verification Using Un-Supervised HMM-UBM and Temporal GMM-UBM. INTERSPEECH 2016: 425-429 - [c62]Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Kumar Sarkar, Nicolai Bæk Thomsen, Ville Hautamäki, Nicholas W. D. Evans, Zheng-Hua Tan:
Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus. INTERSPEECH 2016: 430-434 - [c61]Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas W. D. Evans, Zheng-Hua Tan:
Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015. INTERSPEECH 2016: 1700-1704 - [c60]Md. Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts, Martti Pitkänen:
Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech. INTERSPEECH 2016: 1720-1724 - [c59]Nicolai Bæk Thomsen, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Speaker-Dependent Dictionary-Based Speech Enhancement for Text-Dependent Speaker Verification. INTERSPEECH 2016: 1839-1843 - [c58]Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Alexander Lehmann Thomsen, Md. Sahidullah, Zheng-Hua Tan:
HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors. INTERSPEECH 2016: 2992-2996 - [c57]Zongji Sun, Li Meng, Aladdin M. Ariyaeeinia, Xiaodong Duan, Zheng-Hua Tan:
Privacy protection performance of De-identified face images with and without background. MIPRO 2016: 1354-1359 - [c56]Jen-Tzung Chien, Chao-Hsi Lee, Zheng-Hua Tan:
Dirichlet mixture allocation. MLSP 2016: 1-6 - [c55]Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Achintya Kumar Sarkar, Nicholas W. D. Evans, Tomi Kinnunen, Zheng-Hua Tan:
Further optimisations of constant Q cepstral processing for integrated utterance and text-dependent speaker verification. SLT 2016: 179-185 - [c54]Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification. SLT 2016: 305-311 - [c53]Mohamed Abou-Zleikha, Mads Græsbøll Christensen, Zheng-Hua Tan, Søren Holdt Jensen:
Projecting emotional speech into arousal-valence space using pairwise preference learning. SPLINE 2016: 1-5 - [c52]Stefanos Astaras, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Background subtraction for patterns of activities in cities. SPLINE 2016: 1-5 - [c51]Nicolai Bæk Thomsen, Xiaodong Duan, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Improving the convergence of co-training for audio-visual person identification. SPLINE 2016: 1-5 - [c50]Hong Yu, Achintya Kumar Sarkar, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Effect of multi-condition training and speech enhancement methods on spoofing detection. SPLINE 2016: 1-5 - [i2]Dong Yu, Morten Kolbæk, Zheng-Hua Tan, Jesper Jensen:
Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation. CoRR abs/1607.00325 (2016) - [i1]Achintya Kumar Sarkar, Zheng-Hua Tan:
Incorporating Pass-Phrase Dependent Background Models for Text Dependent Speaker Verification. CoRR abs/1611.06423 (2016) - 2015
- [j20]Yonggang Qi, Jun Guo, Yi-Zhe Song, Tao Xiang, Honggang Zhang, Zheng-Hua Tan:
Im2Sketch: Sketch generation by unconflicted perceptual grouping. Neurocomputing 165: 338-349 (2015) - [j19]Jesper Jensen, Zheng-Hua Tan:
Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral Features-A Theoretically Consistent Approach. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 186-197 (2015) - [c49]Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
A discriminative approach for speaker selection in speaker de-identification systems. EUSIPCO 2015: 2102-2106 - [c48]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Informed TDoA-based direction of arrival estimation for hearing aid applications. GlobalSIP 2015: 953-957 - [c47]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
Maximum likelihood approach to "informed" Sound Source Localization for Hearing Aid applications. ICASSP 2015: 16-20 - [c46]Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen:
On the influence of microphone array geometry on HRTF-based Sound Source Localization. ICASSP 2015: 439-443 - [c45]Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen:
Source-specific informative prior for i-vector extraction. ICASSP 2015: 4185-4189 - [c44]Xiaodong Duan, Zheng-Hua Tan:
A feature subtraction method for image based kinship verification under uncontrolled environments. ICIP 2015: 1573-1577 - [c43]Xiaodong Duan, Zheng-Hua Tan:
Local feature learning for face recognition under varying poses. ICIP 2015: 2905-2909 - [c42]Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen:
A binaural short time objective intelligibility measure for noisy and enhanced speech. INTERSPEECH 2015: 2563-2567 - [c41]Ivan Kraljevski, Zheng-Hua Tan, Maria Paola Bissiri:
Comparison of forced-alignment speech recognition and humans for generating reference VAD. INTERSPEECH 2015: 2937-2941 - [c40]Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
A heuristic approach for a social robot to navigate to a person based on audio and range information. IROS 2015: 5884-5890 - [c39]Xiaodong Duan, Zheng-Hua Tan:
Neighbors Based Discriminative Feature Difference Learning for Kinship Verification. ISVC (2) 2015: 258-267 - [c38]Clara Schaarup, Gunnar Hartvigsen, Lars Bo Larsen, Zheng-Hua Tan, Eirik Årsand, Ole Kristian Hejlesen:
Assessing the Potential Use of Eye-Tracking Triangulation for Evaluating the Usability of an Online Diabetes Exercise System. MedInfo 2015: 84-88 - [c37]Rasmus Lyngby Kristensen, Zheng-Hua Tan, Zhanyu Ma, Jun Guo:
Binary pattern flavored feature extractors for Facial Expression Recognition: An overview. MIPRO 2015: 1131-1137 - 2014
- [j18]Zheng-Hua Tan, Ivan Kraljevski:
Joint variable frame rate and length analysis for speech recognition under adverse conditions. Comput. Electr. Eng. 40(7): 2139-2149 (2014) - [j17]Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan:
Implementing a Commercial-Strength Parallel Hybrid Movie Recommendation Engine. IEEE Intell. Syst. 29(2): 92-96 (2014) - [j16]Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Using Audio-Derived Affective Offset to Enhance TV Recommendation. IEEE Trans. Multim. 16(7): 1999-2010 (2014) - [j15]Zhanyu Ma, Arne Leijon, Zheng-Hua Tan, Sheng Gao:
Predictive Distribution of the Dirichlet Mixture Model by Local Variational Inference. J. Signal Process. Syst. 74(3): 359-374 (2014) - [j14]Nikos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan, Ramjee Prasad:
Combination of Multiple Measurement Cues for Visual Face Tracking. Wirel. Pers. Commun. 78(3): 1789-1810 (2014) - [c36]Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
Cluster-based adaptation using density forest for HMM phone recognition. EUSIPCO 2014: 2065-2069 - [c35]Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen:
Utilising Tree-Based Ensemble Learning for Speaker Segmentation. AIAI 2014: 50-59 - [c34]Nicolai Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
Improving Robustness Against Environmental Sounds for Directing Attention of Social Robots. MA3HMI@INTERSPEECH 2014: 25-34 - 2013
- [j13]Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan, Ramjee Prasad:
A heuristic hierarchical scheme for academic search and retrieval. Inf. Process. Manag. 49(6): 1326-1343 (2013) - [j12]Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Audio-based age and gender identification to enhance the recommendation of TV content. IEEE Trans. Consumer Electron. 59(3): 721-729 (2013) - [c33]Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772 - [c32]Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen:
Demographic recommendation by means of group profile elicitation using speaker age and gender recognition. INTERSPEECH 2013: 2827-2831 - [c31]Morten Højfeldt Rasmussen, Zheng-Hua Tan:
Fusing eye-gaze and speech recognition for tracking in an automatic reading tutor - a step in the right direction? SLaTE 2013: 112-115 - [c30]Yonggang Qi, Jun Guo, Yi Li, Honggang Zhang, Tao Xiang, Yi-Zhe Song, Zheng-Hua Tan:
Perceptual grouping via untangling Gestalt principles. VCIP 2013: 1-6 - [c29]Swati Prasad, Zheng-Hua Tan, Ramjee Prasad:
Multi-frame rate based multiple-model training for robust speaker identification of disguised voice. WPMC 2013: 1-4 - 2012
- [j11]Weichuan Yu, Zheng-Hua Tan, Yi Wan:
Guest Editors' Introduction to the Special Issue on "New Trends in Signal Processing and Biomedical Engineering". Comput. Electr. Eng. 38(1): 1-2 (2012) - [j10]Pejman Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
A Joint Approach for Single-Channel Speaker Identification and Speech Separation. IEEE Trans. Speech Audio Process. 20(9): 2586-2601 (2012) - [c28]Emmanouil Amolochitis, Ioannis T. Christou, Zheng-Hua Tan:
PubSearch - A Hierarchical Heuristic Scheme for Ranking Academic Search Results. ICPRAM (2) 2012: 509-514 - [c27]Zhanyu Ma, Zheng-Hua Tan, Swati Prasad:
EEG signal classification with super-Dirichlet mixture model. SSP 2012: 440-443 - 2011
- [j9]Hristijan Petreski, Sofia Tsekeridou, Eri Giannaka, Neeli Rashmi Prasad, Ramjee Prasad, Zheng-Hua Tan:
Technology-enabled social learning: a review. Int. J. Knowl. Learn. 7(3/4): 253-270 (2011) - [j8]Theodore Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan, Ramjee Prasad:
Convex Combination of Multiple Statistical Models With Application to VAD. IEEE Trans. Speech Audio Process. 19(8): 2314-2327 (2011) - [c26]Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen:
Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge. INTERSPEECH 2011: 677-680 - [c25]Theodore Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad:
Multi-Sensor Voice Activity Detection Based on Multiple Observation Hypothesis Testing. INTERSPEECH 2011: 2633-2636 - [c24]Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan, Ramjee Prasad:
Mobile video annotation for enhanced rich media communication during emergency handling. ISABEL 2011: 32:1-32:5 - [c23]Menelaos Bakopoulos, Sofia Tsekeridou, Eri Giannaka, Zheng-Hua Tan, Ramjee Prasad:
Command & control: Information merging, selective visualization and decision support for emergency handling. ISCRAM 2011 - [c22]Morten Højfeldt Rasmussen, Jack Mostow, Zheng-Hua Tan, Børge Lindberg, Yuanpeng Li:
Evaluating tracking accuracy of an automatic reading tutor. SLaTE 2011: 17-20 - [c21]Morten Højfeldt Rasmussen, Børge Lindberg, Zheng-Hua Tan:
Combining acoustic and language model miscue detection methods for adult dyslexic read speech. SLaTE 2011: 21-24 - [c20]Swati Prasad, Zheng-Hua Tan, Ramjee Prasad, Alvaro Fuentes Cabrera, Ying Gu, Kim Dremstrup:
Feature selection strategy for classification of single-trial EEG elicited by motor imagery. WPMC 2011: 1-4 - 2010
- [j7]Zheng-Hua Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass, Maurizio Omologo:
Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments. IEEE J. Sel. Top. Signal Process. 4(5): 769-771 (2010) - [j6]Zheng-Hua Tan, Børge Lindberg:
Low-Complexity Variable Frame Rate Analysis for Speech Recognition and Voice Activity Detection. IEEE J. Sel. Top. Signal Process. 4(5): 798-807 (2010) - [c19]Francesco Santoro, Sergio Pedro, Zheng-Hua Tan, Thomas B. Moeslund:
Crowd analysis by using optical flow and density based clustering. EUSIPCO 2010: 269-273 - [c18]Martina Andersen, Rasmus S. Andersen, Nikos Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan:
Three-dimensional adaptive sensing of people in a multi-camera setup. EUSIPCO 2010: 964-968 - [c17]Pejman Mowlaee, Rahim Saeidi, Zheng-Hua Tan, Mads Græsbøll Christensen, Pasi Fränti, Søren Holdt Jensen:
Joint single-channel speech separation and speaker identification. ICASSP 2010: 4430-4433 - [c16]Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals. ICPR 2010: 4565-4568 - [c15]Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti:
Improving monaural speaker identification by double-talk detection. INTERSPEECH 2010: 1069-1072
2000 – 2009
- 2009
- [c14]Morten Højfeldt Rasmussen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen:
A system for detecting miscues in dyslexic read speech. INTERSPEECH 2009: 1467-1470 - [c13]Zheng-Hua Tan, Børge Lindberg:
High-accuracy, low-complexity voice activity detection based on a posteriori SNR weighted energy. INTERSPEECH 2009: 2231-2234 - [r1]Zheng-Hua Tan:
Audio and Speech Processing for Data Mining. Encyclopedia of Data Warehousing and Mining 2009: 98-103 - 2008
- [j5]Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust Speech Recognition by Nonlocal Means Denoising Processing. IEEE Signal Process. Lett. 15: 701-704 (2008) - [c12]Zheng-Hua Tan, Børge Lindberg:
Speech Recognition on Mobile Devices. WMMP 2008: 221-237 - [c11]Zheng-Hua Tan, Børge Lindberg:
A posteriori SNR weighted energy based variable frame rate analysis for speech recognition. INTERSPEECH 2008: 1024-1027 - 2007
- [j4]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Exploiting Temporal Correlation of Speech for Error Robust and Bandwidth Flexible Distributed Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1391-1403 (2007) - [j3]Haitian Xu, Paul Dalsgaard, Zheng-Hua Tan, Børge Lindberg:
Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation. IEEE Trans. Speech Audio Process. 15(8): 2431-2443 (2007) - 2006
- [j2]Zheng-Hua Tan:
Fuzzy Metagraph and Its Combination with the Indexing Approach in Rule-Based Systems. IEEE Trans. Knowl. Data Eng. 18(6): 829-841 (2006) - [c10]Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework. ICASSP (1) 2006: 1141-1144 - [c9]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust speech recognition over mobile networks using combined weighted viterbi decoding and subvector based error concealment. INTERSPEECH 2006 - 2005
- [j1]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Automatic speech recognition over error-prone wireless networks. Speech Commun. 47(1-2): 220-242 (2005) - [c8]Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Robust speech recognition based on noise and SNR classification - a multiple-model framework. INTERSPEECH 2005: 977-980 - [c7]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg, Haitian Xu:
Robust speech recognition in ubiquitous networking and context-aware computing. INTERSPEECH 2005: 2849-2852 - [c6]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Adaptive Multi-Frame-Rate Scheme for Distributed Speech Recognition Based on a Half Frame-Rate Front-End. MMSP 2005: 1-4 - 2004
- [c5]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
A subvector-based error concealment algorithm for speech recognition over mobile networks. ICASSP (1) 2004: 57-60 - [c4]Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
Spectral subtraction with full-wave rectification and likelihood controlled instantaneous noise estimation for robust speech recognition. INTERSPEECH 2004: 2085-2088 - [c3]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
On the integration of speech recognition into personal networks. INTERSPEECH 2004: 2317-2320 - 2003
- [c2]Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
OOV-detection and channel error protection for distributed speech recognition over wireless networks. ICASSP (1) 2003: 336-339 - 2002
- [c1]Zheng-Hua Tan, Paul Dalsgaard:
Channel error protection scheme for distributed speech recognition. INTERSPEECH 2002: 2225-2228 - 2001
- [e1]Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan:
EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001. ISCA 2001 [contents]
Coauthor Index
aka: Morten Kolbaek
aka: Miklas Strøm Kristoffersen
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 22:05 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint