Hank Liao

2020 – today

2024
[c22]
persistent URL: https://dblp.org/rec/conf/icassp/ChangLSSS24
Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shah, Olivier Siohan:
Conformer is All You Need for Visual Speech Recognition. ICASSP 2024: 10136-10140
[c21]
persistent URL: https://dblp.org/rec/conf/icassp/ZhaoWPZLHLW24
Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. ICASSP 2024: 11801-11805
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2401-03506
Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao:
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models. CoRR abs/2401.03506 (2024)
2023
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-10915
Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Parag Shah, Olivier Siohan:
Conformers are All You Need for Visual Speech Recognition. CoRR abs/2302.10915 (2023)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-08023
Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. CoRR abs/2309.08023 (2023)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-08489
Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang:
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network. CoRR abs/2309.08489 (2023)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2312-10088
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. CoRR abs/2312.10088 (2023)
2022
[j2]
persistent URL: https://dblp.org/rec/journals/tmlr/ChangBLSS22
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. Trans. Mach. Learn. Res. 2022 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-05586
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. CoRR abs/2205.05586 (2022)
2020
[c20]
persistent URL: https://dblp.org/rec/conf/icassp/BragaMSL20
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. ICASSP 2020: 6994-6998

2010 – 2019

2019
[c19]
persistent URL: https://dblp.org/rec/conf/asru/ChiuKPCSWHZPKNN19
Chung-Cheng Chiu, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang:
A Comparison of End-to-End Models for Long-Form Speech Recognition. ASRU 2019: 889-896
[c18]
persistent URL: https://dblp.org/rec/conf/asru/MakinoLASGBS19
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. ASRU 2019: 905-912
[c17]
persistent URL: https://dblp.org/rec/conf/interspeech/ShillingfordAHP19
Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Misha Denil, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. INTERSPEECH 2019: 4135-4139
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-1903-02930
Antonios Anastasopoulos, Shankar Kumar, Hank Liao:
Neural Language Modeling with Visual Features. CoRR abs/1903.02930 (2019)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-1906-07093
Ke Hu, Hasim Sak, Hank Liao:
Adversarial Training for Multilingual Acoustic Modeling. CoRR abs/1906.07093 (2019)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-02242
Chung-Cheng Chiu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu:
A comparison of end-to-end models for long-form speech recognition. CoRR abs/1911.02242 (2019)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-04890
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. CoRR abs/1911.04890 (2019)
2018
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/IrieKNL18
Kazuki Irie, Shankar Kumar, Michael Nirschl, Hank Liao:
RADMM: Recurrent Adaptive Mixture Model with Applications to Domain Robust Language Modeling. ICASSP 2018: 6079-6083
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1807-05162
Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. CoRR abs/1807.05162 (2018)
2017
[c15]
persistent URL: https://dblp.org/rec/conf/asru/SoltauLS17
Hagen Soltau, Hank Liao, Hasim Sak:
Reducing the computational complexity for whole word models. ASRU 2017: 63-68
[c14]
persistent URL: https://dblp.org/rec/conf/asru/KumarNHLSY17
Shankar Kumar, Michael Nirschl, Daniel Niels Holtmann-Rice, Hank Liao, Ananda Theertha Suresh, Felix X. Yu:
Lattice rescoring strategies for long short term memory language models in speech recognition. ASRU 2017: 165-172
[c13]
persistent URL: https://dblp.org/rec/conf/interspeech/SoltauLS17
Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. INTERSPEECH 2017: 3707-3711
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1711-05448
Shankar Kumar, Michael Nirschl, Daniel N. Holtmann-Rice, Hank Liao, Ananda Theertha Suresh, Felix X. Yu:
Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition. CoRR abs/1711.05448 (2017)
2016
[c12]
persistent URL: https://dblp.org/rec/conf/interspeech/KuznetsovLMRR16
Vitaly Kuznetsov, Hank Liao, Mehryar Mohri, Michael Riley, Brian Roark:
Learning N-Gram Language Models from Uncertain Data. INTERSPEECH 2016: 2323-2327
[i1]
persistent URL: https://dblp.org/rec/journals/corr/SoltauLS16
Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. CoRR abs/1610.09975 (2016)
2015
[c11]
persistent URL: https://dblp.org/rec/conf/icassp/XuSSKL15
Yanbo Xu, Olivier Siohan, David Simcha, Sanjiv Kumar, Hank Liao:
Exemplar-based large vocabulary speech recognition using k-nearest neighbors. ICASSP 2015: 5167-5171
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/LiaoPSCCJSSBB15
Hank Liao, Golan Pundak, Olivier Siohan, Melissa K. Carroll, Noah Coccaro, Qi-Ming Jiang, Tara N. Sainath, Andrew W. Senior, Françoise Beaufays, Michiel Bacchiani:
Large vocabulary automatic speech recognition for children. INTERSPEECH 2015: 1611-1615
2014
[c9]
persistent URL: https://dblp.org/rec/conf/icassp/SeniorHBL14
Andrew W. Senior, Georg Heigold, Michiel Bacchiani, Hank Liao:
GMM-free DNN acoustic model training. ICASSP 2014: 5602-5606
2013
[c8]
persistent URL: https://dblp.org/rec/conf/asru/LiaoMS13
Hank Liao, Erik McDermott, Andrew W. Senior:
Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription. ASRU 2013: 368-373
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/Liao13
Hank Liao:
Speaker adaptation of context dependent deep neural networks. ICASSP 2013: 7947-7951
2012
[c6]
persistent URL: https://dblp.org/rec/conf/icmi/SimZYL12
Khe Chai Sim, Shengdong Zhao, Kai Yu, Hank Liao:
ICMI'12 grand challenge: haptic voice recognition. ICMI 2012: 363-370
[p1]
persistent URL: https://dblp.org/rec/books/wi/12/Liao12
Hank Liao:
Uncertainty Decoding. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 463-486
2010
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/LiaoABS10
Hank Liao, Christopher Alberti, Michiel Bacchiani, Olivier Siohan:
Decision tree state clustering with word and syllable features. INTERSPEECH 2010: 2958-2961

2000 – 2009

2009
[c4]
persistent URL: https://dblp.org/rec/conf/icassp/AlbertiBBCDLMPSSS09
Christopher Alberti, Michiel Bacchiani, Ari Bezman, Ciprian Chelba, Anastassia Drofa, Hank Liao, Pedro J. Moreno, Ted Power, Arnaud Sahuguet, Maria Shugrina, Olivier Siohan:
An audio indexing system for election video material. ICASSP 2009: 4873-4876
2008
[j1]
persistent URL: https://dblp.org/rec/journals/speech/LiaoG08
Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust automatic speech recognition. Speech Commun. 50(4): 265-277 (2008)
2007
[c3]
persistent URL: https://dblp.org/rec/conf/icassp/LiaoG07
Hank Liao, Mark J. F. Gales:
Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data. ICASSP (4) 2007: 389-392
2006
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/LiaoG06
Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust speech recognition. INTERSPEECH 2006
2005
[c1]
persistent URL: https://dblp.org/rec/conf/interspeech/LiaoG05
Hank Liao, Mark J. F. Gales:
Joint uncertainty decoding for noise robust speech recognition. INTERSPEECH 2005: 3129-3132
