Hank Liao
2020 – today
- 2024
- [c22]Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shahy, Olivier Siohan:
Conformer is All You Need for Visual Speech Recognition. ICASSP 2024: 10136-10140
- [c21]Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. ICASSP 2024: 11801-11805
- [i13]Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao:
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models. CoRR abs/2401.03506 (2024)
- 2023
- [i12]Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Parag Shah, Olivier Siohan:
Conformers are All You Need for Visual Speech Recognition. CoRR abs/2302.10915 (2023)
- [i11]Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. CoRR abs/2309.08023 (2023)
- [i10]Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang:
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network. CoRR abs/2309.08489 (2023)
- [i9]Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. CoRR abs/2312.10088 (2023)
- 2022
- [j2]Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan:
On Robustness to Missing Video for Audiovisual Speech Recognition. Trans. Mach. Learn. Res. 2022 (2022)
- [i8]Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. CoRR abs/2205.05586 (2022)
- 2020
- [c20]Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao:
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition. ICASSP 2020: 6994-6998
2010 – 2019
- 2019
- [c19]Chung-Cheng Chiu, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang:
A Comparison of End-to-End Models for Long-Form Speech Recognition. ASRU 2019: 889-896
- [c18]Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. ASRU 2019: 905-912
- [c17]Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Misha Denil, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. INTERSPEECH 2019: 4135-4139
- [i7]Antonios Anastasopoulos, Shankar Kumar, Hank Liao:
Neural Language Modeling with Visual Features. CoRR abs/1903.02930 (2019)
- [i6]Ke Hu, Hasim Sak, Hank Liao:
Adversarial Training for Multilingual Acoustic Modeling. CoRR abs/1906.07093 (2019)
- [i5]Chung-Cheng Chiu, Wei Han, Yu Zhang, Ruoming Pang, Sergey Kishchenko, Patrick Nguyen, Arun Narayanan, Hank Liao, Shuyuan Zhang, Anjuli Kannan, Rohit Prabhavalkar, Zhifeng Chen, Tara N. Sainath, Yonghui Wu:
A comparison of end-to-end models for long-form speech recognition. CoRR abs/1911.02242 (2019)
- [i4]Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan:
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition. CoRR abs/1911.04890 (2019)
- 2018
- [c16]Kazuki Irie, Shankar Kumar, Michael Nirschl, Hank Liao:
RADMM: Recurrent Adaptive Mixture Model with Applications to Domain Robust Language Modeling. ICASSP 2018: 6079-6083
- [i3]Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. CoRR abs/1807.05162 (2018)
- 2017
- [c15]Hagen Soltau, Hank Liao, Hasim Sak:
Reducing the computational complexity for whole word models. ASRU 2017: 63-68
- [c14]Shankar Kumar, Michael Nirschl, Daniel Niels Holtmann-Rice, Hank Liao, Ananda Theertha Suresh, Felix X. Yu:
Lattice rescoring strategies for long short term memory language models in speech recognition. ASRU 2017: 165-172
- [c13]Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. INTERSPEECH 2017: 3707-3711
- [i2]Shankar Kumar, Michael Nirschl, Daniel N. Holtmann-Rice, Hank Liao, Ananda Theertha Suresh, Felix X. Yu:
Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition. CoRR abs/1711.05448 (2017)
- 2016
- [c12]Vitaly Kuznetsov, Hank Liao, Mehryar Mohri, Michael Riley, Brian Roark:
Learning N-Gram Language Models from Uncertain Data. INTERSPEECH 2016: 2323-2327
- [i1]Hagen Soltau, Hank Liao, Hasim Sak:
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition. CoRR abs/1610.09975 (2016)
- 2015
- [c11]Yanbo Xu, Olivier Siohan, David Simcha, Sanjiv Kumar, Hank Liao:
Exemplar-based large vocabulary speech recognition using k-nearest neighbors. ICASSP 2015: 5167-5171
- [c10]Hank Liao, Golan Pundak, Olivier Siohan, Melissa K. Carroll, Noah Coccaro, Qi-Ming Jiang, Tara N. Sainath, Andrew W. Senior, Françoise Beaufays, Michiel Bacchiani:
Large vocabulary automatic speech recognition for children. INTERSPEECH 2015: 1611-1615
- 2014
- [c9]Andrew W. Senior, Georg Heigold, Michiel Bacchiani, Hank Liao:
GMM-free DNN acoustic model training. ICASSP 2014: 5602-5606
- 2013
- [c8]Hank Liao, Erik McDermott, Andrew W. Senior:
Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription. ASRU 2013: 368-373
- [c7]Hank Liao:
Speaker adaptation of context dependent deep neural networks. ICASSP 2013: 7947-7951
- 2012
- [c6]Khe Chai Sim, Shengdong Zhao, Kai Yu, Hank Liao:
ICMI'12 grand challenge: haptic voice recognition. ICMI 2012: 363-370
- [p1]Hank Liao:
Uncertainty Decoding. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 463-486
- 2010
- [c5]Hank Liao, Christopher Alberti, Michiel Bacchiani, Olivier Siohan:
Decision tree state clustering with word and syllable features. INTERSPEECH 2010: 2958-2961
2000 – 2009
- 2009
- [c4]Christopher Alberti, Michiel Bacchiani, Ari Bezman, Ciprian Chelba, Anastassia Drofa, Hank Liao, Pedro J. Moreno, Ted Power, Arnaud Sahuguet, Maria Shugrina, Olivier Siohan:
An audio indexing system for election video material. ICASSP 2009: 4873-4876
- 2008
- [j1]Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust automatic speech recognition. Speech Commun. 50(4): 265-277 (2008)
- 2007
- [c3]Hank Liao, Mark J. F. Gales:
Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data. ICASSP (4) 2007: 389-392
- 2006
- [c2]Hank Liao, Mark J. F. Gales:
Issues with uncertainty decoding for noise robust speech recognition. INTERSPEECH 2006
- 2005
- [c1]Hank Liao, Mark J. F. Gales:
Joint uncertainty decoding for noise robust speech recognition. INTERSPEECH 2005: 3129-3132
last updated on 2024-08-08 20:18 CEST by the dblp team
all metadata released as open data under CC0 1.0 license