Thomas Hain

Person information

affiliation: University of Sheffield, England, UK

2020 – today

2024
[c175]
  persistent URL:
  - https://dblp.org/rec/conf/coling/ParkCH24
Chanho Park, Mingjie Chen, Thomas Hain:
Automatic Speech Recognition System-Independent Word Error Rate Estimation. LREC/COLING 2024: 1979-1987
[c174]
  persistent URL:
  - https://dblp.org/rec/conf/eacl/MeghananiH24
Amit Meghanani, Thomas Hain:
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations. EACL (1) 2024: 1959-1967
[c173]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/IakovenkoH24
Olga Iakovenko, Thomas Hain:
Methods of Automatic Matrix Language Determination for Code-Switched Speech. EMNLP 2024: 5791-5800
[c172]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/CloseHG24
George Close, Thomas Hain, Stefan Goetze:
Hallucination in Perceptual Metric-Driven Speech Enhancement Networks. EUSIPCO 2024: 21-25
[c171]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/ParkKH24
Chanho Park, Hyunsik Kang, Thomas Hain:
Character Error Rate Estimation for Automatic Speech Recognition of Short Utterances. EUSIPCO 2024: 131-135
[c170]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DoIDH24
Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain:
Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis. EUSIPCO 2024: 136-140
[c169]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SutherlandCHGB24
Robert Sutherland, George Close, Thomas Hain, Stefan Goetze, Jon Barker:
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. EUSIPCO 2024: 421-425
[c168]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MogridgeCSHBGR24
Rhiannon Mogridge, George Close, Robert Sutherland, Thomas Hain, Jon Barker, Stefan Goetze, Anton Ragni:
Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models. ICASSP 2024: 306-310
[c167]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CloseRHG24
George Close, William Ravenscroft, Thomas Hain, Stefan Goetze:
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement. ICASSP 2024: 351-355
[c166]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AhmadFH24
Rehan Ahmad, Muhammad Umar Farooq, Thomas Hain:
Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training. ICASSP 2024: 11466-11470
[c165]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RavenscroftGH24
William Ravenscroft, Stefan Goetze, Thomas Hain:
Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation. ICASSP 2024: 11491-11495
[c164]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MeghananiH24
Amit Meghanani, Thomas Hain:
SCORE: Self-Supervised Correspondence Fine-Tuning for Improved Content Representations. ICASSP 2024: 12086-12090
[c163]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/ChenZLLWM0LRWW024
Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. Odyssey 2024: 260-265
[i55]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-13611
Rhiannon Mogridge, George Close, Robert Sutherland, Thomas Hain, Jon Barker, Stefan Goetze, Anton Ragni:
Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models. CoRR abs/2401.13611 (2024)
[i54]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-06260
Amit Meghanani, Thomas Hain:
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations. CoRR abs/2403.06260 (2024)
[i53]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-08738
Amit Meghanani, Thomas Hain:
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations. CoRR abs/2403.08738 (2024)
[i52]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11732
George Close, Thomas Hain, Stefan Goetze:
Hallucination in Perceptual Metric-Driven Speech Enhancement Networks. CoRR abs/2403.11732 (2024)
[i51]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-16743
Chanho Park, Mingjie Chen, Thomas Hain:
Automatic Speech Recognition System-Independent Word Error Rate Estimation. CoRR abs/2404.16743 (2024)
[i50]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20064
Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. CoRR abs/2405.20064 (2024)
[i49]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07162
Ziyang Ma, Mingjie Chen, Hezhao Zhang, Zhisheng Zheng, Wenxi Chen, Xiquan Li, Jiaxin Ye, Xie Chen, Thomas Hain:
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark. CoRR abs/2406.07162 (2024)
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08914
William Ravenscroft, George Close, Stefan Goetze, Thomas Hain, Mohammad Soleymanpour, Anurag Chowdhury, Mark C. Fuhs:
Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition. CoRR abs/2406.08914 (2024)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-09153
Amit Meghanani, Thomas Hain:
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks. CoRR abs/2406.09153 (2024)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04047
Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain:
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis. CoRR abs/2407.04047 (2024)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13333
Robert Sutherland, George Close, Thomas Hain, Stefan Goetze, Jon Barker:
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement. CoRR abs/2407.13333 (2024)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-02521
Olga Iakovenko, Thomas Hain:
Methods for Automatic Matrix Language Determination of Code-Switched Speech. CoRR abs/2410.02521 (2024)
2023
[c162]
  persistent URL:
  - https://dblp.org/rec/conf/asru/FarooqAH23
Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain:
MUST: A Multilingual Student-Teacher Learning Approach for Low-Resource Speech Recognition. ASRU 2023: 1-6
[c161]
  persistent URL:
  - https://dblp.org/rec/conf/asru/IslamHS23
Elaf Islam, Thomas Hain, Protima Nomo Sudro:
Simulation of Teacher-Learner Interaction in English Language Pronunciation Learning. ASRU 2023: 1-6
[c160]
  persistent URL:
  - https://dblp.org/rec/conf/asru/MeghananiH23
Amit Meghanani, Thomas Hain:
Deriving Translational Acoustic Sub-Word Embeddings. ASRU 2023: 1-8
[c159]
  persistent URL:
  - https://dblp.org/rec/conf/asru/RavenscroftGH23
William Ravenscroft, Stefan Goetze, Thomas Hain:
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments. ASRU 2023: 1-7
[c158]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SudroRH23
Protima Nomo Sudro, Anton Ragni, Thomas Hain:
Adapting Pretrained Models for Adult to Child Voice Conversion. EUSIPCO 2023: 271-275
[c157]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/RavenscroftGH23
William Ravenscroft, Stefan Goetze, Thomas Hain:
On Data Sampling Strategies for Training Neural Network Speech Separation Models. EUSIPCO 2023: 331-335
[c156]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/OllerenshawJH23
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Probing Statistical Representations for End-to-End ASR. EUSIPCO 2023: 401-405
[c155]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AhmadJFOH23
Rehan Ahmad, Md Asif Jalal, Muhammad Umar Farooq, Anna Ollerenshaw, Thomas Hain:
Towards Domain Generalisation in ASR with Elitist Sampling and Ensemble Knowledge Distillation. ICASSP 2023: 1-5
[c154]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CloseRHG23
George Close, William Ravenscroft, Thomas Hain, Stefan Goetze:
Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement. ICASSP 2023: 1-5
[c153]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RavenscroftGH23
William Ravenscroft, Stefan Goetze, Thomas Hain:
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. ICASSP 2023: 1-5
[c152]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoDLH23
Cong-Thanh Do, Rama Doddipatla, Mohan Li, Thomas Hain:
Domain Adaptive Self-supervised Training of Automatic Speech Recognition. INTERSPEECH 2023: 4389-4393
[c151]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FarooqH23
Muhammad Umar Farooq, Thomas Hain:
Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition. INTERSPEECH 2023: 5072-5076
[c150]
  persistent URL:
  - https://dblp.org/rec/conf/slte/IslamPH23
Elaf Islam, Chanho Park, Thomas Hain:
Exploring Speech Representations for Proficiency Assessment in Language Learning. SLaTE 2023: 151-155
[c149]
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/CloseHG23
George Close, Thomas Hain, Stefan Goetze:
The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions. WASPAA 2023: 1-5
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-04388
George Close, William Ravenscroft, Thomas Hain, Stefan Goetze:
Perceive and predict: self-supervised speech representation based loss functions for speech enhancement. CoRR abs/2301.04388 (2023)
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-00550
Rehan Ahmad, Md Asif Jalal, Muhammad Umar Farooq, Anna Ollerenshaw, Thomas Hain:
Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation. CoRR abs/2303.00550 (2023)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-07142
William Ravenscroft, Stefan Goetze, Thomas Hain:
On Data Sampling Strategies for Training Neural Network Speech Separation Models. CoRR abs/2304.07142 (2023)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08577
Muhammad Umar Farooq, Thomas Hain:
Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition. CoRR abs/2306.08577 (2023)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-17500
Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain:
Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition. CoRR abs/2306.17500 (2023)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-13423
George Close, Thomas Hain, Stefan Goetze:
Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations. CoRR abs/2307.13423 (2023)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-14502
George Close, Thomas Hain, Stefan Goetze:
The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions. CoRR abs/2307.14502 (2023)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06125
William Ravenscroft, Stefan Goetze, Thomas Hain:
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments. CoRR abs/2310.06125 (2023)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08225
Chanho Park, Chengsong Lu, Mingjie Chen, Thomas Hain:
Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text. CoRR abs/2310.08225 (2023)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18865
Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain:
MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition. CoRR abs/2310.18865 (2023)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08979
George Close, William Ravenscroft, Thomas Hain, Stefan Goetze:
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement. CoRR abs/2312.08979 (2023)
2022
[j16]
  persistent URL:
  - https://dblp.org/rec/journals/csl/HasanJHD22
Madina Hasan, Nicholas Jefferson, Thomas Hain, Jeremy Dawson:
Automatic detection of behavioural codes in team interactions. Comput. Speech Lang. 74: 101339 (2022)
[c148]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/RavenscroftGH22
William Ravenscroft, Stefan Goetze, Thomas Hain:
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. EUSIPCO 2022: 80-84
[c147]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/CloseHG22
George Close, Thomas Hain, Stefan Goetze:
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. EUSIPCO 2022: 165-169
[c146]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/OllerenshawJH22
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Insights of Neural Representations in Multi-Banded and Multi-Channel Convolutional Transformers for End-to-End ASR. EUSIPCO 2022: 434-438
[c145]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaenzH22
Jose Antonio Lopez Saenz, Thomas Hain:
A Model for Assessor Bias in Automatic Pronunciation Assessment. ICASSP 2022: 7267-7271
[c144]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ParkAH22
Chanho Park, Rehan Ahmad, Thomas Hain:
Unsupervised Data Selection for Speech Recognition with Contrastive Loss Ratios. ICASSP 2022: 8587-8591
[c143]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CloseHGH22
George Close, Samuel Hollands, Stefan Goetze, Thomas Hain:
Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals. INTERSPEECH 2022: 3483-3487
[c142]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FarooqH22
Muhammad Umar Farooq, Thomas Hain:
Investigating the Impact of Crosslingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition. INTERSPEECH 2022: 3849-3853
[c141]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FarooqNH22
Muhammad Umar Farooq, Darshan Adiga Haniya Narayana, Thomas Hain:
Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion. INTERSPEECH 2022: 4850-4854
[c140]
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/RavenscroftGH22
William Ravenscroft, Stefan Goetze, Thomas Hain:
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. IWAENC 2022: 1-5
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-12369
George Close, Thomas Hain, Stefan Goetze:
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. CoRR abs/2203.12369 (2022)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-06439
William Ravenscroft, Stefan Goetze, Thomas Hain:
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. CoRR abs/2204.06439 (2022)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-08455
William Ravenscroft, Stefan Goetze, Thomas Hain:
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation. CoRR abs/2205.08455 (2022)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-09456
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Insights on Neural Representations for End-to-End Speech Recognition. CoRR abs/2205.09456 (2022)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-02104
Rosanna Milner, Md Asif Jalal, Raymond W. M. Ng, Thomas Hain:
A cross-corpus study on speech emotion recognition. CoRR abs/2207.02104 (2022)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-03390
Muhammad Umar Farooq, Thomas Hain:
Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition. CoRR abs/2207.03390 (2022)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-03391
Muhammad Umar Farooq, Darshan Adiga Haniya Narayana, Thomas Hain:
Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion. CoRR abs/2207.03391 (2022)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-12028
Chanho Park, Rehan Ahmad, Thomas Hain:
Unsupervised data selection for Speech Recognition with contrastive loss ratios. CoRR abs/2207.12028 (2022)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15305
William Ravenscroft, Stefan Goetze, Thomas Hain:
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. CoRR abs/2210.15305 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01993
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Probing Statistical Representations For End-To-End ASR. CoRR abs/2211.01993 (2022)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02000
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Dynamic Kernels and Channel Attention with Multi-Layer Embedding Aggregation for Speaker Verification. CoRR abs/2211.02000 (2022)
2021
[j15]
  persistent URL:
  - https://dblp.org/rec/journals/jbd/HannaniESHO21
Asmaa El Hannani, Rahhal Errattahi, Fatima Zahra Salmam, Thomas Hain, Hassan Ouahmane:
Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection. J. Big Data 8(1): 5 (2021)
[j14]
  persistent URL:
  - https://dblp.org/rec/journals/nn/ShiHH21
Yanpei Shi, Qiang Huang, Thomas Hain:
H-VECTORS: Improving the robustness in utterance-level speaker embeddings using a hierarchical attention model. Neural Networks 142: 329-339 (2021)
[c139]
  persistent URL:
  - https://dblp.org/rec/conf/acl/FriedlRSHSHS21
Korbinian Friedl, Georgios Rizos, Lukas Stappen, Madina Hasan, Lucia Specia, Thomas Hain, Björn W. Schuller:
Uncertainty Aware Review Hallucination for Science Article Classification. ACL/IJCNLP (Findings) 2021: 5004-5009
[c138]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaenzJMH21
Jose Antonio Lopez Saenz, Md Asif Jalal, Rosanna Milner, Thomas Hain:
Attention Based Model for Segmental Pronunciation Error Detection. ASRU 2021: 725-732
[c137]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenSH21
Mingjie Chen, Yanpei Shi, Thomas Hain:
Towards Low-Resource Stargan Voice Conversion Using Weight Adaptive Instance Normalization. ICASSP 2021: 5949-5953
[c136]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0008H21
Qiang Huang, Thomas Hain:
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Networks. ICASSP 2021: 6473-6477
[c135]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DoDH21
Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. ICASSP 2021: 6978-6982
[c134]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OllerenshawJH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OllerenshawJH21
Anna Ollerenshaw, Md. Asif Jalal, Thomas Hain:
Insights on Neural Representations for End-to-End Speech Recognition. Interspeech 2021: 4079-4083
[c133]
- view
  authority control:
- export record
  dblp key:
  - conf/pricai/HuangCXKH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/pricai/HuangCXKH21
Shengjie Huang, Mingjie Chen, Yanyan Xu, Dengfeng Ke, Thomas Hain:
WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization. PRICAI (2) 2021: 559-573
[c132]
- view
  authority control:
- export record
  dblp key:
  - conf/slsp/SaenzH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slsp/SaenzH21
Jose Antonio Lopez Saenz, Thomas Hain:
Use of Speaker Metadata for Improving Automatic Pronunciation Assessment. SLSP 2021: 61-72
[c131]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ShiH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ShiH21
Yanpei Shi, Thomas Hain:
Contextual Joint Factor Acoustic Embeddings. SLT 2021: 750-757
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ShiH21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ShiH21a
Yanpei Shi, Thomas Hain:
Supervised Speaker Embedding De-Mixing in Two-Speaker Environment. SLT 2021: 758-765
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-15515
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-15515
Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition. CoRR abs/2103.15515 (2021)
2020
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/DoZH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/DoZH20
Cong-Thanh Do, Shucong Zhang, Thomas Hain:
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness. EUSIPCO 2020: 321-325
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiHH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiHH20
Yanpei Shi, Qiang Huang, Thomas Hain:
H-Vectors: Utterance-Level Speaker Embedding Using a Hierarchical Attention Model. ICASSP 2020: 7579-7583
[c127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Shi0H20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Shi0H20
Yanpei Shi, Qiang Huang, Thomas Hain:
Speaker Re-Identification with Speaker Dependent Speech Enhancement. INTERSPEECH 2020: 1530-1534
[c126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StappenRHHS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StappenRHHS20
Lukas Stappen, Georgios Rizos, Madina Hasan, Thomas Hain, Björn W. Schuller:
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus. INTERSPEECH 2020: 1808-1812
[c125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Shi0H20a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Shi0H20a
Yanpei Shi, Qiang Huang, Thomas Hain:
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification. INTERSPEECH 2020: 2992-2996
[c124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JalalMHM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JalalMHM20
Md Asif Jalal, Rosanna Milner, Thomas Hain, Roger K. Moore:
Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. INTERSPEECH 2020: 4084-4088
[c123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JalalMH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JalalMH20
Md. Asif Jalal, Rosanna Milner, Thomas Hain:
Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition. INTERSPEECH 2020: 4113-4117
[c122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008H20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008H20
Qiang Huang, Thomas Hain:
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models. INTERSPEECH 2020: 4611-4615
[c121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SailorH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SailorH20
Hardik B. Sailor, Thomas Hain:
Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages. INTERSPEECH 2020: 4756-4760
[c120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenH20
Mingjie Chen, Thomas Hain:
Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders. INTERSPEECH 2020: 4866-4870
[c119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/Shi0H20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/Shi0H20
Yanpei Shi, Qiang Huang, Thomas Hain:
Robust Speaker Recognition Using Speech Enhancement And Attention Model. Odyssey 2020: 451-458
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-05031
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-05031
Yanpei Shi, Qiang Huang, Thomas Hain:
Robust Speaker Recognition Using Speech Enhancement And Attention Model. CoRR abs/2001.05031 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-06397
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-06397
Yanpei Shi, Thomas Hain:
Supervised Speaker Embedding De-Mixing in Two-Speaker Environment. CoRR abs/2001.06397 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07817
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07817
Yanpei Shi, Qiang Huang, Thomas Hain:
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification. CoRR abs/2005.07817 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07818
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07818
Yanpei Shi, Qiang Huang, Thomas Hain:
Speaker Re-identification with Speaker Dependent Speech Enhancement. CoRR abs/2005.07818 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08053
Qiang Huang, Thomas Hain:
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models. CoRR abs/2005.08053 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06892
Mingjie Chen, Thomas Hain:
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders. CoRR abs/2008.06892 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11286
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11286
Qiang Huang, Thomas Hain:
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network. CoRR abs/2010.11286 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11646
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11646
Mingjie Chen, Yanpei Shi, Thomas Hain:
Towards Low-Resource StarGAN Voice Conversion using Weight Adaptive Instance Normalization. CoRR abs/2010.11646 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-16071
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-16071
Yanpei Shi, Mingjie Chen, Qiang Huang, Thomas Hain:
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model. CoRR abs/2010.16071 (2020)

2010 – 2019

2019
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/ErrattahiHHO19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/ErrattahiHHO19
Rahhal Errattahi, Asmaa El Hannani, Thomas Hain, Hassan Ouahmane:
System-independent ASR error detection and classification using Recurrent Neural Network. Comput. Speech Lang. 55: 187-199 (2019)
[j12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/DeenaHDSH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DeenaHDSH19
Salil Deena, Madina Hasan, Mortaza Doulaty, Oscar Saz, Thomas Hain:
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 572-582 (2019)
[c118]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MilnerJNH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MilnerJNH19
Rosanna Milner, Md Asif Jalal, Raymond W. M. Ng, Thomas Hain:
A Cross-Corpus Study on Speech Emotion Recognition. ASRU 2019: 304-311
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/JalalMH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/JalalMH19
Md Asif Jalal, Roger K. Moore, Thomas Hain:
Spatio-Temporal Context Modelling for Speech Emotion Classification. ASRU 2019: 853-859
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SailorDJLH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SailorDJLH19
Hardik B. Sailor, Salil Deena, Md Asif Jalal, Rasa Lileikyte, Thomas Hain:
Unsupervised Adaptation of Acoustic Models for ASR Using Utterance-Level Embeddings from Squeeze and Excitation Networks. ASRU 2019: 980-987
[c115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangH19
Qiang Huang, Thomas Hain:
Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention. INTERSPEECH 2019: 584-588
[c114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JalalLMH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JalalLMH19
Md Asif Jalal, Erfan Loweimi, Roger K. Moore, Thomas Hain:
Learning Temporal Clusters Using Capsule Routing for Speech Emotion Recognition. INTERSPEECH 2019: 1701-1705
[c113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoulatyH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoulatyH19
Mortaza Doulaty, Thomas Hain:
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition. INTERSPEECH 2019: 3228-3232
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-01302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-01302
Mortaza Doulaty, Thomas Hain:
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition. CoRR abs/1907.01302 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11200
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11200
Yanpei Shi, Qiang Huang, Thomas Hain:
Improving Robustness In Speaker Identification Using A Two-Stage Attention Model. CoRR abs/1909.11200 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07601
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07601
Yanpei Shi, Qiang Huang, Thomas Hain:
Contextual Joint Factor Acoustic Embeddings. CoRR abs/1910.07601 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07900
Yanpei Shi, Qiang Huang, Thomas Hain:
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model. CoRR abs/1910.07900 (2019)
2018
[j11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/mta/SazDDHKMNOH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/SazDDHKMNOH18
Oscar Saz, Salil Deena, Mortaza Doulaty, Madina Hasan, Bilal Khaliq, Rosanna Milner, Raymond W. M. Ng, Julia Olcoz, Thomas Hain:
Lightly supervised alignment of subtitles on multi-genre broadcasts. Multim. Tools Appl. 77(23): 30533-30550 (2018)
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/atsip/ErrattahiHHO18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atsip/ErrattahiHHO18
Rahhal Errattahi, Asmaa El Hannani, Thomas Hain, Hassan Ouahmane:
Towards a generic approach for automatic speech recognition error detection and classification. ATSIP 2018: 1-6
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LoweimiBH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LoweimiBH18
Erfan Loweimi, Jon Barker, Thomas Hain:
Exploring the Use of Group Delay for Generalised VTS Based Noise Compensation. ICASSP 2018: 4824-4828
[c110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoweimiBH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoweimiBH18
Erfan Loweimi, Jon Barker, Thomas Hain:
On the Usefulness of the Speech Phase Spectrum for Pitch Extraction. INTERSPEECH 2018: 696-700
[c109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NicolaoSH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NicolaoSH18
Mauro Nicolao, Michiel Sanders, Thomas Hain:
Improved Acoustic Modelling for Automatic Literacy Assessment of Children. INTERSPEECH 2018: 1666-1670
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ErrattahiDHOH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ErrattahiDHOH18
Rahhal Errattahi, Salil Deena, Asmaa El Hannani, Hassan Ouahmane, Thomas Hain:
Improving ASR Error Detection with RNNLM Adaptation. SLT 2018: 190-196
2017
[j10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/csl/SazH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/SazH17
Oscar Saz, Thomas Hain:
Acoustic adaptation to dynamic background conditions with asynchronous transformations. Comput. Speech Lang. 41: 180-194 (2017)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/NgNH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/NgNH17
Raymond W. M. Ng, Mauro Nicolao, Thomas Hain:
Unsupervised crosslingual adaptation of tokenisers for spoken language recognition. Comput. Speech Lang. 46: 327-342 (2017)
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DeenaNMSH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DeenaNMSH17
Salil Deena, Raymond W. M. Ng, Pranava Swaroop Madhyastha, Lucia Specia, Thomas Hain:
Exploring the use of acoustic embeddings in neural machine translation. ASRU 2017: 450-457
[c106]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MilnerH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MilnerH17
Rosanna Milner, Thomas Hain:
DNN approach to speaker diarisation using speaker channels. ICASSP 2017: 4925-4929
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LoweimiBH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LoweimiBH17
Erfan Loweimi, Jon Barker, Thomas Hain:
Statistical normalisation of phase-based feature representation for robust speech recognition. ICASSP 2017: 5310-5314
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgKLH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgKLH17
Raymond W. M. Ng, Alvin C. M. Kwan, Tan Lee, Thomas Hain:
Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment. ICASSP 2017: 5825-5829
[c103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoweimiBSH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoweimiBSH17
Erfan Loweimi, Jon Barker, Oscar Saz-Torralba, Thomas Hain:
Robust Source-Filter Separation of Speech Signal in the Phase Domain. INTERSPEECH 2017: 414-418
[c102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoweimiBH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoweimiBH17
Erfan Loweimi, Jon Barker, Thomas Hain:
Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR. INTERSPEECH 2017: 2466-2470
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DeenaNMSH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DeenaNMSH17
Salil Deena, Raymond W. M. Ng, Pranava Swaroop Madhyastha, Lucia Specia, Thomas Hain:
Semi-Supervised Adaptation of RNNLMs by Fine-Tuning with Domain-Specific Auxiliary Features. INTERSPEECH 2017: 2715-2719
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/iwssip/WuNSH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwssip/WuNSH17
Chenhao Wu, Raymond W. M. Ng, Oscar Saz-Torralba, Thomas Hain:
Analysing acoustic model changes for active learning in automatic speech recognition. IWSSIP 2017: 1-5
2016
[c99]
- view
  authority control:
- export record
  dblp key:
  - conf/aiccsa/ErrattahiHOH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aiccsa/ErrattahiHOH16
Rahhal Errattahi, Asmaa El Hannani, Hassan Ouahmane, Thomas Hain:
Automatic speech recognition errors detection using supervised learning techniques. AICCSA 2016: 1-6
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MilnerH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MilnerH16
Rosanna Milner, Thomas Hain:
Segment-oriented evaluation of speaker diarisation performance. ICASSP 2016: 5460-5464
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgSSH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgSSH16
Raymond W. M. Ng, Kashif Shah, Lucia Specia, Thomas Hain:
Groupwise learning for ASR k-best list reranking in spoken language translation. ICASSP 2016: 6120-6124
[c96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Al-ShareefH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Al-ShareefH16
Sarah Al-Shareef, Thomas Hain:
Colloquialising Modern Standard Arabic Text for Improved Speech Recognition. INTERSPEECH 2016: 1345-1349
[c95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HainCSDHNMDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainCSDHNMDL16
Thomas Hain, Jeremy Christian, Oscar Saz, Salil Deena, Madina Hasan, Raymond W. M. Ng, Rosanna Milner, Mortaza Doulaty, Yulan Liu:
webASR 2 - Improved Cloud Based Speech Technology. INTERSPEECH 2016: 1613-1617
[c94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OlcozSH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OlcozSH16
Julia Olcoz, Oscar Saz, Thomas Hain:
Error Correction in Lightly Supervised Alignment of Broadcast Subtitles. INTERSPEECH 2016: 2110-2114
[c93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoulatySNH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoulatySNH16
Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Automatic Genre and Show Identification of Broadcast Media. INTERSPEECH 2016: 2115-2119
[c92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MilnerH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MilnerH16
Rosanna Milner, Thomas Hain:
DNN-Based Speaker Clustering for Speaker Diarisation. INTERSPEECH 2016: 2185-2189
[c91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DeenaHDSH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DeenaHDSH16
Salil Deena, Madina Hasan, Mortaza Doulaty, Oscar Saz, Thomas Hain:
Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition. INTERSPEECH 2016: 2343-2347
[c90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CasanuevaHG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CasanuevaHG16
Iñigo Casanueva, Thomas Hain, Phil D. Green:
Improving Generalisation to New Speakers in Spoken Dialogue State Tracking. INTERSPEECH 2016: 2726-2730
[c89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgCH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgCH16
Raymond W. M. Ng, Bhusan Chettri, Thomas Hain:
Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting. INTERSPEECH 2016: 2939-2943
[c88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoweimiBH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoweimiBH16
Erfan Loweimi, Jon Barker, Thomas Hain:
Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. INTERSPEECH 2016: 3798-3802
[c87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuFHH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuFHH16
Yulan Liu, Charles Fox, Madina Hasan, Thomas Hain:
The Sheffield Wargame Corpus - Day Two and Day Three. INTERSPEECH 2016: 3833-3837
[c86]
- view
  - electronic edition @ lrec-conf.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/lrec/AlHarbiH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/AlHarbiH16
Ghada AlHarbi, Thomas Hain:
The OpenCourseWare Metadiscourse (OCWMD) Corpus. LREC 2016
[c85]
- view
  - electronic edition @ lrec-conf.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/lrec/NicolaoCCGH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lrec/NicolaoCCGH16
Mauro Nicolao, Heidi Christensen, Stuart P. Cunningham, Phil D. Green, Thomas Hain:
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. LREC 2016
[c84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/NgNSHCDLH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/NgNSHCDLH16
Raymond W. M. Ng, Mauro Nicolao, Oscar Saz, Madina Hasan, Bhusan Chettri, Mortaza Doulaty, Tan Lee, Thomas Hain:
The Sheffield language recognition system in NIST LRE 2015. Odyssey 2016: 181-187
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/sigdial/CasanuevaHNG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/CasanuevaHNG16
Iñigo Casanueva, Thomas Hain, Mauro Nicolao, Phil D. Green:
Using phone features to improve dialogue state tracking generalisation to unseen states. SIGDIAL Conference 2016: 80-89
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/DoulatySNH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/DoulatySNH16
Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Automatic Genre and Show Identification of Broadcast Media. CoRR abs/1606.03333 (2016)
2015
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DoulatySNH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DoulatySNH15
Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. ASRU 2015: 130-136
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SazDDMNHLH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SazDDMNHLH15
Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain:
The 2015 sheffield system for transcription of Multi-Genre Broadcast media. ASRU 2015: 624-631
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MilnerSDDNH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MilnerSDDNH15
Rosanna Milner, Oscar Saz, Salil Deena, Mortaza Doulaty, Raymond W. M. Ng, Thomas Hain:
The 2015 sheffield system for longitudinal diarisation of broadcast media. ASRU 2015: 632-638
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/BellGHKLLMRSWW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/BellGHKLLMRSWW15
Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland:
The MGB challenge: Evaluating multi-genre broadcast media recognition. ASRU 2015: 687-693
[c78]
- view
  - electronic edition @ educationaldatamining.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/edm/AlHarbiH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/edm/AlHarbiH15
Ghada AlHarbi, Thomas Hain:
Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources. EDM 2015: 524-527
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuKH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuKH15
Yulan Liu, Penny Karanasou, Thomas Hain:
An investigation into speaker informed DNN front-end for LVCSR. ICASSP 2015: 4300-4304
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NgSASH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgSASH15
Raymond W. M. Ng, Kashif Shah, Wilker Aziz, Lucia Specia, Thomas Hain:
Quality estimation for asr k-best list rescoring in spoken language translation. ICASSP 2015: 5226-5230
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NicolaoBH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NicolaoBH15
Mauro Nicolao, Amy V. Beeston, Thomas Hain:
Automatic assessment of English learner pronunciation using discriminative classifiers. ICASSP 2015: 5351-5355
[c74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HasanDH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HasanDH15
Madina Hasan, Rama Doddipatla, Thomas Hain:
Noise-matched training of CRF based sentence end detection models. INTERSPEECH 2015: 349-353
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoweimiBH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoweimiBH15
Erfan Loweimi, Jon Barker, Thomas Hain:
Source-filter separation of speech signal in the phase domain. INTERSPEECH 2015: 598-602
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NgSSH15
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgSSH15
Raymond W. M. Ng, Kashif Shah, Lucia Specia, Thomas Hain:
A study on the stability and effectiveness of features in quality estimation for spoken language translation. INTERSPEECH 2015: 2257-2261
[c71]
  dblp key:
  - conf/interspeech/DoulatySH15
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoulatySH15
Mortaza Doulaty, Oscar Saz, Thomas Hain:
Data-selective transfer learning for multi-domain speech recognition. INTERSPEECH 2015: 2897-2901
[c70]
  dblp key:
  - conf/interspeech/DoulatySH15a
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoulatySH15a
Mortaza Doulaty, Oscar Saz, Thomas Hain:
Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. INTERSPEECH 2015: 3640-3644
[c69]
  dblp key:
  - conf/sigdial/CasanuevaHCMG15
  persistent URL:
  - https://dblp.org/rec/conf/sigdial/CasanuevaHCMG15
Iñigo Casanueva, Thomas Hain, Heidi Christensen, Ricard Marxer, Phil D. Green:
Knowledge transfer between speakers for personalised dialogue management. SIGDIAL Conference 2015: 12-21
[c68]
  dblp key:
  - conf/slsp/LoweimiDBH15
  persistent URL:
  - https://dblp.org/rec/conf/slsp/LoweimiDBH15
Erfan Loweimi, Mortaza Doulaty, Jon Barker, Thomas Hain:
Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition. SLSP 2015: 173-184
[c67]
  dblp key:
  - conf/slte/AlHarbiNH15
  persistent URL:
  - https://dblp.org/rec/conf/slte/AlHarbiNH15
Ghada AlHarbi, Raymond W. M. Ng, Thomas Hain:
Annotating meta-discourse in academic lectures from different disciplines. SLaTE 2015: 161-166
[i6]
  dblp key:
  - journals/corr/DoulatySH15
  persistent URL:
  - https://dblp.org/rec/journals/corr/DoulatySH15
Mortaza Doulaty, Oscar Saz, Thomas Hain:
Data-selective Transfer Learning for Multi-Domain Speech Recognition. CoRR abs/1509.02409 (2015)
[i5]
  dblp key:
  - journals/corr/DoulatySH15a
  persistent URL:
  - https://dblp.org/rec/journals/corr/DoulatySH15a
Mortaza Doulaty, Oscar Saz, Thomas Hain:
Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. CoRR abs/1509.02412 (2015)
[i4]
  dblp key:
  - journals/corr/NgDDASSHASH15
  persistent URL:
  - https://dblp.org/rec/journals/corr/NgDDASSHASH15
Raymond W. M. Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada AlHarbi, Lucia Specia, Thomas Hain:
The USFD Spoken Language Translation System for IWSLT 2014. CoRR abs/1509.03870 (2015)
[i3]
  dblp key:
  - journals/corr/SazDH15
  persistent URL:
  - https://dblp.org/rec/journals/corr/SazDH15
Oscar Saz, Mortaza Doulaty, Thomas Hain:
Background-tracking Acoustic Features for Genre Identification of Broadcast Shows. CoRR abs/1509.04934 (2015)
[i2]
  dblp key:
  - journals/corr/DoulatySNH15
  persistent URL:
  - https://dblp.org/rec/journals/corr/DoulatySNH15
Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain:
Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation. CoRR abs/1511.05076 (2015)
[i1]
  dblp key:
  - journals/corr/SazDDMNHLH15
  persistent URL:
  - https://dblp.org/rec/journals/corr/SazDDMNHLH15
Oscar Saz, Mortaza Doulaty, Salil Deena, Rosanna Milner, Raymond W. M. Ng, Madina Hasan, Yulan Liu, Thomas Hain:
The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media. CoRR abs/1512.06643 (2015)
2014
[j8]
  dblp key:
  - journals/csl/KamperWHN14
  persistent URL:
  - https://dblp.org/rec/journals/csl/KamperWHN14
Herman Kamper, Febe de Wet, Thomas Hain, Thomas Niesler:
Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system. Comput. Speech Lang. 28(6): 1255-1268 (2014)
[c66]
  dblp key:
  - conf/icassp/LiuZH14
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuZH14
Yulan Liu, Pengyuan Zhang, Thomas Hain:
Using neural network front-ends on far field multiple microphones based speech recognition. ICASSP 2014: 5542-5546
[c65]
  dblp key:
  - conf/icassp/SazH14
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SazH14
Oscar Saz, Thomas Hain:
Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. ICASSP 2014: 6314-6318
[c64]
  dblp key:
  - conf/interspeech/CasanuevaCHG14
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CasanuevaCHG14
I. Casanueva, Heidi Christensen, Thomas Hain, Phil D. Green:
Adaptive speech recognition and dialogue management for users with speech disorders. INTERSPEECH 2014: 1033-1037
[c63]
  dblp key:
  - conf/interspeech/DoddipatlaHH14
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoddipatlaHH14
Rama Doddipatla, Madina Hasan, Thomas Hain:
Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. INTERSPEECH 2014: 2199-2203
[c62]
  dblp key:
  - conf/interspeech/FoxH14
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FoxH14
Charles Fox, Thomas Hain:
Extending Limabeam with discrimination and coarse gradients. INTERSPEECH 2014: 2440-2444
[c61]
  dblp key:
  - conf/interspeech/HasanDH14
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HasanDH14
Madina Hasan, Rama Doddipatla, Thomas Hain:
Multi-pass sentence-end detection of lecture speech. INTERSPEECH 2014: 2902-2906
[c60]
  dblp key:
  - conf/iwslt/NgDDASSHASH14
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/NgDDASSHASH14
Raymond W. M. Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada AlHarbi, Lucia Specia, Thomas Hain:
The USFD SLT system for IWSLT 2014. IWSLT (Evaluation Campaign) 2014
[c59]
  dblp key:
  - conf/slt/SazDH14
  persistent URL:
  - https://dblp.org/rec/conf/slt/SazDH14
Oscar Saz, Mortaza Doulaty, Thomas Hain:
Background-tracking acoustic features for genre identification of broadcast shows. SLT 2014: 118-123
[c58]
  dblp key:
  - conf/slt/ZhangLH14
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhangLH14
Pengyuan Zhang, Yulan Liu, Thomas Hain:
Semi-supervised DNN training in meeting recognition. SLT 2014: 141-146
[c57]
  dblp key:
  - conf/slt/ChristensenCCGH14
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChristensenCCGH14
Heidi Christensen, I. Casanueva, Stuart P. Cunningham, Phil D. Green, Thomas Hain:
Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. SLT 2014: 254-259
2013
[c56]
  dblp key:
  - conf/icassp/FoxH13
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FoxH13
Charles Fox, Thomas Hain:
Lightly supervised learning from a damaged natural speech corpus. ICASSP 2013: 8086-8090
[c55]
  dblp key:
  - conf/icassp/NgHC13
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NgHC13
Raymond W. M. Ng, Thomas Hain, Trevor Cohn:
Adaptation of lecture speech recognition system with machine translation output. ICASSP 2013: 8401-8405
[c54]
  dblp key:
  - conf/interspeech/Lanchantin13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lanchantin13
Pierre Lanchantin, Peter Bell, Mark J. F. Gales, Thomas Hain, Xunying Liu, Yanhua Long, Jennifer Quinnell, Steve Renals, Oscar Saz, Matthew Stephen Seigel, Pawel Swietojanski, Philip C. Woodland:
Automatic Transcription of Multi-genre Media Archives. SLAM@INTERSPEECH 2013: 26-31
[c53]
  dblp key:
  - conf/interspeech/FoxLZH13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FoxLZH13
Charles Fox, Yulan Liu, Erich Zwyssig, Thomas Hain:
The sheffield wargames corpus. INTERSPEECH 2013: 1116-1120
[c52]
  dblp key:
  - conf/interspeech/ChristensenGH13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChristensenGH13
Heidi Christensen, Phil D. Green, Thomas Hain:
Learning speaker-specific pronunciations of disordered speech. INTERSPEECH 2013: 1159-1163
[c51]
  dblp key:
  - conf/interspeech/SazH13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SazH13
Oscar Saz, Thomas Hain:
Asynchronous factorisation of speaker and background with feature transforms in speech recognition. INTERSPEECH 2013: 1238-1242
[c50]
  dblp key:
  - conf/interspeech/ChristensenABGHKS13
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChristensenABGHKS13
Heidi Christensen, Magda B. Aniol, Peter Bell, Phil D. Green, Thomas Hain, Simon King, Pawel Swietojanski:
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. INTERSPEECH 2013: 3642-3645
[c49]
  dblp key:
  - conf/slpat/ChristensenCCGH13
  persistent URL:
  - https://dblp.org/rec/conf/slpat/ChristensenCCGH13
Heidi Christensen, Iñigo Casanueva, Stuart P. Cunningham, Phil D. Green, Thomas Hain:
homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. SLPAT 2013: 29-34
2012
[j7]
  dblp key:
  - journals/taslp/HainBDGGHHKLW12
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HainBDGGHHKLW12
Thomas Hain, Lukás Burget, John Dines, Philip N. Garner, Frantisek Grézl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, Vincent Wan:
Transcribing Meetings With the AMIDA Systems. IEEE Trans. Speech Audio Process. 20(2): 486-498 (2012)
[j6]
  dblp key:
  - journals/taslp/GibsonH12
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GibsonH12
Matthew Gibson, Thomas Hain:
Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation. IEEE Trans. Speech Audio Process. 20(10): 2648-2656 (2012)
[c48]
  dblp key:
  - conf/icassp/GibsonH12
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GibsonH12
Matthew Gibson, Thomas Hain:
Application of SVM-based correctness predictions to unsupervised discriminative speaker adaptation. ICASSP 2012: 4341-4344
[c47]
  dblp key:
  - conf/interspeech/LecorveDHM12
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LecorveDHM12
Gwénolé Lecorvé, John Dines, Thomas Hain, Petr Motlícek:
Supervised and unsupervised Web-based language model domain adaptation. INTERSPEECH 2012: 182-185
[c46]
  dblp key:
  - conf/interspeech/NgHH12
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NgHH12
Raymond W. M. Ng, Thomas Hain, Keikichi Hirose:
An alignment matching method to explore pseudosyllable properties across different corpora. INTERSPEECH 2012: 863-866
[c45]
  dblp key:
  - conf/interspeech/ChristensenCFGH12
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChristensenCFGH12
Heidi Christensen, Stuart P. Cunningham, Charles Fox, Phil D. Green, Thomas Hain:
A comparative study of adaptive, automatic recognition of disordered speech. INTERSPEECH 2012: 1776-1779
[c44]
  dblp key:
  - conf/interspeech/Al-ShareefH12
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Al-ShareefH12
Sarah Al-Shareef, Thomas Hain:
CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition. INTERSPEECH 2012: 1824-1827
[c43]
  dblp key:
  - conf/slt/AlHarbiH12
  persistent URL:
  - https://dblp.org/rec/conf/slt/AlHarbiH12
Ghada AlHarbi, Thomas Hain:
Automatic transcription of academic lectures from diverse disciplines. SLT 2012: 398-403
[c42]
  dblp key:
  - conf/sltu/KamperWHN12
  persistent URL:
  - https://dblp.org/rec/conf/sltu/KamperWHN12
Herman Kamper, Febe de Wet, Thomas Hain, Thomas Niesler:
Resource development and experiments in automatic south african broadcast news transcription. SLTU 2012: 102-106
[c41]
  dblp key:
  - conf/taln/LecorveDHM12
  persistent URL:
  - https://dblp.org/rec/conf/taln/LecorveDHM12
Gwénolé Lecorvé, John Dines, Thomas Hain, Petr Motlícek:
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web (Impact of the level of supervision on Web-based language model domain adaptation) [in French]. JEP-TALN-RECITAL 2012: 193-200
2011
[c40]
  dblp key:
  - conf/interspeech/MarinoH11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MarinoH11
Davide Marino, Thomas Hain:
An Analysis of Automatic Speech Recognition with Multiple Microphones. INTERSPEECH 2011: 1281-1284
[c39]
  dblp key:
  - conf/interspeech/Al-ShareefH11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Al-ShareefH11
Sarah Al-Shareef, Thomas Hain:
An Investigation in Speech Recognition for Colloquial Arabic. INTERSPEECH 2011: 2869-2872
[c38]
  dblp key:
  - conf/interspeech/KemptonMH11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KemptonMH11
Timothy Kempton, Roger K. Moore, Thomas Hain:
Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known. INTERSPEECH 2011: 3165-3168
[c37]
  dblp key:
  - conf/interspeech/WrigleyH11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WrigleyH11
Stuart N. Wrigley, Thomas Hain:
Web-Based Automatic Speech Recognition Service - webASR. INTERSPEECH 2011: 3265-3268
[c36]
  dblp key:
  - conf/interspeech/WrigleyH11a
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WrigleyH11a
Stuart N. Wrigley, Thomas Hain:
Making an Automatic Speech Recognition Service Freely Available on the Web. INTERSPEECH 2011: 3325-3326
[c35]
  dblp key:
  - conf/interspeech/TuckerFWWH11
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuckerFWWH11
Roger C. F. Tucker, Dan Fry, Vincent Wan, Stuart N. Wrigley, Thomas Hain:
Extending Audio Notetaker to Browse WebASR Transcriptions. INTERSPEECH 2011: 3329-3330
2010
[j5]
  dblp key:
  - journals/spl/HannaniH10
  persistent URL:
  - https://dblp.org/rec/journals/spl/HannaniH10
Asmaa El Hannani, Thomas Hain:
Automatic Optimization of Speech Decoder Parameters. IEEE Signal Process. Lett. 17(1): 95-98 (2010)
[j4]
  dblp key:
  - journals/taslp/0002H10
  persistent URL:
  - https://dblp.org/rec/journals/taslp/0002H10
Matt Gibson, Thomas Hain:
Error Approximation and Minimum Phone Error Acoustic Model Estimation. IEEE Trans. Speech Audio Process. 18(6): 1269-1279 (2010)
[c34]
  dblp key:
  - conf/interspeech/HainBDGHHKLW10
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainBDGHHKLW10
Thomas Hain, Lukás Burget, John Dines, Philip N. Garner, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, Vincent Wan:
The AMIDA 2009 meeting transcription system. INTERSPEECH 2010: 358-361

2000 – 2009

2009
[c33]
  dblp key:
  - conf/interspeech/GarnerDHHKKLWZ09
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GarnerDHHKKLWZ09
Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiát, Danil Korchagin, Mike Lincoln, Vincent Wan, Le Zhang:
Real-time ASR from meetings. INTERSPEECH 2009: 2119-2122
2008
[c32]
  dblp key:
  - conf/interspeech/HainHWW08
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainHWW08
Thomas Hain, Asmaa El Hannani, Stuart N. Wrigley, Vincent Wan:
Automatic speech recognition for scientific purposes - webASR. INTERSPEECH 2008: 504-507
[c31]
  dblp key:
  - conf/interspeech/KarafiatBHC08
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KarafiatBHC08
Martin Karafiát, Lukás Burget, Thomas Hain, Jan Cernocký:
Discriminative training of narrow band - wide band adapted systems for meeting recognition. INTERSPEECH 2008: 1217-1220
[c30]
  dblp key:
  - conf/slt/WanDHH08
  persistent URL:
  - https://dblp.org/rec/conf/slt/WanDHH08
Vincent Wan, John Dines, Asmaa El Hannani, Thomas Hain:
Bob: A lexicon and pronunciation dictionary generator. SLT 2008: 217-220
2007
[c29]
  dblp key:
  - conf/asru/RenalsHB07
  persistent URL:
  - https://dblp.org/rec/conf/asru/RenalsHB07
Steve Renals, Thomas Hain, Hervé Bourlard:
Recognition and understanding of meetings the AMI and AMIDA projects. ASRU 2007: 238-247
[c28]
  dblp key:
  - conf/clear/HainBDGKLLW07
  persistent URL:
  - https://dblp.org/rec/conf/clear/HainBDGKLLW07
Thomas Hain, Lukás Burget, John Dines, Giulia Garau, Martin Karafiát, David A. van Leeuwen, Mike Lincoln, Vincent Wan:
The 2007 AMI(DA) System for Meeting Transcription. CLEAR 2007: 414-428
[c27]
  dblp key:
  - conf/icassp/HainWBKDVGL07
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HainWBKDVGL07
Thomas Hain, Vincent Wan, Lukás Burget, Martin Karafiát, John Dines, Jithendra Vepa, Giulia Garau, Mike Lincoln:
The AMI System for the Transcription of Speech in Meetings. ICASSP (4) 2007: 357-360
[c26]
  dblp key:
  - conf/interspeech/GibsonH07
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GibsonH07
Matthew Gibson, Thomas Hain:
Temporal masking for unsupervised minimum Bayes risk speaker adaptation. INTERSPEECH 2007: 238-241
[c25]
  dblp key:
  - conf/interspeech/KarafiatBCH07
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KarafiatBCH07
Martin Karafiát, Lukás Burget, Jan Cernocký, Thomas Hain:
Application of CMLLR in narrow band wide band adapted systems. INTERSPEECH 2007: 282-285
2006
[j3]
  dblp key:
  - journals/taslp/HainWEGLMPW06
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HainWEGLMPW06
Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Corrections to "Automatic Transcription of Conversational Telephone Speech". IEEE Trans. Speech Audio Process. 14(2): 727-727 (2006)
[c24]
  dblp key:
  - conf/icassp/WanH06
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WanH06
Vincent Wan, Thomas Hain:
Strategies for Language Model Web-Data Collection. ICASSP (1) 2006: 1069-1072
[c23]
  dblp key:
  - conf/interspeech/DinesVH06
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DinesVH06
John Dines, Jithendra Vepa, Thomas Hain:
The segmentation of multi-channel meeting recordings for automatic speech recognition. INTERSPEECH 2006
[c22]
  dblp key:
  - conf/interspeech/GibsonH06
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GibsonH06
Matthew Gibson, Thomas Hain:
Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition. INTERSPEECH 2006
[c21]
  dblp key:
  - conf/interspeech/UragaH06
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/UragaH06
Esmeralda Uraga, Thomas Hain:
Automatic speech recognition experiments with articulatory data. INTERSPEECH 2006
[c20]
  dblp key:
  - conf/mlmi/Al-HamesHCSPMMLOBBCGJMRRRRRSTZ06
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/Al-HamesHCSPMMLOBBCGJMRRRRRSTZ06
Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew H. C. Thean, Pavel Zemcík:
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. MLMI 2006: 24-35
[c19]
  dblp key:
  - conf/mlmi/MooreDMVCH06
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/MooreDMVCH06
Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain:
Juicer: A Weighted Finite-State Transducer Speech Decoder. MLMI 2006: 285-296
[c18]
  dblp key:
  - conf/mlmi/HainBDGKLVW06
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/HainBDGKLVW06
Thomas Hain, Lukás Burget, John Dines, Giulia Garau, Martin Karafiát, Mike Lincoln, Jithendra Vepa, Vincent Wan:
The AMI Meeting Transcription System: Progress and Performance. MLMI 2006: 419-431
2005
[j2]
  dblp key:
  - journals/speech/Hain05
  persistent URL:
  - https://dblp.org/rec/journals/speech/Hain05
Thomas Hain:
Implicit modelling of pronunciation variation in automatic speech recognition. Speech Commun. 46(2): 171-188 (2005)
[j1]
  dblp key:
  - journals/taslp/HainWEGLMPW05
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HainWEGLMPW05
Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Automatic transcription of conversational telephone speech. IEEE Trans. Speech Audio Process. 13(6): 1173-1185 (2005)
[c17]
  dblp key:
  - conf/amcs/HainM05
  persistent URL:
  - https://dblp.org/rec/conf/amcs/HainM05
Thomas Hain, David Mercer:
Fast Floating Point Square Root. AMCS 2005: 33-39
[c16]
  dblp key:
  - conf/cisst/HainL05
  persistent URL:
  - https://dblp.org/rec/conf/cisst/HainL05
Thomas Hain, David Langan:
A Fast, Practical Algorithm for the Trapezoidation of Simple Polygons. CISST 2005: 98-108
[c15]
  dblp key:
  - conf/interspeech/GarauRH05
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GarauRH05
Giulia Garau, Steve Renals, Thomas Hain:
Applying vocal tract length normalization to meeting recordings. INTERSPEECH 2005: 265-268
[c14]
  dblp key:
  - conf/interspeech/HainDGKMWOR05
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainDGKMWOR05
Thomas Hain, John Dines, Giulia Garau, Martin Karafiát, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals:
Transcription of conference room meetings: an investigation. INTERSPEECH 2005: 1661-1664
[c13]
  dblp key:
  - conf/mlmi/CarlettaABFGHKKKKLLLMPRW05
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/CarlettaABFGHKKKKLLLMPRW05
Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wilfried M. Post, Dennis Reidsma, Pierre Wellner:
The AMI Meeting Corpus: A Pre-announcement. MLMI 2005: 28-39
[c12]
  dblp key:
  - conf/mlmi/HainBDMGKLMWOR05
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/HainBDMGKLMWOR05
Thomas Hain, Lukás Burget, John Dines, Iain McCowan, Giulia Garau, Martin Karafiát, Mike Lincoln, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals:
The Development of the AMI System for the Transcription of Speech in Meetings. MLMI 2005: 344-356
[c11]
  dblp key:
  - conf/mlmi/HainBDGKLMMWOR05
  persistent URL:
  - https://dblp.org/rec/conf/mlmi/HainBDGKLMMWOR05
Thomas Hain, Lukás Burget, John Dines, Giulia Garau, Martin Karafiát, Mike Lincoln, Iain McCowan, Darren Moore, Vincent Wan, Roeland Ordelman, Steve Renals:
The 2005 AMI System for the Transcription of Speech in Meetings. MLMI 2005: 450-462
2004
[c10]
  dblp key:
  - conf/icassp/EvermannCGHLMWW04
  persistent URL:
  - https://dblp.org/rec/conf/icassp/EvermannCGHLMWW04
Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Thomas Hain, Xunying Liu, David Mrva, Lan Wang, Philip C. Woodland:
Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP (1) 2004: 249-252
[c9]
  dblp key:
  - conf/interspeech/KimUGHW04
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimUGHW04
Do Yeong Kim, Srinivasan Umesh, Mark J. F. Gales, Thomas Hain, Philip C. Woodland:
Using VTLN for broadcast news transcription. INTERSPEECH 2004: 1953-1956
2001
[c8]
  dblp key:
  - conf/icassp/HainWEP01
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HainWEP01
Thomas Hain, Philip C. Woodland, Gunnar Evermann, Daniel Povey:
New features in the CU-HTK system for transcription of conversational telephone speech. ICASSP 2001: 57-60
2000
[c7]
  dblp key:
  - conf/interspeech/HainW00
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainW00
Thomas Hain, Philip C. Woodland:
Modelling sub-phone insertions and deletions in continuous speech recognition. INTERSPEECH 2000: 172-175

1990 – 1999

1999
[c6]
  dblp key:
  - conf/icassp/HainWNW99
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HainWNW99
Thomas Hain, Philip C. Woodland, Thomas Niesler, Edward W. D. Whittaker:
The 1998 HTK system for transcription of conversational telephone speech. ICASSP 1999: 57-60
[c5]
  dblp key:
  - conf/interspeech/HainW99
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainW99
Thomas Hain, Philip C. Woodland:
Dynamic HMM selection for continuous speech recognition. EUROSPEECH 1999
[c4]
  dblp key:
  - conf/interspeech/WoodlandOHMNTW99
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WoodlandOHMNTW99
Philip C. Woodland, J. J. Odell, Thomas Hain, Gareth L. Moore, Thomas Niesler, Andreas Tuerk, Edward W. D. Whittaker:
Improvements in accuracy and speed in the HTK broadcast news transcription system. EUROSPEECH 1999: 1043-1046
1998
[c3]
  dblp key:
  - conf/icassp/WoodlandHJNTY98
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WoodlandHJNTY98
Philip C. Woodland, Thomas Hain, Sue E. Johnson, Thomas Niesler, Andreas Tuerk, Steve J. Young:
Experiments in broadcast news transcription. ICASSP 1998: 909-912
[c2]
  dblp key:
  - conf/interspeech/HainW98
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HainW98
Thomas Hain, Philip C. Woodland:
Segmentation and classification of broadcast news audio. ICSLP 1998
1994
[c1]
  dblp key:
  - conf/icassp/HurtgenH94
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HurtgenH94
Bernd Hurtgen, Thomas Hain:
On the convergence of fractal transforms. ICASSP (5) 1994: 561-564
