Aku Rouhe
2020 – today
- 2024
- [b1] Aku Rouhe:
Attention-based End-to-End Models in Language Technology ; Attentiopohjaiset kokonaismallit kieliteknologiassa. Aalto University, Espoo, Finland, 2024
- [j4] Aku Rouhe, Tamás Grósz, Mikko Kurimo:
Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale. IEEE ACM Trans. Audio Speech Lang. Process. 32: 623-638 (2024)
- [i7] Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Pierre Champion, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu, Sangeet Sagar, Jarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Mickael Rouvier, Renato De Mori, Yannick Estève:
Open-Source Conversational AI with SpeechBrain 1.0. CoRR abs/2407.00463 (2024)
- 2023
- [j3] Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Ragheb Al-Ghezi, Mietta Lennes, Tamás Grósz, Krister Lindén, Mikko Kurimo:
Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks. Lang. Resour. Evaluation 57(3): 1295-1327 (2023)
- [j2] Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo:
Finnish parliament ASR corpus. Lang. Resour. Evaluation 57(4): 1645-1670 (2023)
- [c14] Tamás Grósz, Yaroslav Getman, Ragheb Al-Ghezi, Aku Rouhe, Mikko Kurimo:
Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model. INTERSPEECH 2023: 196-200
- [c13] Reima Karhila, Sari Ylinen, Anna-Riikka Smolander, Aku Rouhe, Ragheb Al-Ghezi, Yaroslav Getman, Tamás Grósz, Maria Uther, Mikko Kurimo:
A pronunciation Scoring System Embedded into Children's Foreign Language Learning Games with Experimental Verification of Learning Benefits. SLaTE 2023: 21-25
- 2022
- [c12] Aku Rouhe, Anja Virkkunen, Juho Leinonen, Mikko Kurimo:
Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0. INTERSPEECH 2022: 3543-3547
- [i6] Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Tamás Grósz, Krister Lindén, Mikko Kurimo:
Lahjoita puhetta - a large-scale corpus of spoken Finnish with some benchmarks. CoRR abs/2203.12906 (2022)
- [i5] Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo:
Finnish Parliament ASR corpus - Analysis, benchmarks and statistics. CoRR abs/2203.14876 (2022)
- 2021
- [c11] Ragheb Al-Ghezi, Yaroslav Getman, Aku Rouhe, Raili Hildén, Mikko Kurimo:
Self-Supervised End-to-End ASR for Low Resource L2 Swedish. Interspeech 2021: 1429-1433
- [c10] Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe, Mikko Kurimo:
Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces. NoDaLiDa 2021: 86-93
- [c9] Aku Rouhe, Astrid Van Camp, Mittul Singh, Hugo Van hamme, Mikko Kurimo:
An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR. SPECOM 2021: 602-613
- [i4] Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio:
SpeechBrain: A General-Purpose Speech Toolkit. CoRR abs/2106.04624 (2021)
- 2020
- [j1] Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann:
Multimodal machine translation through visuals and speech. Mach. Transl. 34(2-3): 97-147 (2020)
- [c8] Aku Rouhe, Tuomas Kaseva, Mikko Kurimo:
Speaker-Aware Training of Attention-Based End-to-End Speech Recognition Using Neural Speaker Embeddings. ICASSP 2020: 7064-7068
- [c7] Abhilash Jain, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo:
Finnish ASR with Deep Transformer Models. INTERSPEECH 2020: 3630-3634
- [i3] Abhilash Jain, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo:
Finnish Language Modeling with Deep Transformer Models. CoRR abs/2003.11562 (2020)
2010 – 2019
- 2019
- [c6] Tuomas Kaseva, Aku Rouhe, Mikko Kurimo:
Spherediar: An Effective Speaker Diarization System for Meeting Data. ASRU 2019: 373-380
- [i2] Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann:
Multimodal Machine Translation through Visuals and Speech. CoRR abs/1911.12798 (2019)
- 2018
- [c5] Aku Rouhe, Reima Karhila, Aija Elg, Minnaleena Toivola, Peter Smit, Anna-Riikka Smolander, Mikko Kurimo:
Captaina: Integrated Pronunciation Practice and Data Collection Portal. INTERSPEECH 2018: 1051-1052
- [c4] Umut Sulubacak, Jörg Tiedemann, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo:
The MeMAD Submission to the IWSLT 2018 Speech Translation Task. IWSLT 2018: 89-94
- [i1] Umut Sulubacak, Jörg Tiedemann, Aku Rouhe, Stig-Arne Grönroos, Mikko Kurimo:
The MeMAD Submission to the IWSLT 2018 Speech Translation Task. CoRR abs/1810.10320 (2018)
- 2017
- [c3] Aku Rouhe, Reima Karhila, Peter Smit, Mikko Kurimo:
Reading Validation for Pronunciation Evaluation in the Digitala Project. INTERSPEECH 2017: 2050-2051
- [c2] Aku Rouhe, Reima Karhila, Heini Kallio, Mikko Kurimo:
A pipeline for automatic assessment of foreign language pronunciation. SLaTE 2017: 190
- 2016
- [c1] Reima Karhila, Aku Rouhe, Peter Smit, André Mansikkaniemi, Heini Kallio, Erik Lindroos, Raili Hildén, Martti Vainio, Mikko Kurimo:
Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination. INTERSPEECH 2016: 784-785
last updated on 2024-09-20 00:37 CEST by the dblp team
all metadata released as open data under CC0 1.0 license