


default search action
Tuomas Virtanen
Person information
- affiliation: Tampere University of Technology, Finland
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j47]Huang Xie
, Khazar Khorrami
, Okko Räsänen
, Tuomas Virtanen
:
Text-Based Audio Retrieval by Learning From Similarities Between Audio Captions. IEEE Signal Process. Lett. 32: 221-225 (2025) - 2024
- [j46]Laura Hekanaho
, Maija Hirvonen
, Tuomas Virtanen:
Language-based machine perception: linguistic perspectives on the compilation of captioning datasets. Digit. Scholarsh. Humanit. 39(3): 864-883 (2024) - [j45]Szymon Drgas
, Lars Bramsløw
, Archontis Politis
, Gaurav Naithani
, Tuomas Virtanen
:
Dynamic Processing Neural Network Architecture for Hearing Loss Compensation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 203-214 (2024) - [j44]Michael Neri
, Archontis Politis
, Daniel Aleksander Krause
, Marco Carli
, Tuomas Virtanen
:
Speaker Distance Estimation in Enclosures From Single-Channel Audio. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2242-2254 (2024) - [c182]Aapo Hakala, Trevor Kincy, Tuomas Virtanen:
Automatic Live Music Song Identification Using Multi-level Deep Sequence Similarity Learning. EUSIPCO 2024: 31-35 - [c181]Wang Dai, Xiaofei Li, Archontis Politis, Tuomas Virtanen:
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement. EUSIPCO 2024: 241-245 - [c180]John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen:
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning. EUSIPCO 2024: 902-906 - [c179]Mikko Heikkinen, Archontis Politis, Tuomas Virtanen:
Neural Ambisonics Encoding For Compact Irregular Microphone Arrays. ICASSP 2024: 701-705 - [c178]Yuzhu Wang
, Archontis Politis
, Tuomas Virtanen:
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios. ICASSP 2024: 11221-11225 - [c177]Martin Moritz, Toni Olán, Tuomas Virtanen:
Noise-to-Mask Ratio Loss for Deep Neural Network Based Audio Watermarking. IS2 2024: 1-6 - [c176]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Multi-Label Zero-Shot Audio Classification with Temporal Attention. IWAENC 2024: 250-254 - [i84]Mikko Heikkinen, Archontis Politis, Tuomas Virtanen:
Neural Ambisonics encoding for compact irregular microphone arrays. CoRR abs/2401.05916 (2024) - [i83]John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen:
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning. CoRR abs/2403.08525 (2024) - [i82]Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen:
Speaker Distance Estimation in Enclosures from Single-Channel Audio. CoRR abs/2403.17514 (2024) - [i81]Andreas Triantafyllopoulos, Iosif Tsangko, Alexander Gebhard, Annamaria Mesaros, Tuomas Virtanen, Björn W. Schuller:
Computer Audition: From Task-Specific Machine Learning to Foundation Models. CoRR abs/2407.15672 (2024) - [i80]Martin Moritz, Toni Olán, Tuomas Virtanen:
Noise-to-mask Ratio Loss for Deep Neural Network based Audio Watermarking. CoRR abs/2408.15553 (2024) - [i79]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Multi-label Zero-Shot Audio Classification with Temporal Attention. CoRR abs/2409.00408 (2024) - [i78]Jaime Garcia-Martinez, David Diaz-Guerra, Archontis Politis, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas:
SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation. CoRR abs/2409.10995 (2024) - [i77]Annamaria Mesaros, Romain Serizel, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
A decade of DCASE: Achievements, practices, evaluations and future challenges. CoRR abs/2410.04951 (2024) - [i76]Esa Räsänen, Niko Gullsten, Otto Pulkkinen, Tuomas Virtanen:
Timing and Dynamics of the Rosanna Shuffle. CoRR abs/2411.06892 (2024) - 2023
- [c175]Paul Magron, Tuomas Virtanen
:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. EUSIPCO 2023: 36-40 - [c174]David Diaz-Guerra
, Archontis Politis
, Tuomas Virtanen
:
Position Tracking of a Varying Number of Sound Sources with Sliding Permutation Invariant Training. EUSIPCO 2023: 251-255 - [c173]Khazar Khorrami
, María Andrea Cruz Blandón
, Tuomas Virtanen, Okko Räsänen
:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. EUSIPCO 2023: 431-435 - [c172]Parthasaarathy Sudarsanam
, Tuomas Virtanen
:
Attention-Based Methods For Audio Question Answering. EUSIPCO 2023: 750-754 - [c171]Huang Xie
, Okko Räsänen
, Tuomas Virtanen
:
On Negative Sampling for Contrastive Audio-Text Retrieval. ICASSP 2023: 1-5 - [c170]Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen
:
Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes. INTERSPEECH 2023: 301-305 - [c169]Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. NeurIPS 2023 - [c168]Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos
, Tuomas Virtanen
:
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning. WASPAA 2023: 1-5 - [c167]Michael Neri
, Archontis Politis
, Daniel Krause, Marco Carli, Tuomas Virtanen
:
Single-Channel Speaker Distance Estimation in Reverberant Environments. WASPAA 2023: 1-5 - [d18]Archontis Politis
, Kazuki Shimada
, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne
, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji
, Tuomas Virtanen
:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.0.0. Zenodo, 2023 [all versions] - [d17]Archontis Politis
, Kazuki Shimada
, Parthasaarathy Sudarsanam, Aapo Hakala, Shusuke Takahashi, Daniel Aleksander Krause, Naoya Takahashi, Sharath Adavanne
, Yuichiro Koyama, Kengo Uchida, Yuki Mitsufuji
, Tuomas Virtanen
:
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023. Version 1.1.0. Zenodo, 2023 [all versions] - [i75]Paul Magron, Tuomas Virtanen:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. CoRR abs/2303.01864 (2023) - [i74]Wang Dai, Archontis Politis, Tuomas Virtanen:
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation. CoRR abs/2303.07816 (2023) - [i73]Shayan Gharib, Minh Tran, Diep Luong, Konstantinos Drossos, Tuomas Virtanen:
Adversarial Representation Learning for Robust Privacy Preservation in Audio. CoRR abs/2305.00011 (2023) - [i72]Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen:
Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes. CoRR abs/2305.18045 (2023) - [i71]Parthasaarathy Sudarsanam, Tuomas Virtanen:
Attention-Based Methods For Audio Question Answering. CoRR abs/2305.19769 (2023) - [i70]Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. CoRR abs/2306.02972 (2023) - [i69]David Diaz-Guerra, Archontis Politis, Antonio Miguel, José Ramón Beltrán, Tuomas Virtanen:
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications. CoRR abs/2306.08510 (2023) - [i68]Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam
, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023) - [i67]Huang Xie, Khazar Khorrami, Okko Räsänen, Tuomas Virtanen:
Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances. CoRR abs/2306.09820 (2023) - [i66]Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning. CoRR abs/2308.04960 (2023) - [i65]Szymon Drgas, Lars Bramsløw, Archontis Politis, Gaurav Naithani, Tuomas Virtanen:
Dynamic Processing Neural Network Architecture For Hearing Loss Compensation. CoRR abs/2310.16550 (2023) - [i64]Yuzhu Wang, Archontis Politis, Tuomas Virtanen:
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios. CoRR abs/2312.10756 (2023) - 2022
- [j43]Björn W. Schuller
, Yonina C. Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen
, Jianhua Tao:
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases. IEEE J. Sel. Top. Signal Process. 16(2): 159-163 (2022) - [j42]Shanshan Wang
, Archontis Politis
, Annamaria Mesaros
, Tuomas Virtanen
:
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment. IEEE J. Sel. Top. Signal Process. 16(6): 1467-1479 (2022) - [c166]Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge. DCASE 2022 - [c165]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022 - [c164]Huang Xie, Samuel Lipping, Tuomas Virtanen:
Language-Based Audio Retrieval Task in DCASE 2022 Challenge. DCASE 2022 - [c163]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. EUSIPCO 2022: 1-5 - [c162]Ville-Veikko Eklund, Aleksandr Diment, Tuomas Virtanen:
Noise, Device and Room Robustness Methods for Pronunciation Error Detection. EUSIPCO 2022: 140-144 - [c161]Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. EUSIPCO 2022: 1140-1144 - [c160]Huang Xie
, Okko Räsänen
, Konstantinos Drossos
, Tuomas Virtanen
:
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases. ICASSP 2022: 8867-8871 - [c159]Yanxiong Li, Wenchang Cao, Konstantinos Drossos
, Tuomas Virtanen
:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. MMSP 2022: 1-6 - [c158]Gaurav Naithani
, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen
:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. MMSP 2022: 1-6 - [d16]Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos
, Tuomas Virtanen
:
Clotho-AQA dataset. Zenodo, 2022 - [d15]Archontis Politis
, Sharath Adavanne
, Tuomas Virtanen
:
TAU Spatial Room Impulse Response Database (TAU-SRIR DB). Zenodo, 2022 - [d14]Adavanne Politis, Yuki Mitsufuji
, Parthasaarathy Sudarsanam, Kazuki Shimada
, Sharath Adavanne
, Yuichiro Koyama, Daniel Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen
:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.0.0. Zenodo, 2022 [all versions] - [d13]Archontis Politis
, Yuki Mitsufuji
, Parthasaarathy Sudarsanam, Kazuki Shimada
, Sharath Adavanne
, Yuichiro Koyama, Daniel Aleksander Krause, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen
:
STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset. Version 1.1.0. Zenodo, 2022 [all versions] - [i63]Samuel Lipping, Parthasaarathy Sudarsanam
, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. CoRR abs/2204.09634 (2022) - [i62]Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment. CoRR abs/2206.00970 (2022) - [i61]Archontis Politis, Kazuki Shimada
, Parthasaarathy Sudarsanam
, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji
, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022) - [i60]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. CoRR abs/2206.04984 (2022) - [i59]Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. CoRR abs/2208.02406 (2022) - [i58]Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. CoRR abs/2208.05057 (2022) - [i57]David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Position tracking of a varying number of sound sources with sliding permutation invariant training. CoRR abs/2210.14536 (2022) - [i56]Huang Xie, Okko Räsänen, Tuomas Virtanen:
On Negative Sampling for Contrastive Audio-Text Retrieval. CoRR abs/2211.04070 (2022) - 2021
- [j41]Szymon Drgas, Tuomas Virtanen
:
Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary. Comput. Speech Lang. 70: 101223 (2021) - [j40]Annamaria Mesaros
, Toni Heittola
, Tuomas Virtanen
, Mark D. Plumbley
:
Sound Event Detection: A tutorial. IEEE Signal Process. Mag. 38(5): 67-83 (2021) - [j39]Archontis Politis
, Annamaria Mesaros
, Sharath Adavanne
, Toni Heittola
, Tuomas Virtanen
:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. IEEE ACM Trans. Audio Speech Lang. Process. 29: 684-698 (2021) - [j38]Huang Xie
, Tuomas Virtanen
:
Zero-Shot Audio Classification Via Semantic Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1233-1242 (2021) - [c157]Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Audio-Visual Scene Classification: Analysis of DCASE 2021 Challenge Submissions. DCASE 2021: 45-49 - [c156]Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems. DCASE 2021: 85-89 - [c155]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. DCASE 2021: 125-129 - [c154]Shanshan Wang
, Gaurav Naithani
, Archontis Politis
, Tuomas Virtanen
:
Deep Neural Network Based Low-Latency Speech Separation with Asymmetric Analysis-Synthesis Window Pair. EUSIPCO 2021: 301-305 - [c153]Pasi Pertilä, Emre Cakir
, Aapo Hakala, Eemi Fagerlund, Tuomas Virtanen
, Archontis Politis
, Antti J. Eronen:
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments. EUSIPCO 2021: 406-410 - [c152]Slobodan Djukanovic, Yash Patel, Jirí Matas
, Tuomas Virtanen
:
Neural network-based acoustic vehicle counting. EUSIPCO 2021: 561-565 - [c151]An Tran, Konstantinos Drossos
, Tuomas Virtanen
:
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. EUSIPCO 2021: 576-580 - [c150]Huang Xie
, Okko Räsänen
, Tuomas Virtanen
:
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections. ICASSP 2021: 326-330 - [c149]Xavier Favory, Konstantinos Drossos
, Tuomas Virtanen
, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. ICASSP 2021: 596-600 - [c148]Shanshan Wang
, Annamaria Mesaros
, Toni Heittola, Tuomas Virtanen
:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. ICASSP 2021: 626-630 - [c147]Björn W. Schuller, Tuomas Virtanen
, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros
, Konstantinos Drossos
:
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence. ICMI 2021: 788-792 - [c146]Sharath Adavanne
, Archontis Politis
, Tuomas Virtanen
:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. WASPAA 2021: 211-215 - [d12]Konstantinos Drossos
, Samuel Lipping, Tuomas Virtanen
:
Clotho dataset. Version 2.0. Zenodo, 2021 [all versions] - [d11]Konstantinos Drossos
, Samuel Lipping, Tuomas Virtanen
:
Clotho dataset. Version 2.1. Zenodo, 2021 [all versions] - [d10]Archontis Politis
, Sharath Adavanne
, Tuomas Virtanen
:
TAU-NIGENS Spatial Sound Events 2021. Version 1. Zenodo, 2021 [all versions] - [d9]Archontis Politis
, Sharath Adavanne
, Tuomas Virtanen
:
TAU-NIGENS Spatial Sound Events 2021. Version 1.1.0. Zenodo, 2021 [all versions] - [d8]Archontis Politis
, Sharath Adavanne
, Tuomas Virtanen
:
TAU-NIGENS Spatial Sound Events 2021. Version 1.2.0. Zenodo, 2021 [all versions] - [i55]Shanshan Wang, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Audio-visual scene classification: analysis of DCASE 2021 Challenge submissions. CoRR abs/2105.13675 (2021) - [i54]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. CoRR abs/2106.06999 (2021) - [i53]Shanshan Wang, Gaurav Naithani, Archontis Politis, Tuomas Virtanen:
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair. CoRR abs/2106.11794 (2021) - [i52]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. CoRR abs/2111.00030 (2021) - 2020
- [j37]Paul Magron
, Tuomas Virtanen
:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. IEEE Signal Process. Lett. 27: 306-310 (2020) - [j36]Shuyang Zhao
, Toni Heittola
, Tuomas Virtanen
:
Active Learning for Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2895-2905 (2020) - [c145]Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-Task Regularization Based on Infrequent Classes for Audio Captioning. DCASE 2020: 6-10 - [c144]Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions. DCASE 2020: 56-60 - [c143]Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. DCASE 2020: 110-114 - [c142]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. DCASE 2020: 165-169 - [c141]Niccolò Nicodemo
, Gaurav Naithani
, Konstantinos Drossos
, Tuomas Virtanen
, Roberto Saletti
:
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters. EUSIPCO 2020: 466-470 - [c140]Yanxiong Li, Mingle Liu, Konstantinos Drossos
, Tuomas Virtanen
:
Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks. ICASSP 2020: 286-290 - [c139]Konstantinos Drossos
, Samuel Lipping, Tuomas Virtanen
:
Clotho: an Audio Captioning Dataset. ICASSP 2020: 736-740 - [c138]Konstantinos Drossos
, Stylianos I. Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen
:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. IJCNN 2020: 1-7 - [c137]Slobodan Djukanovic, Jiri Matas
, Tuomas Virtanen
:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. IV 2020: 1608-1614 - [c136]Pyry Pyykkönen, Stylianos I. Mimilakis, Konstantinos Drossos
, Tuomas Virtanen
:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. MMSP 2020: 1-6 - [d7]Konstantinos Drossos
, Samuel Lipping, Tuomas Virtanen
:
Audio captioning DCASE 2020 evaluation (testing) split. Zenodo, 2020 - [d6]Xavier Favory
, Konstantinos Drossos
, Tuomas Virtanen
, Xavier Serra
:
Dataset used in COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. Zenodo, 2020 - [d5]Shayan Gharib, Konstantinos Drossos
, Eemi Fagerlund, Tuomas Virtanen
:
VOICe Dataset. Zenodo, 2020 - [i51]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. CoRR abs/2002.00476 (2020) - [i50]Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active Learning for Sound Event Detection. CoRR abs/2002.05033 (2020) - [i49]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. CoRR abs/2006.01919 (2020) - [i48]Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. CoRR abs/2006.08386 (2020) - [i47]Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning. CoRR abs/2007.02676 (2020) - [i46]Pyry Pyykkönen, Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. CoRR abs/2007.02683 (2020) - [i45]Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-task Regularization Based on Infrequent Classes for Audio Captioning. CoRR abs/2007.04660 (2020) - [i44]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Conditioned Time-Dilated Convolutions for Sound Event Detection. CoRR abs/2007.05183 (2020) - [i43]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. CoRR abs/2009.02792 (2020) - [i42]An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. CoRR abs/2010.11098 (2020) - [i41]Slobodan Djukanovic, Yash Patel, Jiri Matas, Tuomas Virtanen:
Neural Network-based Acoustic Vehicle Counting. CoRR abs/2010.11659 (2020) - [i40]Slobodan Djukanovic, Jiri Matas, Tuomas Virtanen:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. CoRR abs/2010.11716 (2020) - [i39]Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. CoRR abs/2010.14171 (2020)
2010 – 2019
- 2019
- [j35]Víctor M. García-Molla, Pablo San Juan Sebastián, Tuomas Virtanen
, Antonio M. Vidal, Pedro Alonso:
Generalization of the K-SVD algorithm for minimization of β-divergence. Digit. Signal Process. 92: 47-53 (2019) - [j34]Sharath Adavanne
, Archontis Politis
, Joonas Nikunen
, Tuomas Virtanen
:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. IEEE J. Sel. Top. Signal Process. 13(1): 34-48 (2019) - [j33]Hendrik Purwins
, Bo Li
, Tuomas Virtanen
, Jan Schlüter
, Shuo-Yiin Chang, Tara N. Sainath
:
Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 206-219 (2019) - [j32]Paul Magron
, Tuomas Virtanen
:
Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(1): 20-31 (2019) - [j31]Annamaria Mesaros
, Aleksandr Diment, Benjamin Elizalde
, Toni Heittola
, Emmanuel Vincent
, Bhiksha Raj, Tuomas Virtanen
:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019) - [j30]Pablo San Juan Sebastián
, Tuomas Virtanen
, Víctor M. García-Molla, Antonio M. Vidal:
Analysis of an efficient parallel implementation of active-set Newton algorithm. J. Supercomput. 75(3): 1298-1309 (2019) - [c135]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A Multi-room Reverberant Dataset for Sound Event Localization and Detection. DCASE 2019: 10-14 - [c134]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. DCASE 2019: 20-24 - [c133]Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. DCASE 2019: 59-63 - [c132]Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. DCASE 2019: 139-143 - [c131]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups. DCASE 2019: 164-168 - [c130]M. N. Istiaq Ahsan, Csaba Kertész, Annamaria Mesaros
, Toni Heittola, Andrew Knight, Tuomas Virtanen
:
Audio-Based Epileptic Seizure Detection. EUSIPCO 2019: 1-5 - [c129]Shanshan Wang
, Gaurav Naithani
, Tuomas Virtanen
:
Low-latency Deep Clustering for Speech Separation. ICASSP 2019: 76-80 - [c128]Irene Martín-Morató
, Annamaria Mesaros
, Toni Heittola, Tuomas Virtanen
, Maximo Cobos
, Francesc J. Ferri:
Sound Event Envelope Estimation in Polyphonic Mixtures. ICASSP 2019: 935-939 - [c127]Aleksandr Diment, Eemi Fagerlund, Adrian Benfield, Tuomas Virtanen
:
Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks. IJCNN 2019: 1-8 - [c126]Helen L. Bear, Toni Heittola, Annamaria Mesaros
, Emmanouil Benetos
, Tuomas Virtanen
:
City Classification from Multiple Real-World Sound Scenes. WASPAA 2019: 11-15 - [c125]Konstantinos Drossos
, Paul Magron, Tuomas Virtanen
:
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification. WASPAA 2019: 259-263 - [c124]Huang Xie
, Tuomas Virtanen
:
Zero-Shot Audio Classification Based On Class Label Embeddings. WASPAA 2019: 264-267 - [c123]Marc C. Green, Sharath Adavanne
, Damian T. Murphy, Tuomas Virtanen
:
Acoustic Scene Classification Using Higher-Order Ambisonic Features. WASPAA 2019: 328-332 - [c122]Annamaria Mesaros
, Sharath Adavanne
, Archontis Politis
, Toni Heittola, Tuomas Virtanen
:
Joint Measurement of Localization and Detection of Sound Events. WASPAA 2019: 333-337 - [d4]Sharath Adavanne
, Archontis Politis
, Annamaria Mesaros
, Toni Heittola
, Tuomas Virtanen
:
Sound event localization and detection (SELDnet) results. Zenodo, 2019 - [d3]Konstantinos Drossos
, Shayan Gharib, Paul Magron
, Tuomas Virtanen
:
Code of the method presented in the paper: Drossos et al, "Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling," in proceedings of DCASE 2019. Zenodo, 2019 - [d2]Konstantinos Drossos
, Samuel Lipping, Tuomas Virtanen
:
Clotho dataset. Version 1.0. Zenodo, 2019 [all versions] - [i38]Shanshan Wang, Gaurav Naithani, Tuomas Virtanen:
Low-Latency Deep Clustering For Speech Separation. CoRR abs/1902.07033 (2019) - [i37]Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based On The Wasserstein Distance For Acoustic Scene Classification. CoRR abs/1904.10678 (2019) - [i36]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. CoRR abs/1904.12769 (2019) - [i35]Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. CoRR abs/1905.00078 (2019) - [i34]Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City classification from multiple real-world sound scenes. CoRR abs/1905.00979 (2019) - [i33]Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based on Class Label Embeddings. CoRR abs/1905.01926 (2019) - [i32]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A multi-room reverberant dataset for sound event localization and detection. CoRR abs/1905.08546 (2019) - [i31]Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. CoRR abs/1907.08506 (2019) - [i30]Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. CoRR abs/1907.09238 (2019) - [i29]Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: An Audio Captioning Dataset. CoRR abs/1910.09387 (2019) - [i28]Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters. CoRR abs/1911.00527 (2019) - [i27]Paul Magron, Tuomas Virtanen:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. CoRR abs/1911.03128 (2019) - [i26]Shayan Gharib, Konstantinos Drossos, Eemi Fagerlund, Tuomas Virtanen:
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation. CoRR abs/1911.07098 (2019) - 2018
- [j29]Gaurav Naithani
, Jaana Kivinummi, Tuomas Virtanen
, Outi Tammela, Mikko J. Peltola
, Jukka M. Leppänen:
Automatic segmentation of infant cry signals using hidden Markov models. EURASIP J. Audio Speech Music. Process. 2018: 1 (2018) - [j28]Katariina Mahkonen
, Tuomas Virtanen
, Joni-Kristian Kämäräinen:
Cascade of Boolean detector combinations. EURASIP J. Image Video Process. 2018: 61 (2018) - [j27]Joonas Nikunen
, Aleksandr Diment, Tuomas Virtanen
:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 281-295 (2018) - [j26]Annamaria Mesaros
, Toni Heittola, Emmanouil Benetos
, Peter Foster, Mathieu Lagrange, Tuomas Virtanen
, Mark D. Plumbley
:
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 379-393 (2018) - [j25]Julio J. Carabias-Orti
, Joonas Nikunen
, Tuomas Virtanen
, Pedro Vera-Candeas
:
Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1512-1527 (2018) - [c121]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. DCASE 2018: 9-13 - [c120]Shayan Gharib, Konstantinos Drossos, Emre Cakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. DCASE 2018: 138-142 - [c119]Sharath Adavanne
, Archontis Politis
, Tuomas Virtanen
:
Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network. EUSIPCO 2018: 1462-1466 - [c118]Paul Magron, Tuomas Virtanen
:
Bayesian Anisotropic Gaussian Model for Audio Source Separation. ICASSP 2018: 166-170 - [c117]Joonas Nikunen, Tuomas Virtanen
:
Estimation of Time-Varying Room Impulse Responses of Multiple Sound Sources from Observed Mixture and Isolated Source Signals. ICASSP 2018: 421-425 - [c116]Stylianos Ioannis Mimilakis, Konstantinos Drossos
, João Felipe Santos, Gerald Schuller, Tuomas Virtanen
, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. ICASSP 2018: 721-725 - [c115]Sharath Adavanne
, Archontis Politis
, Tuomas Virtanen
:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. IJCNN 2018: 1-7 - [c114]Emre Cakir
, Tuomas Virtanen
:
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input. IJCNN 2018: 1-7 - [c113]Konstantinos Drossos
, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen
, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. IJCNN 2018: 1-8 - [c112]Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation. INTERSPEECH 2018: 332-336 - [c111]Paul Magron, Tuomas Virtanen:
Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization. INTERSPEECH 2018: 856-860 - [c110]Mikko Parviainen, Pasi Pertilä, Tuomas Virtanen
, Peter Grosche:
Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks. IWAENC 2018: 51-55 - [c109]Shuyang Zhao, Toni Heittola, Tuomas Virtanen
:
An Active Learning Method Using Clustering and Committee-Based Sample Selection for Sound Event Classification. IWAENC 2018: 116-120 - [c108]Paul Magron, Tuomas Virtanen
:
Towards Complex Nonnegative Matrix Factorization with the Beta-Divergence. IWAENC 2018: 156-160 - [c107]Guangpu Huang
, Toni Heittola, Tuomas Virtanen
:
Using Sequential Information in Polyphonic Sound Event Detection. IWAENC 2018: 291-295 - [c106]Gaurav Naithani
, Joonas Nikunen, Lars Bramslow, Tuomas Virtanen
:
Deep Neural Network Based Speech Separation Optimizing an Objective Estimator of Intelligibility for Low Latency Applications. IWAENC 2018: 386-390 - [c105]Annamaria Mesaros
, Toni Heittola, Tuomas Virtanen
:
Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries. IWAENC 2018: 411-415 - [c104]Konstantinos Drossos
, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen
:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. IWAENC 2018: 421-425 - [c103]Paul Magron, Tuomas Virtanen
:
On Modeling the STFT Phase of Audio Signals with the Von Mises Distribution. IWAENC 2018: 550-554 - [c102]Shayan Gharib, Honain Derrar, Daisuke Niizumi, Tuukka Senttula, Janne Tommola, Toni Heittola, Tuomas Virtanen
, Heikki Huttunen
:
Acoustic Scene Classification: a Competition Review. MLSP 2018: 1-6 - [i25]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features. CoRR abs/1801.09522 (2018) - [i24]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Dmitriy Serdyuk, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
MaD TwinNet: Masker-Denoiser Architecture with Twin Networks for Monaural Sound Source Separation. CoRR abs/1802.00300 (2018) - [i23]Paul Magron, Tuomas Virtanen:
Complex ISNMF: a Phase-Aware Model for Monaural Audio Source Separation. CoRR abs/1802.03156 (2018) - [i22]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Andreas Floros, Tuomas Virtanen, Gerald Schuller:
Close Miking Empirical Practice Verification: A Source Separation Approach. CoRR abs/1802.05132 (2018) - [i21]Emre Çakir, Tuomas Virtanen:
End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input. CoRR abs/1805.03647 (2018) - [i20]Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. CoRR abs/1807.00129 (2018) - [i19]Gaurav Naithani, Joonas Nikunen, Lars Bramsløw, Tuomas Virtanen:
Deep neural network based speech separation optimizing an objective estimator of intelligibility for low latency applications. CoRR abs/1807.06899 (2018) - [i18]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. CoRR abs/1807.09840 (2018) - [i17]Konstantinos Drossos, Paul Magron, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Harmonic-Percussive Source Separation with Deep Neural Networks and Phase Recovery. CoRR abs/1807.11298 (2018) - [i16]Shayan Gharib, Honain Derrar, Daisuke Niizumi, Tuukka Senttula, Janne Tommola, Toni Heittola, Tuomas Virtanen, Heikki Huttunen:
Acoustic Scene Classification: A Competition Review. CoRR abs/1808.02357 (2018) - [i15]Shayan Gharib, Konstantinos Drossos, Emre Çakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. CoRR abs/1808.05777 (2018) - 2017
- [j24]Gaël Richard, Tuomas Virtanen
, Juan Pablo Bello
, Nobutaka Ono
, Hervé Glotin:
Introduction to the Special Section on Sound Scene and Event Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1169-1171 (2017) - [j23]Emre Çakir
, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen
, Tuomas Virtanen
:
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1291-1303 (2017) - [j22]Szymon Drgas
, Tuomas Virtanen
, Jörg Lücke, Antti Hurmalainen:
Binary Non-Negative Matrix Deconvolution for Audio Dictionary Learning. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1644-1656 (2017) - [c101]Sharath Adavanne, Tuomas Virtanen:
Sound Event Detection Using Weakly Labeled Dataset with Stacked Convolutional and Recurrent Neural Network. DCASE 2017: 12-16 - [c100]Emre Cakir, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Rare Sound Event Detection. DCASE 2017: 27-31 - [c99]Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Benjamin Elizalde, Ankit Shah, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System. DCASE 2017: 85-92 - [c98]Daniela Caballero, Roberto Araya, Hanna Kronholm, Jouni Viiri, André Mansikkaniemi, Sami Lehesvuori
, Tuomas Virtanen
, Mikko Kurimo:
ASR in Classroom Today: Automatic Visualization of Conceptual Network in Science Classrooms. EC-TEL 2017: 541-544 - [c97]Joonas Nikunen, Tuomas Virtanen
:
Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation. EUSIPCO 2017: 1255-1259 - [c96]Sharath Adavanne
, Konstantinos Drossos
, Emre Cakir
, Tuomas Virtanen
:
Stacked convolutional and recurrent neural networks for bird audio detection. EUSIPCO 2017: 1729-1733 - [c95]Emre Cakir
, Sharath Adavanne
, Giambattista Parascandolo, Konstantinos Drossos
, Tuomas Virtanen
:
Convolutional recurrent neural networks for bird audio detection. EUSIPCO 2017: 1744-1748 - [c94]Shuyang Zhao, Toni Heittola, Tuomas Virtanen
:
Active learning for sound event classification by clustering unlabeled data. ICASSP 2017: 751-755 - [c93]Sharath Adavanne
, Pasi Pertilä, Tuomas Virtanen
:
Sound event detection using spatial features and convolutional recurrent neural network. ICASSP 2017: 771-775 - [c92]Michele Valenti, Stefano Squartini
, Aleksandr Diment, Giambattista Parascandolo, Tuomas Virtanen
:
A convolutional neural network approach for acoustic scene classification. IJCNN 2017: 1547-1554 - [c91]Stylianos Ioannis Mimilakis, Konstantinos Drossos
, Tuomas Virtanen
, Gerald Schuller:
A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation. MLSP 2017: 1-6 - [c90]Aleksandr Diment, Tuomas Virtanen
:
Transfer learning of weakly labelled audio. WASPAA 2017: 6-10 - [c89]Shuyang Zhao, Toni Heittola, Tuomas Virtanen
:
Learning vocal mode classifiers from heterogeneous data sources. WASPAA 2017: 16-20 - [c88]Gaurav Naithani
, Tom Barker, Giambattista Parascandolo, Lars Bramslow, Niels Henrik Pontoppidan
, Tuomas Virtanen
:
Low latency sound source separation using convolutional recurrent neural networks. WASPAA 2017: 71-75 - [c87]Paul Magron, Jonathan Le Roux, Tuomas Virtanen
:
Consistent anisotropic Wiener filtering for audio source separation. WASPAA 2017: 269-273 - [c86]Annamaria Mesaros
, Toni Heittola, Tuomas Virtanen
:
Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study. WASPAA 2017: 319-323 - [c85]Konstantinos Drossos
, Sharath Adavanne
, Tuomas Virtanen
:
Automated audio captioning with recurrent neural networks. WASPAA 2017: 374-378 - [e3]Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Emmanuel Vincent, Emmanouil Benetos, Benjamin Elizalde:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2017, Munich, Germany, November 16-17, 2017. 2017, ISBN 978-952-15-4042-4 [contents] - [d1]Annamaria Mesaros
, Toni Heittola
, Tuomas Virtanen, Emmanouil Benetos
, Mathieu Lagrange, Grégoire Lafay, Peter Foster, Mark D. Plumbley
:
DCASE2016 Challenge Submissions Package. Zenodo, 2017 - [i14]Emre Çakir, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection. CoRR abs/1702.06286 (2017) - [i13]Emre Çakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, Tuomas Virtanen:
Convolutional Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1703.02317 (2017) - [i12]Sharath Adavanne, Konstantinos Drossos, Emre Çakir, Tuomas Virtanen:
Stacked Convolutional and Recurrent Neural Networks for Bird Audio Detection. CoRR abs/1706.02047 (2017) - [i11]Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen:
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network. CoRR abs/1706.02291 (2017) - [i10]Miroslav Malik, Sharath Adavanne, Konstantinos Drossos, Tuomas Virtanen, Dasa Ticha, Roman Jarina:
Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition. CoRR abs/1706.02292 (2017) - [i9]Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. CoRR abs/1706.02293 (2017) - [i8]Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen:
Automated Audio Captioning with Recurrent Neural Networks. CoRR abs/1706.10006 (2017) - [i7]Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen, Gerald Schuller:
A Recurrent Encoder-Decoder Approach with Skip-filtering Connections for Monaural Singing Voice Separation. CoRR abs/1709.00611 (2017) - [i6]Sharath Adavanne, Tuomas Virtanen:
A report on sound event detection with different binaural features. CoRR abs/1710.02997 (2017) - [i5]Sharath Adavanne, Tuomas Virtanen:
Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network. CoRR abs/1710.02998 (2017) - [i4]Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. CoRR abs/1710.10005 (2017) - [i3]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network. CoRR abs/1710.10059 (2017) - [i2]Stylianos Ioannis Mimilakis, Konstantinos Drossos, João Felipe Santos, Gerald Schuller, Tuomas Virtanen, Yoshua Bengio:
Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask. CoRR abs/1711.01437 (2017) - 2016
- [j21]Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen
, Miikka Vilermo:
Binaural rendering of microphone array captures based on source separation. Speech Commun. 76: 157-169 (2016) - [j20]Tom Barker, Tuomas Virtanen
:
Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2377-2389 (2016) - [c84]Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen:
Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features. DCASE 2016: 6-10 - [c83]Michele Valenti, Aleksandr Diment, Giambattista Parascandolo, Stefano Squartini, Tuomas Virtanen:
DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks. DCASE 2016: 95-99 - [c82]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
TUT database for acoustic scene classification and sound event detection. EUSIPCO 2016: 1128-1132 - [c81]Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen
, Joni-Kristian Kamarainen:
Cascade processing for speeding up sliding window sparse classification. EUSIPCO 2016: 2305-2309 - [c80]Aleksandr Diment, Mikko Parviainen, Tuomas Virtanen
, Roman Zelov, Alex Glasman:
Noise-robust detection of whispering in telephone calls using deep neural networks. EUSIPCO 2016: 2310-2314 - [c79]Gaurav Naithani
, Giambattista Parascandolo, Tom Barker, Niels Henrik Pontoppidan
, Tuomas Virtanen
:
Low-latency sound source separation using deep neural networks. GlobalSIP 2016: 272-276 - [c78]Giambattista Parascandolo, Heikki Huttunen
, Tuomas Virtanen
:
Recurrent neural networks for polyphonic sound event detection in real life recordings. ICASSP 2016: 6440-6444 - [c77]Emre Cakir
, Ezgi Can Ozan, Tuomas Virtanen
:
Filterbank learning for deep neural network based polyphonic sound event detection. IJCNN 2016: 3399-3406 - [e2]Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Mark D. Plumbley, Peter Foster, Emmanouil Benetos, Mathieu Lagrange:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2016, Budapest, Hungary, September 3, 2016. 2016, ISBN 978-952-15-3807-0 [contents] - [i1]Giambattista Parascandolo, Heikki Huttunen, Tuomas Virtanen:
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings. CoRR abs/1604.00861 (2016) - 2015
- [j19]Umut Simsekli, Tuomas Virtanen
, Ali Taylan Cemgil
:
Non-negative tensor factorization models for Bayesian audio processing. Digit. Signal Process. 47: 178-191 (2015) - [j18]Tuomas Virtanen
, Jort Florent Gemmeke, Bhiksha Raj, Paris Smaragdis:
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures. IEEE Signal Process. Mag. 32(2): 125-144 (2015) - [j17]Deepak Baby
, Tuomas Virtanen
, Jort F. Gemmeke, Hugo Van hamme
:
Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1788-1799 (2015) - [c76]Aleksandr Diment, Emre Cakir, Toni Heittola, Tuomas Virtanen:
Automatic recognition of environmental sound events using all-pole group delay features. EUSIPCO 2015: 729-733 - [c75]Emre Cakir, Toni Heittola, Heikki Huttunen, Tuomas Virtanen:
Multi-label vs. combined single-label sound event detection with deep neural networks. EUSIPCO 2015: 2551-2555 - [c74]Szymon Drgas
, Tuomas Virtanen
:
Speaker Verification Using Adaptive Dictionaries in Non-negative Spectrogram Deconvolution. LVA/ICA 2015: 462-469 - [c73]Annamaria Mesaros
, Toni Heittola, Onur Dikmen
, Tuomas Virtanen
:
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations. ICASSP 2015: 151-155 - [c72]Tom Barker, Tuomas Virtanen
, Niels Henrik Pontoppidan
:
Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries. ICASSP 2015: 241-245 - [c71]Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen
:
Similarity induced group sparsity for non-negative matrix factorisation. ICASSP 2015: 4425-4429 - [c70]Deepak Baby
, Jort F. Gemmeke, Tuomas Virtanen
, Hugo Van hamme
:
Exemplar-based speech enhancement for deep neural network based automatic speech recognition. ICASSP 2015: 4485-4489 - [c69]Emre Cakir
, Toni Heittola, Heikki Huttunen
, Tuomas Virtanen
:
Polyphonic sound event detection using multi label deep neural networks. IJCNN 2015: 1-7 - [c68]Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen:
Noise robust speaker recognition with convolutive sparse coding. INTERSPEECH 2015: 244-248 - [c67]Aleksandr Diment, Tuomas Virtanen
:
Archetypal analysis for audio dictionary learning. WASPAA 2015: 1-5 - 2014
- [j16]Toni Heittola, Annamaria Mesaros
, Dani Korpi
, Antti J. Eronen, Tuomas Virtanen
:
Method for creating location-specific audio textures. EURASIP J. Audio Speech Music. Process. 2014: 9 (2014) - [j15]Joonas Nikunen, Tuomas Virtanen
:
Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 22(3): 727-739 (2014) - [j14]Zhizheng Wu, Tuomas Virtanen
, Engsiong Chng
, Haizhou Li
:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014) - [c66]Katariina Mahkonen, Joni-Kristian Kämäräinen, Tuomas Virtanen
:
Lifelog Scene Change Detection Using Cascades of Audio and Video Detectors. ACCV Workshops (3) 2014: 434-444 - [c65]Oguzhan Gencoglu, Tuomas Virtanen, Heikki Huttunen:
Recognition of acoustic events using deep neural networks. EUSIPCO 2014: 506-510 - [c64]Tom Barker, Tuomas Virtanen
, Olivier Delhomme:
Ultrasound-coupled semi-supervised nonnegative matrix factorisation for speech enhancement. ICASSP 2014: 2129-2133 - [c63]Deepak Baby
, Tuomas Virtanen
, Tom Barker, Hugo Van hamme
:
Coupled dictionary training for exemplar-based speech enhancement. ICASSP 2014: 2883-2887 - [c62]Tuomas Virtanen, Bhiksha Raj, Jort F. Gemmeke, Hugo Van hamme
:
Active-set newton algorithm for non-negative sparse coding of audio. ICASSP 2014: 3092-3096 - [c61]Joonas Nikunen, Tuomas Virtanen
:
Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization. ICASSP 2014: 6677-6681 - [c60]Tom Barker, Tuomas Virtanen
:
Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation. IJCNN 2014: 3556-3561 - [c59]Tom Barker, Hugo Van hamme
, Tuomas Virtanen:
Modelling primitive streaming of simple tone sequences through factorisation of modulation pattern tensors. INTERSPEECH 2014: 1371-1375 - [c58]Deepak Baby
, Tuomas Virtanen
, Jort F. Gemmeke, Tom Barker, Hugo Van hamme
:
Exemplar-based noise robust automatic speech recognition using modulation spectrogram features. SLT 2014: 519-524 - 2013
- [j13]Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen
:
Modelling non-stationary noise with spectral factorisation in automatic speech recognition. Comput. Speech Lang. 27(3): 763-779 (2013) - [j12]Toni Heittola, Annamaria Mesaros
, Antti J. Eronen, Tuomas Virtanen
:
Context-dependent sound event detection. EURASIP J. Audio Speech Music. Process. 2013: 1 (2013) - [j11]Dani Korpi
, Toni Heittola, Timo Partala
, Antti J. Eronen, Annamaria Mesaros
, Tuomas Virtanen
:
On the human ability to discriminate audio ambiances from similar locations of an urban environment. Pers. Ubiquitous Comput. 17(4): 761-769 (2013) - [j10]Tuomas Virtanen
, Jort Florent Gemmeke, Bhiksha Raj:
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio. IEEE Trans. Speech Audio Process. 21(11): 2277-2289 (2013) - [c57]Antti Hurmalainen, Tuomas Virtanen:
Learning state labels for sparse classification of speech with matrix deconvolution. ASRU 2013: 168-173 - [c56]Aleksandr Diment, Padmanabhan Rajan, Toni Heittola, Tuomas Virtanen
:
Group Delay Function from All-Pole Models for Musical Instrument Recognition. CMMR 2013: 606-618 - [c55]Aleksandr Diment, Toni Heittola, Tuomas Virtanen:
Semi-supervised learning for musical instrument recognition. EUSIPCO 2013: 1-5 - [c54]Antti Hurmalainen, Tuomas Virtanen:
Acquiring variable length speech bases for factorisation-based noise robust speech recognition. EUSIPCO 2013: 1-5 - [c53]Jort F. Gemmeke, Tuomas Virtanen
, Kris Demuynck:
Exemplar-based joint channel and noise compensation. ICASSP 2013: 868-872 - [c52]Toni Heittola, Annamaria Mesaros
, Tuomas Virtanen
, Moncef Gabbouj
:
Supervised model training for overlapping sound events based on unsupervised source separation. ICASSP 2013: 8677-8681 - [c51]Tom Barker, Tuomas Virtanen:
Non-negative tensor factorisation of modulation spectrograms for monaural sound source separation. INTERSPEECH 2013: 827-831 - [c50]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061 - [c49]Forrest Briggs, Yonghong Huang, Raviv Raich, Konstantinos Eftaxias, Zhong Lei, William Cukierski, Sarah Frey Hadley, Adam Hadley, Matthew Betts
, Xiaoli Z. Fern, Jed Irvine, Lawrence Neal, Anil Thomas, Gábor Fodor, Grigorios Tsoumakas
, Hong Wei Ng, Thi Ngoc Tho Nguyen, Heikki Huttunen
, Pekka Ruusuvuori, Tapio Manninen, Aleksandr Diment, Tuomas Virtanen
, Julien Marzat, Joseph Defretin, Dave Callender, Chris Hurlburt, Ken Larrey, Maxim Milakov:
The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment. MLSP 2013: 1-8 - [c48]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206 - [c47]Joonas Kauppinen, Anssi Klapuri, Tuomas Virtanen
:
Music self-similarity modeling using augmented nonnegative matrix factorization of block and stripe patterns. WASPAA 2013: 1-4 - 2012
- [j9]Elina Helander
, Hanna Silén, Tuomas Virtanen
, Moncef Gabbouj
:
Voice Conversion Using Dynamic Kernel Partial Least Squares Regression. IEEE Trans. Speech Audio Process. 20(3): 806-817 (2012) - [c46]Joonas Nikunen, Tuomas Virtanen, Pasi Pertilä, Miikka Vilermo:
Permutation alignment of frequency-domain ICA by the maximization of intra-source envelope correlations. EUSIPCO 2012: 1489-1493 - [c45]Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen:
Detection, separation and recognition of speech from continuous signals using spectral factorisation. EUSIPCO 2012: 2649-2653 - [c44]Francisco J. Rodríguez-Serrano, Julio J. Carabias-Orti
, Pedro Vera-Candeas
, Tuomas Virtanen
, Nicolás Ruiz-Reyes:
Multiple Instrument Mixtures Source Separation Evaluation Using Instrument-Dependent NMF Models. LVA/ICA 2012: 380-387 - [c43]Antti Hurmalainen, Tuomas Virtanen
:
Modelling spectro-temporal dynamics in factorisation-based noise-robust automatic speech recognition. ICASSP 2012: 4113-4116 - [c42]Felix Weninger, Martin Wöllmer, Jürgen T. Geiger, Björn W. Schuller
, Jort F. Gemmeke, Antti Hurmalainen, Tuomas Virtanen
, Gerhard Rigoll:
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize? ICASSP 2012: 4681-4684 - [c41]Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen:
Group Sparsity for Speaker Identity Discrimination in Factorisation-based Speech Recognition. INTERSPEECH 2012: 2138-2141 - [c40]Tuomas Virtanen:
Human sound perception - what can we learn from it when developing audio analysis algorithms? SAPA@INTERSPEECH 2012 - [c39]Ali Bahrami Rad
, Tuomas Virtanen
:
Phase spectrum prediction of audio signals. ISCCSP 2012: 1-5 - [c38]Rahim Saeidi, Antti Hurmalainen, Tuomas Virtanen, David A. van Leeuwen:
Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification. Odyssey 2012: 248-255 - [p3]Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Introduction. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 1-5 - [p2]Rita Singh, Bhiksha Raj, Tuomas Virtanen:
The Basics of Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 7-30 - [p1]Bhiksha Raj, Tuomas Virtanen, Rita Singh:
The Problem of Robustness in Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 31-50 - [e1]Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Techniques for Noise Robustness in Automatic Speech Recognition. Wiley 2012, ISBN 978-1-119-97088-0 [contents] - 2011
- [j8]Julio J. Carabias-Orti
, Tuomas Virtanen
, Pedro Vera-Candeas
, Nicolás Ruiz-Reyes
, Francisco J. Cañadas-Quesada
:
Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization. IEEE J. Sel. Top. Signal Process. 5(6): 1144-1158 (2011) - [j7]Jort F. Gemmeke, Tuomas Virtanen
, Antti Hurmalainen:
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 19(7): 2067-2080 (2011) - [c37]Jort F. Gemmeke, Antti Hurmalainen, Tuomas Virtanen, Yang Sun:
Toward a practical implementation of exemplar-based noise robust ASR. EUSIPCO 2011: 1490-1494 - [c36]Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtanen
:
Non-negative matrix deconvolution in noise robust speech recognition. ICASSP 2011: 4588-4591 - [c35]Katariina Mahkonen, Antti Hurmalainen, Tuomas Virtanen, Jort F. Gemmeke:
Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition. INTERSPEECH 2011: 465-468 - [c34]Heikki Kallasjoki, Ulpu Remes, Jort F. Gemmeke, Tuomas Virtanen, Kalle J. Palomäki:
Uncertainty Measures for Improving Exemplar-Based Source Separation. INTERSPEECH 2011: 469-472 - [c33]Bhiksha Raj, Rita Singh, Tuomas Virtanen:
Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures. INTERSPEECH 2011: 1217-1220 - [c32]Joonas Nikunen, Tuomas Virtanen
, Miikka Vilermo:
Multichannel audio upmixing based on non-negative tensor factorization representation. WASPAA 2011: 33-36 - 2010
- [j6]Marko Leonard Helén, Tuomas Virtanen
:
Audio Query by Example Using Similarity Measures between Probability Density Functions of Features. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j5]Annamaria Mesaros
, Tuomas Virtanen
:
Automatic Recognition of Lyrics in Singing. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j4]Anssi Klapuri, Tuomas Virtanen
:
Representing Musical Sounds With an Interpolating State Model. IEEE Trans. Speech Audio Process. 18(3): 613-624 (2010) - [j3]Elina Helander
, Tuomas Virtanen
, Jani Nurminen, Moncef Gabbouj
:
Voice Conversion Using Partial Least Squares Regression. IEEE Trans. Speech Audio Process. 18(5): 912-921 (2010) - [c31]Annamaria Mesaros, Toni Heittola, Antti J. Eronen, Tuomas Virtanen:
Acoustic event detection in real life recordings. EUSIPCO 2010: 1267-1271 - [c30]Toni Heittola, Annamaria Mesaros, Antti J. Eronen, Tuomas Virtanen:
Audio context recognition using audio event histograms. EUSIPCO 2010: 1272-1276 - [c29]Sami Keronen, Ulpu Remes, Kalle J. Palomäki, Tuomas Virtanen, Mikko Kurimo:
Comparison of noise robust methods in large vocabulary speech recognition. EUSIPCO 2010: 1973-1977 - [c28]Joonas Nikunen, Tuomas Virtanen
:
Noise-to-mask ratio minimization by weighted non-negative matrix factorization. ICASSP 2010: 25-28 - [c27]Annamaria Mesaros
, Tuomas Virtanen
:
Recognition of phonemes and words in singing. ICASSP 2010: 2146-2149 - [c26]Jort F. Gemmeke, Tuomas Virtanen
:
Noise robust exemplar-based connected digit recognition. ICASSP 2010: 4546-4549 - [c25]Anssi Klapuri, Tuomas Virtanen
, Toni Heittola:
Sound source separation in monaural music signals using excitation-filter model and em algorithm. ICASSP 2010: 5510-5513 - [c24]Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, Rita Singh:
Non-negative matrix factorization based compensation of music for automatic speech recognition. INTERSPEECH 2010: 717-720 - [c23]Tuomas Virtanen, Jort F. Gemmeke, Antti Hurmalainen:
State-based labelling for a sparse representation of speech and its application to robust speech recognition. INTERSPEECH 2010: 893-896 - [c22]Jort F. Gemmeke, Tuomas Virtanen:
Artificial and online acquired noise dictionaries for noise robust ASR. INTERSPEECH 2010: 2082-2085
2000 – 2009
- 2009
- [c21]Annamaria Mesaros, Tuomas Virtanen:
Adaptation of a speech recognizer for singing voice. EUSIPCO 2009: 1779-1783 - [c20]Tuomas Virtanen:
Spectral covariance in prior distributions of non-negative matrix factorization based speech separation. EUSIPCO 2009: 1933-1937 - [c19]Mikko Myllymäki, Tuomas Virtanen:
Non-stationary noise model compensation in voice activity detection. EUSIPCO 2009: 2186-2190 - [c18]Tuomas Virtanen, Toni Heittola:
Interpolating hidden Markov model and its application to automatic instrument recognition. ICASSP 2009: 49-52 - [c17]Tuomas Virtanen, Ali Taylan Cemgil
:
Mixtures of Gamma Priors for Non-negative Matrix Factorization Based Speech Separation. ICA 2009: 646-653 - [c16]Toni Heittola, Anssi Klapuri, Tuomas Virtanen:
Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation. ISMIR 2009: 327-332 - 2008
- [c15]Mikko Myllymäki, Tuomas Virtanen:
Voice activity detection in the presence of breathing noise using neural network and hidden Markov model. EUSIPCO 2008: 1-5 - [c14]Tuomas Virtanen, Ali Taylan Cemgil
, Simon J. Godsill:
Bayesian extensions to non-negative matrix factorisation for audio signal modelling. ICASSP 2008: 1825-1828 - [c13]Matti Ryynänen, Tuomas Virtanen, Jouni Paulus
, Anssi Klapuri:
Accompaniment separation and karaoke application based on automatic melody transcription. ICME 2008: 1417-1420 - [c12]Tuomas Virtanen, Annamaria Mesaros, Matti Ryynänen:
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music. SAPA@INTERSPEECH 2008: 17-22 - 2007
- [j2]Tuomas Virtanen
:
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria. IEEE Trans. Speech Audio Process. 15(3): 1066-1074 (2007) - [c11]Marko Leonard Helén, Tuomas Virtanen:
Query by Example of Audio Signals using Euclidean Distance Between Gaussian Mixture Models. ICASSP (1) 2007: 225-228 - [c10]Annamaria Mesaros, Tuomas Virtanen, Anssi Klapuri:
Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods. ISMIR 2007: 375-378 - 2006
- [c9]Tuomas Virtanen:
Speech recognition using factorial hidden Markov models for separation in the feature space. INTERSPEECH 2006 - 2005
- [c8]Marko Leonard Helén, Tuomas Virtanen:
Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine. EUSIPCO 2005: 1-4 - [c7]Anssi Klapuri, Tuomas Virtanen, Marko Leonard Helén:
Modeling musical sounds with an interpolating state model. EUSIPCO 2005: 1-4 - [c6]Jouni Paulus, Tuomas Virtanen:
Drum transcription with non-negative spectrogram factorisation. EUSIPCO 2005: 1-4 - 2004
- [c5]Tuomas Virtanen:
Separation of sound sources by convolutive sparse coding. SAPA@INTERSPEECH 2004: 55 - 2003
- [c4]Tuomas Virtanen:
Sound Source Separation Using Sparse Coding with Temporal Continuity Objective. ICMC 2003 - 2002
- [c3]Tuomas Virtanen, Anssi Klapuri:
Separation of harmonic sounds using linear models for the overtone series. ICASSP 2002: 1757-1760 - 2000
- [j1]Stephan M. Jakob, Ilkka Korhonen
, Esko Ruokonen, Tuomas Virtanen
, Alex Kogan, Jukka Takala:
Detection of artifacts in monitored trends in intensive care. Comput. Methods Programs Biomed. 63(3): 203-209 (2000) - [c2]Jukka Sillanpaa, Anssi Klapuri, Jarno Seppänen, Tuomas Virtanen:
Recognition of acoustic noise mixtures by combined bottom-up and top-down processing. EUSIPCO 2000: 1-4 - [c1]Tuomas Virtanen, Anssi Klapuri:
Separation of harmonic sound sources using sinusoidal modeling. ICASSP 2000: 765-768
Coauthor Index
aka: Emre Cakir
aka: Jort Florent Gemmeke
aka: Stylianos Ioannis Mimilakis

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-18 02:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint