default search action
Annamaria Mesaros
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j11]Daniel Aleksander Krause, Guillermo García-Barrios, Archontis Politis, Annamaria Mesaros:
Binaural Sound Source Distance Estimation and Localization for a Moving Listener. IEEE ACM Trans. Audio Speech Lang. Process. 32: 996-1011 (2024) - [c47]Manu Harju, Irene Martín-Morató, Toni Heittola, Annamaria Mesaros:
Sound Event Detection with Soft Labels: A New Perspective on Evaluation. EUSIPCO 2024: 66-70 - [c46]Manjunath Mulimani, Annamaria Mesaros:
Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations. EUSIPCO 2024: 96-100 - [c45]Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros:
Sound Event Detection and Localization with Distance Estimation. EUSIPCO 2024: 286-290 - [c44]Shanshan Wang, Soumya Tripathy, Toni Heittola, Annamaria Mesaros:
Positive and Negative Sampling Strategies for Self-Supervised Learning on Audio-Video Data. ICASSP Workshops 2024: 545-549 - [c43]Manjunath Mulimani, Annamaria Mesaros:
Class-Incremental Learning for Multi-Label Audio Classification. ICASSP 2024: 916-920 - [i18]Manjunath Mulimani, Annamaria Mesaros:
Class-Incremental Learning for Multi-Label Audio Classification. CoRR abs/2401.04447 (2024) - [i17]Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros:
Sound Event Detection and Localization with Distance Estimation. CoRR abs/2403.11827 (2024) - [i16]Florian Schmid, Paul Primus, Toni Heittola, Annamaria Mesaros, Irene Martín-Morató, Khaled Koutini, Gerhard Widmer:
Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge. CoRR abs/2405.10018 (2024) - [i15]Samuele Cornell, Janek Ebbers, Constance Douwes, Irene Martín-Morató, Manu Harju, Annamaria Mesaros, Romain Serizel:
DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels. CoRR abs/2406.08056 (2024) - [i14]Manjunath Mulimani, Annamaria Mesaros:
Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations. CoRR abs/2406.13386 (2024) - [i13]Andreas Triantafyllopoulos, Iosif Tsangko, Alexander Gebhard, Annamaria Mesaros, Tuomas Virtanen, Björn W. Schuller:
Computer Audition: From Task-Specific Machine Learning to Foundation Models. CoRR abs/2407.15672 (2024) - [i12]Annamaria Mesaros, Romain Serizel, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
A decade of DCASE: Achievements, practices, evaluations and future challenges. CoRR abs/2410.04951 (2024) - 2023
- [j10]Irene Martín-Morató, Annamaria Mesaros:
Strong Labeling of Sound Events Using Crowdsourced Weak Labels and Annotator Competence Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 902-914 (2023) - [c42]Irene Martín-Morató, Manu Harju, Paul Ahokas, Annamaria Mesaros:
Training Sound Event Detection with Soft Labels from Crowdsourced Annotations. ICASSP 2023: 1-5 - [c41]Shanshan Wang, Soumya Tripathy, Annamaria Mesaros:
Self-Supervised Learning of Audio Representations using Angular Contrastive Loss. ICASSP 2023: 1-5 - [i11]Irene Martín-Morató, Manu Harju, Paul Ahokas, Annamaria Mesaros:
Training sound event detection with soft labels from crowdsourced annotations. CoRR abs/2302.14572 (2023) - [i10]Manjunath Mulimani, Annamaria Mesaros:
Incremental Learning of Acoustic Scenes and Sound Events. CoRR abs/2302.14815 (2023) - [i9]Manu Harju, Annamaria Mesaros:
Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall. CoRR abs/2309.13938 (2023) - 2022
- [j9]Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment. IEEE J. Sel. Top. Signal Process. 16(6): 1467-1479 (2022) - [c40]Irene Martín-Morató, Manu Harju, Annamaria Mesaros:
A Summarization Approach to Evaluating Audio Captioning. DCASE 2022 - [c39]Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge. DCASE 2022 - [c38]Guillermo García-Barrios, Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros, Juana M. Gutiérrez-Arriola, Rubén Fraile:
Binaural source localization using deep learning and head rotation information. EUSIPCO 2022: 36-40 - [c37]Daniel Aleksander Krause, Annamaria Mesaros:
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification. EUSIPCO 2022: 399-403 - [e6]Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gaël Richard, Romain Serizel, Dan Stowell:
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022, Nancy, France, November 3-4, 2022. Tampere University 2022, ISBN 978-952-03-2677-7 [contents] - [i8]Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment. CoRR abs/2206.00970 (2022) - [i7]Daniel Aleksander Krause, Annamaria Mesaros:
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification. CoRR abs/2209.05900 (2022) - [i6]Shanshan Wang, Soumya Tripathy, Annamaria Mesaros:
Self-supervised learning of audio representations using angular contrastive loss. CoRR abs/2211.05442 (2022) - 2021
- [j8]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
Sound Event Detection: A tutorial. IEEE Signal Process. Mag. 38(5): 67-83 (2021) - [j7]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. IEEE ACM Trans. Audio Speech Lang. Process. 29: 684-698 (2021) - [c36]Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Audio-Visual Scene Classification: Analysis of DCASE 2021 Challenge Submissions. DCASE 2021: 45-49 - [c35]Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems. DCASE 2021: 85-89 - [c34]Irene Martín-Morató, Annamaria Mesaros:
Diversity and Bias in Audio Captioning Datasets. DCASE 2021: 90-94 - [c33]Irene Martín-Morató, Annamaria Mesaros:
What is the ground truth? Reliability of multi-annotator data for audio tagging. EUSIPCO 2021: 76-80 - [c32]Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. ICASSP 2021: 626-630 - [c31]Björn W. Schuller, Tuomas Virtanen, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros, Konstantinos Drossos:
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence. ICMI 2021: 788-792 - [c30]Irene Martín-Morató, Manu Harju, Annamaria Mesaros:
Crowdsourcing Strong Labels for Sound Event Detection. WASPAA 2021: 246-250 - [c29]Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros:
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio. WASPAA 2021: 331-335 - [e5]Frederic Font, Annamaria Mesaros, Daniel P. W. Ellis, Eduardo Fonseca, Magdalena Fuentes, Benjamin Elizalde:
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), Online, November 15-19, 2021. 2021, ISBN 978-84-09-36072-7 [contents] - [i5]Shanshan Wang, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Audio-visual scene classification: analysis of DCASE 2021 Challenge submissions. CoRR abs/2105.13675 (2021) - [i4]Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros:
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio. CoRR abs/2107.12033 (2021) - 2020
- [c28]Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions. DCASE 2020: 56-60 - [e4]Nobutaka Ono, Noboru Harada, Yohei Kawaguchi, Annamaria Mesaros, Keisuke Imoto, Yuma Koizumi, Tatsuya Komatsu:
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), Tokyo, Japan (full virtual), November 2-4, 2020. 2020, ISBN 978-4-600-00566-5 [contents] - [i3]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. CoRR abs/2009.02792 (2020)
2010 – 2019
- 2019
- [j6]Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019) - [c27]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups. DCASE 2019: 164-168 - [c26]M. N. Istiaq Ahsan, Csaba Kertész, Annamaria Mesaros, Toni Heittola, Andrew Knight, Tuomas Virtanen:
Audio-Based Epileptic Seizure Detection. EUSIPCO 2019: 1-5 - [c25]Irene Martín-Morató, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Maximo Cobos, Francesc J. Ferri:
Sound Event Envelope Estimation in Polyphonic Mixtures. ICASSP 2019: 935-939 - [c24]Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City Classification from Multiple Real-World Sound Scenes. WASPAA 2019: 11-15 - [c23]Annamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen:
Joint Measurement of Localization and Detection of Sound Events. WASPAA 2019: 333-337 - [d2]Sharath Adavanne, Archontis Politis, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Sound event localization and detection (SELDnet) results. Zenodo, 2019 - [i2]Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City classification from multiple real-world sound scenes. CoRR abs/1905.00979 (2019) - 2018
- [j5]Annamaria Mesaros, Toni Heittola, Emmanouil Benetos, Peter Foster, Mathieu Lagrange, Tuomas Virtanen, Mark D. Plumbley:
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 379-393 (2018) - [c22]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. DCASE 2018: 9-13 - [c21]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification: An Overview of Dcase 2017 Challenge Entries. IWAENC 2018: 411-415 - [e3]Mark D. Plumbley, Christian Kroos, Juan Pablo Bello, Gaël Richard, Daniel P. W. Ellis, Annamaria Mesaros:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2018, Surrey, UK, November 19-20, 2018. 2018, ISBN 978-952-15-4262-6 [contents] - [i1]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. CoRR abs/1807.09840 (2018) - 2017
- [c20]Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Benjamin Elizalde, Ankit Shah, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System. DCASE 2017: 85-92 - [c19]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Assessment of human and machine performance in acoustic scene classification: Dcase 2016 case study. WASPAA 2017: 319-323 - [e2]Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Emmanuel Vincent, Emmanouil Benetos, Benjamin Elizalde:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2017, Munich, Germany, November 16-17, 2017. 2017, ISBN 978-952-15-4042-4 [contents] - [d1]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Emmanouil Benetos, Mathieu Lagrange, Grégoire Lafay, Peter Foster, Mark D. Plumbley:
DCASE2016 Challenge Submissions Package. Zenodo, 2017 - 2016
- [c18]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
TUT database for acoustic scene classification and sound event detection. EUSIPCO 2016: 1128-1132 - [e1]Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Mark D. Plumbley, Peter Foster, Emmanouil Benetos, Mathieu Lagrange:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2016, Budapest, Hungary, September 3, 2016. 2016, ISBN 978-952-15-3807-0 [contents] - 2015
- [c17]Daniele Battaglino, Annamaria Mesaros, Ludovick Lepauloux, Laurent Pilati, Nicholas W. D. Evans:
Acoustic context recognition for mobile devices using a reduced complexity SVM. EUSIPCO 2015: 534-538 - [c16]Annamaria Mesaros, Toni Heittola, Onur Dikmen, Tuomas Virtanen:
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations. ICASSP 2015: 151-155 - 2014
- [j4]Toni Heittola, Annamaria Mesaros, Dani Korpi, Antti J. Eronen, Tuomas Virtanen:
Method for creating location-specific audio textures. EURASIP J. Audio Speech Music. Process. 2014: 9 (2014) - [c15]Ehsan Amid, Annamaria Mesaros, Kalle J. Palomäki, Jorma Laaksonen, Mikko Kurimo:
Unsupervised feature extraction for multimedia event detection and ranking using audio content. ICASSP 2014: 5939-5943 - 2013
- [j3]Toni Heittola, Annamaria Mesaros, Antti J. Eronen, Tuomas Virtanen:
Context-dependent sound event detection. EURASIP J. Audio Speech Music. Process. 2013: 1 (2013) - [j2]Dani Korpi, Toni Heittola, Timo Partala, Antti J. Eronen, Annamaria Mesaros, Tuomas Virtanen:
On the human ability to discriminate audio ambiances from similar locations of an urban environment. Pers. Ubiquitous Comput. 17(4): 761-769 (2013) - [c14]Annamaria Mesaros, Toni Heittola, Kalle J. Palomäki:
Analysis of acoustic-semantic relationship for diversely annotated real-world audio data. ICASSP 2013: 813-817 - [c13]Toni Heittola, Annamaria Mesaros, Tuomas Virtanen, Moncef Gabbouj:
Supervised model training for overlapping sound events based on unsupervised source separation. ICASSP 2013: 8677-8681 - [c12]Annamaria Mesaros:
Singing voice identification and lyrics transcription for music information retrieval invited paper. SpeD 2013: 1-10 - [c11]Satoru Ishikawa, Markus Koskela, Mats Sjöberg, Jorma Laaksonen, Erkki Oja, Ehsan Amid, Kalle J. Palomäki, Annamaria Mesaros, Mikko Kurimo:
PicSOM Experiments in TRECVID 2013. TRECVID 2013 - [c10]Onur Dikmen, Annamaria Mesaros:
Sound event detection using non-negative dictionaries learned from annotated overlapping events. WASPAA 2013: 1-4 - [c9]Annamaria Mesaros, Toni Heittola, Kalle J. Palomäki:
Query-by-example retrieval of sound events using an integrated similarity measure of content and label. WIAMIS 2013: 1-4 - 2011
- [c8]Annamaria Mesaros, Toni Heittola, Anssi Klapuri:
Latent semantic analysis in sound event detection. EUSIPCO 2011: 1307-1311 - 2010
- [j1]Annamaria Mesaros, Tuomas Virtanen:
Automatic Recognition of Lyrics in Singing. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [c7]Annamaria Mesaros, Toni Heittola, Antti J. Eronen, Tuomas Virtanen:
Acoustic event detection in real life recordings. EUSIPCO 2010: 1267-1271 - [c6]Toni Heittola, Annamaria Mesaros, Antti J. Eronen, Tuomas Virtanen:
Audio context recognition using audio event histograms. EUSIPCO 2010: 1272-1276 - [c5]Annamaria Mesaros, Tuomas Virtanen:
Recognition of phonemes and words in singing. ICASSP 2010: 2146-2149
2000 – 2009
- 2009
- [c4]Annamaria Mesaros, Tuomas Virtanen:
Adaptation of a speech recognizer for singing voice. EUSIPCO 2009: 1779-1783 - 2008
- [c3]Tuomas Virtanen, Annamaria Mesaros, Matti Ryynänen:
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music. SAPA@INTERSPEECH 2008: 17-22 - 2007
- [c2]Annamaria Mesaros, Tuomas Virtanen, Anssi Klapuri:
Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods. ISMIR 2007: 375-378 - 2005
- [c1]Annamaria Mesaros, Jaakko Astola:
The Mel-Frequency Cepstral Coefficients in the Context of Singer Identification. ISMIR 2005: 610-613
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-21 00:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint