default search action
Lukas Drude
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c32]Sergio Duarte Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gomez-Alanis, Andreas Schwarz, Leif Rädel, Volker Leutnant:
Promptformer: Prompted Conformer Transducer for ASR. ICASSP 2024: 11821-11825 - [i13]Sergio Duarte Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gómez Alanís, Andreas Schwarz, Leif Rädel, Volker Leutnant:
Promptformer: Prompted Conformer Transducer for ASR. CoRR abs/2401.07360 (2024) - 2023
- [c31]Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler:
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition. INTERSPEECH 2023: 4973-4977 - [i12]Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler:
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition. CoRR abs/2306.06954 (2023) - 2022
- [c30]Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler:
Contextual-Utterance Training for Automatic Speech Recognition. IberSPEECH 2022: 26-30 - [i11]Alejandro Gómez Alanís, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler:
Contextual-Utterance Training for Automatic Speech Recognition. CoRR abs/2210.16238 (2022) - 2021
- [j3]Reinhold Haeb-Umbach, Jahn Heymann, Lukas Drude, Shinji Watanabe, Marc Delcroix, Tomohiro Nakatani:
Far-Field Automatic Speech Recognition. Proc. IEEE 109(2): 124-148 (2021) - [c29]Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget. Interspeech 2021: 1669-1673 - [i10]Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget. CoRR abs/2106.07994 (2021) - 2020
- [b1]Lukas Drude:
Integration of neural networks and probabilistic spatial models for acoustic blind source separation. University of Paderborn, Germany, 2020 - [c28]Jens Heitkaemper, Darius Jakobeit, Christoph Böddeker, Lukas Drude, Reinhold Haeb-Umbach:
Demystifying TasNet: A Dissecting Approach. ICASSP 2020: 6359-6363 - [c27]Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020: 7004-7008 - [c26]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. INTERSPEECH 2020: 3097-3101 - [i9]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR. CoRR abs/2006.02786 (2020)
2010 – 2019
- 2019
- [j2]Lukas Drude, Reinhold Haeb-Umbach:
Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE J. Sel. Top. Signal Process. 13(4): 815-826 (2019) - [c25]Janek Ebbers, Lukas Drude, Reinhold Haeb-Umbach, Andreas Brendel, Walter Kellermann:
Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. CAMSAP 2019: 301-305 - [c24]Lukas Drude, Daniel Hasenklever, Reinhold Haeb-Umbach:
Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. ICASSP 2019: 695-699 - [c23]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. ICASSP 2019: 6655-6659 - [c22]Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised Training of Neural Mask-Based Beamforming. INTERSPEECH 2019: 1253-1257 - [i8]Lukas Drude, Daniel Hasenklever, Reinhold Haeb-Umbach:
Unsupervised training of a deep clustering model for multichannel blind source separation. CoRR abs/1904.01340 (2019) - [i7]Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised training of neural mask-based beamforming. CoRR abs/1904.01578 (2019) - [i6]Lukas Drude, Jens Heitkaemper, Christoph Böddeker, Reinhold Haeb-Umbach:
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. CoRR abs/1910.13934 (2019) - [i5]Jens Heitkaemper, Darius Jakobeit, Christoph Böddeker, Lukas Drude, Reinhold Haeb-Umbach:
Demystifying TasNet: A Dissecting Approach. CoRR abs/1911.08895 (2019) - [i4]Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-end training of time domain audio separation and recognition. CoRR abs/1912.08462 (2019) - 2018
- [c21]Lukas Drude, Jahn Heymann, Christoph Böddeker, Reinhold Haeb-Umbach:
NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. ITG Symposium on Speech Communication 2018: 1-5 - [c20]Lukas Drude, Thilo von Neumann, Reinhold Haeb-Umbach:
Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation. ICASSP 2018: 11-15 - [c19]Lukas Drude, Takuya Higuchi, Keisuke Kinoshita, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. ICASSP 2018: 691-695 - [c18]Keisuke Kinoshita, Lukas Drude, Marc Delcroix, Tomohiro Nakatani:
Listening to Each Speaker One by One with Recurrent Selective Hearing Networks. ICASSP 2018: 5064-5068 - [c17]Lukas Drude, Christoph Böddeker, Jahn Heymann, Reinhold Haeb-Umbach, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani:
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation. INTERSPEECH 2018: 3043-3047 - [c16]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Frame-Online DNN-WPE Dereverberation. IWAENC 2018: 466-470 - 2017
- [j1]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
A generic neural acoustic beamforming architecture for robust multi-channel speech processing. Comput. Speech Lang. 46: 374-385 (2017) - [c15]Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Optimizing neural-network supported acoustic beamforming by algorithmic differentiation. ICASSP 2017: 171-175 - [c14]Jahn Heymann, Lukas Drude, Christoph Böddeker, Patrick Hanebrink, Reinhold Haeb-Umbach:
Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system. ICASSP 2017: 5325-5329 - [c13]Janek Ebbers, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, Bhiksha Raj:
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017: 488-492 - [c12]Lukas Drude, Reinhold Haeb-Umbach:
Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings. INTERSPEECH 2017: 2650-2654 - [c11]Joerg Schmalenstroeer, Jahn Heymann, Lukas Drude, Christoph Böddeker, Reinhold Haeb-Umbach:
Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming. MMSP 2017: 1-6 - [i3]Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. CoRR abs/1701.00392 (2017) - [i2]Nikolas Wolfe, Aditya Sharma, Lukas Drude, Bhiksha Raj:
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning. CoRR abs/1701.04465 (2017) - [i1]Gerhard Kurz, Igor Gilitschenski, Florian Pfaff, Lukas Drude, Uwe D. Hanebeck, Reinhold Haeb-Umbach, Roland Yves Siegwart:
Directional Statistics and Filtering Using libDirectional. CoRR abs/1712.09718 (2017) - 2016
- [c10]Aleksej Chinaev, Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. ITG Symposium on Speech Communication 2016: 1-5 - [c9]Thomas Glarner, Mohammad Mahdi Momenzadeh, Lukas Drude, Reinhold Haeb-Umbach:
Factor Graph Decoding for Speech Presence Probability Estimation. ITG Symposium on Speech Communication 2016: 1-5 - [c8]Lukas Drude, Christoph Böddeker, Reinhold Haeb-Umbach:
Blind speech separation based on complex spherical k-mode clustering. ICASSP 2016: 141-145 - [c7]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Neural network based spectral mask estimation for acoustic beamforming. ICASSP 2016: 196-200 - [c6]Lukas Drude, Bhiksha Raj, Reinhold Haeb-Umbach:
On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement. INTERSPEECH 2016: 1745-1749 - 2015
- [c5]Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach:
BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge. ASRU 2015: 444-451 - [c4]Lukas Drude, Florian Jacob, Reinhold Haeb-Umbach:
DOA-estimation based on a complex Watson kernel method. EUSIPCO 2015: 255-259 - [c3]Oliver Walter, Lukas Drude, Reinhold Haeb-Umbach:
Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model. ICASSP 2015: 459-463 - 2014
- [c2]Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models. ICASSP 2014: 6834-6838 - [c1]Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Towards online source counting in speech mixtures applying a variational EM for complex Watson mixture models. IWAENC 2014: 213-217
Coauthor Index
aka: Reinhold Haeb-Umbach
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint