EURASIP Journal on Audio, Speech, and Music Processing, Volume 2023
Volume 2023, Number 1, December 2023
- Damian Koszewski, Thomas Görne, Grazina Korvel, Bozena Kostek:
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders. 1
- Jiale Qian, Xinlu Liu, Yi Yu, Wei Li:
Stripe-Transformer: deep stripe feature learning for music source separation. 2
- Prashanth H. C., Madhav Rao, Dhanya Eledath, Ramasubramanian C:
Trainable windows for SincNet architecture. 3
- Mariusz Klec, Alicja Wieczorkowska, Krzysztof Szklanny, Wlodzimierz Strus:
Beyond the Big Five personality traits for music recommendation systems. 4
- Noriyuki Tonami, Keisuke Imoto:
Sound event triage: detecting sound events considering priority of classes. 5
- Jiangyu Han, Yanhua Long:
Heterogeneous separation consistency training for adaptation of unsupervised speech separation. 6
- Tingting Wang, Haiyan Guo, Zirui Ge, Qiquan Zhang, Zhen Yang:
An MMSE graph spectral magnitude estimator for speech signals residing on an undirected multiple graph. 7
- Douglas D. O'Shaughnessy:
Review of methods for coding of speech signals. 8
- Prashanth H. C., Madhav Rao, Dhanya Eledath, V. Ramasubramanian:
Correction: Trainable windows for SincNet architecture. 9
- Xiaoping Xie, Yongzhen Chen, Rufeng Shen, Dan Tian:
Research on monaural speech segregation based on feature selection. 10
- Alessandro Ilic Mezza, Massimiliano Zanoni, Augusto Sarti:
A latent rhythm complexity model for attribute-controlled drum pattern generation. 11
- Oliviero Massi, Alessandro Ilic Mezza, Riccardo Giampiccolo, Alberto Bernardini:
Deep learning-based wave digital modeling of rate-dependent hysteretic nonlinearities for virtual analog applications. 12
- Fabian Ostermann, Igor Vatolkin, Martin Ebeling:
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks. 13
- Zimu Li, Yanyan Xu, Dengfeng Ke, Kaile Su:
Three-stage training and orthogonality regularization for spoken language recognition. 14
- Pu Wang, Hugo Van hamme:
Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech. 15
- Xiao-Yuan Guo, Chun-Xian Gao, Hui Liu:
Voice activity detection in the presence of transient based on graph. 16
- Thomas Dietzen, Randall Ali, Maja Taseska, Toon van Waterschoot:
MYRiAD: a multi-array room acoustic database. 17
- Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann:
A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices. 18
- Mauricio Araneda-Hernandez, Felipe Bravo-Marquez, Denis Parra, Rodrigo F. Cádiz:
MUSIB: musical score inpainting benchmark. 19
- Ashwin Bellur, Karan Thakkar, Mounya Elhilali:
Explicit-memory multiresolution adaptive framework for speech and music separation. 20
- Kunpeng Wang, Hao Zhou, Jingxiang Cai, Wenna Li, Juan Yao:
Time-domain adaptive attention network for single-channel speech separation. 21
- Gang Liu, Shifang Cai, Ce Wang:
Speech emotion recognition based on emotion perception. 22
- Tong Liu, Xiaochen Yuan:
Paralinguistic and spectral feature extraction for speech emotion classification using machine learning techniques. 23
- Luca Comanducci, Davide Gioiosa, Massimiliano Zanoni, Fabio Antonacci, Augusto Sarti:
Variational Autoencoders for chord sequence generation conditioned on Western harmonic music complexity. 24
- Zhe Han, Yuxuan Ke, Xiaodong Li, Chengshi Zheng:
Parallel processing of distributed beamforming and multichannel linear prediction for speech denoising and dereverberation in wireless acoustic sensor networks. 25
- Tugçe Melike Koçak, Büsra Çilem Dibek, Esma Nafiye Polat, Nilüfer Kafesçioglu, Cenk Demiroglu:
Automatic detection of attachment style in married couples through conversation analysis. 26
- Yuting Zhou, Hongjie Wan:
Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization. 27
- Xingwei Liang, Zehua Zhang, Ruifeng Xu:
Multi-task deep cross-attention networks for far-field speaker verification and keyword spotting. 28
- Michael Günther, Andreas Brendel, Walter Kellermann:
Microphone utility estimation in acoustic sensor networks using single-channel signal features. 29
- Shiyun Xu, Zehua Zhang, Mingjiang Wang:
Channel and temporal-frequency attention UNet for monaural speech enhancement. 30
- Te Zeng, Francis C. M. Lau:
Training audio transformers for cover song identification. 31
- Eric Grinstein, Vincent W. Neo, Patrick A. Naylor:
Dual input neural networks for positional sound source localization. 32
- Zhiyong Chen, Shugong Xu:
Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning. 33
- Lekai Zhang, Yingfan Wang, Kailun He, Hailong Zhang, Baixi Xing, Xiaofeng Liu, Fo Hu:
The power of humorous audio: exploring emotion regulation in traffic congestion through EEG-based study. 34
- Takao Kawamura, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono, Ryoichi Miyazaki:
Acoustic object canceller: removing a known signal from monaural recording using blind synchronization. 35
- Yicheng Hsu, Mingsian R. Bai:
Learning-based robust speaker counting and separation with the aid of spatial coherence. 36
- Santiago Ruiz, Toon van Waterschoot, Marc Moonen:
Cascade algorithms for combined acoustic feedback cancelation and noise reduction. 37
- Elisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot:
Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization. 38
- Amin Saremi, Balaji Ramkumar, Ghazaleh Ghaffari, Zonghua Gu:
An acoustic echo canceller optimized for hands-free speech telecommunication in large vehicle cabins. 39
- Yan Li, Yapeng Wang, Xu Yang, Sio Kei Im:
Speech emotion recognition based on Graph-LSTM neural network. 40
- Chunxi Wang, Maoshen Jia, Xinfeng Zhang:
Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments. 41
- Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection. 42
- Jingtan Li, Mengkai Sun, Zhonghao Zhao, Xingcan Li, Gaigai Li, Chen Wu, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy. 43
- Le Ma, Xinda Wu, Ruiyuan Tang, Chongjun Zhong, Kejun Zhang:
YuYin: a multi-task learning model of multi-modal e-commerce background music recommendation. 44
- Hao Huang, Lin Wang, Jichen Yang, Ying Hu, Liang He:
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision. 45
- Stijn Kindt, Jenthe Thienpondt, Luca Becker, Nilesh Madhu:
Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios. 46
- Kavya Manohar, A. R. Jayan, Rajeev Rajan:
Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling. 47
- Zhaopeng Qian, Kejing Xiao, Chongchong Yu:
A survey of technologies for automatic Dysarthric speech recognition. 48
- Lekshmi Chandrika Reghunath, Rajeev Rajan:
Predominant audio source separation in polyphonic music. 49
- Huiwen Xue, Chenxin Sun, Mingcheng Tang, Chenrui Hu, Zhengqing Yuan, Min Huang, Zhongzhe Xiao:
Effective acoustic parameters for automatic classification of performed and synthesized Guzheng music. 50
- Pierre-Amaury Grumiaux, Mathieu Lagrange:
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model. 51
- Masahiro Suzuki:
Piano score rearrangement into multiple difficulty levels via notation-to-notation approach. 52
- Jing Wang, Hanyue Liu, Liang Xu, Wenjing Yang, Weiming Yi, Fang Liu:
Lightweight target speaker separation network based on joint training. 53
- Walter Kellermann, Rainer Martin, Nobutaka Ono:
Signal processing and machine learning for speech and audio in acoustic sensor networks. 54
- Aleksej Chinaev, Niklas Knaepper, Gerald Enzner:
Online distributed waveform-synchronization for acoustic sensor networks with dynamic topology. 55