default search action

combined dblp search
author search
venue search
publication search

ask others

EURASIP Journal on Audio, Speech, and Music Processing, Volume 2023

> Home > Journals > EURASIP Journal on Audio, Speech, and Music Processing

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Volume 2023, Number 1, December 2023

- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KoszewskiGKK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KoszewskiGKK23
Damian Koszewski, Thomas Görne, Grazina Korvel, Bozena Kostek:
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders. 1
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/QianLYL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/QianLYL23
Jiale Qian, Xinlu Liu, Yi Yu, Wei Li:
Stripe-Transformer: deep stripe feature learning for music source separation. 2
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/CREC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/CREC23
Prashanth H. C., Madhav Rao, Dhanya Eledath, Ramasubramanian C:
Trainable windows for SincNet architecture. 3
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KlecWSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KlecWSS23
Mariusz Klec, Alicja Wieczorkowska, Krzysztof Szklanny, Wlodzimierz Strus:
Beyond the Big Five personality traits for music recommendation systems. 4
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/TonamiI23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/TonamiI23
Noriyuki Tonami, Keisuke Imoto:
Sound event triage: detecting sound events considering priority of classes. 5
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HanL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HanL23
Jiangyu Han, Yanhua Long:
Heterogeneous separation consistency training for adaptation of unsupervised speech separation. 6
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/WangGGZY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/WangGGZY23
Tingting Wang, Haiyan Guo, Zirui Ge, Qiquan Zhang, Zhen Yang:
An MMSE graph spectral magnitude estimator for speech signals residing on an undirected multiple graph. 7
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/OShaughnessy23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/OShaughnessy23
Douglas D. O'Shaughnessy:
Review of methods for coding of speech signals. 8
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/CRER23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/CRER23
Prashanth H. C., Madhav Rao, Dhanya Eledath, V. Ramasubramanian:
Correction: Trainable windows for SincNet architecture. 9
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/XieCST23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/XieCST23
Xiaoping Xie, Yongzhen Chen, Rufeng Shen, Dan Tian:
Research on monaural speech segregation based on feature selection. 10
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MezzaZS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MezzaZS23
Alessandro Ilic Mezza, Massimiliano Zanoni, Augusto Sarti:
A latent rhythm complexity model for attribute-controlled drum pattern generation. 11
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MassiMGB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MassiMGB23
Oliviero Massi, Alessandro Ilic Mezza, Riccardo Giampiccolo, Alberto Bernardini:
Deep learning-based wave digital modeling of rate-dependent hysteretic nonlinearities for virtual analog applications. 12
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/OstermannVE23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/OstermannVE23
Fabian Ostermann, Igor Vatolkin, Martin Ebeling:
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks. 13
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiXKS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiXKS23
Zimu Li, Yanyan Xu, Dengfeng Ke, Kaile Su:
Three-stage training and orthogonality regularization for spoken language recognition. 14
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/Wangh23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/Wangh23
Pu Wang, Hugo Van hamme:
Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech. 15
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/GuoGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/GuoGL23
Xiao-Yuan Guo, Chun-Xian Gao, Hui Liu:
Voice activity detection in the presence of transient based on graph. 16
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/DietzenATW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/DietzenATW23
Thomas Dietzen, Randall Ali, Maja Taseska, Toon van Waterschoot:
MYRiAD: a multi-array room acoustic database. 17
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LemercierTKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LemercierTKG23
Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann:
A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices. 18
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/AranedaHernandezBPC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/AranedaHernandezBPC23
Mauricio Araneda-Hernandez, Felipe Bravo-Marquez, Denis Parra, Rodrigo F. Cádiz:
MUSIB: musical score inpainting benchmark. 19
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/BellurTE23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/BellurTE23
Ashwin Bellur, Karan Thakkar, Mounya Elhilali:
Explicit-memory multiresolution adaptive framework for speech and music separation. 20
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/WangZCLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/WangZCLY23
Kunpeng Wang, Hao Zhou, Jingxiang Cai, Wenna Li, Juan Yao:
Time-domain adaptive attention network for single-channel speech separation. 21
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiuCW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiuCW23
Gang Liu, Shifang Cai, Ce Wang:
Speech emotion recognition based on emotion perception. 22
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiuY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiuY23
Tong Liu, Xiaochen Yuan:
Paralinguistic and spectral feature extraction for speech emotion classification using machine learning techniques. 23
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ComanducciGZAS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ComanducciGZAS23
Luca Comanducci, Davide Gioiosa, Massimiliano Zanoni, Fabio Antonacci, Augusto Sarti:
Variational Autoencoders for chord sequence generation conditioned on Western harmonic music complexity. 24
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HanKLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HanKLZ23
Zhe Han, Yuxuan Ke, Xiaodong Li, Chengshi Zheng:
Parallel processing of distributed beamforming and multichannel linear prediction for speech denoising and deverberation in wireless acoustic sensor networks. 25
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KocakDPKD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KocakDPKD23
Tugçe Melike Koçak, Büsra Çilem Dibek, Esma Nafiye Polat, Nilüfer Kafesçioglu, Cenk Demiroglu:
Automatic detection of attachment style in married couples through conversation analysis. 26
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ZhouW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ZhouW23
Yuting Zhou, Hongjie Wan:
Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization. 27
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiangZX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiangZX23
Xingwei Liang, Zehua Zhang, Ruifeng Xu:
Multi-task deep cross-attention networks for far-field speaker verification and keyword spotting. 28
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/GuntherBK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/GuntherBK23
Michael Günther, Andreas Brendel, Walter Kellermann:
Microphone utility estimation in acoustic sensor networks using single-channel signal features. 29
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/XuZW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/XuZW23
Shiyun Xu, Zehua Zhang, Mingjiang Wang:
Channel and temporal-frequency attention UNet for monaural speech enhancement. 30
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ZengL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ZengL23
Te Zeng, Francis C. M. Lau:
Training audio transformers for cover song identification. 31
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/GrinsteinNN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/GrinsteinNN23
Eric Grinstein, Vincent W. Neo, Patrick A. Naylor:
Dual input neural networks for positional sound source localization. 32
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ChenX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ChenX23
Zhiyong Chen, Shugong Xu:
Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning. 33
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ZhangWHZXLH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ZhangWHZXLH23
Lekai Zhang, Yingfan Wang, Kailun He, Hailong Zhang, Baixi Xing, Xiaofeng Liu, Fo Hu:
The power of humorous audio: exploring emotion regulation in traffic congestion through EEG-based study. 34
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KawamuraYWOM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KawamuraYWOM23
Takao Kawamura, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono, Ryoichi Miyazaki:
Acoustic object canceller: removing a known signal from monaural recording using blind synchronization. 35
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HsuB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HsuB23
Yicheng Hsu, Mingsian R. Bai:
Learning-based robust speaker counting and separation with the aid of spatial coherence. 36
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/RuizWM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/RuizWM23
Santiago Ruiz, Toon van Waterschoot, Marc Moonen:
Cascade algorithms for combined acoustic feedback cancelation and noise reduction. 37
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/TenganDEW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/TenganDEW23
Elisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot:
Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization. 38
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/SaremiRGG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/SaremiRGG23
Amin Saremi, Balaji Ramkumar, Ghazaleh Ghaffari, Zonghua Gu:
An acoustic echo canceller optimized for hands-free speech telecommunication in large vehicle cabins. 39
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiWYI23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiWYI23
Yan Li, Yapeng Wang, Xu Yang, Sio Kei Im:
Speech emotion recognition based on Graph-LSTM neural network. 40
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/WangJZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/WangJZ23
Chunxi Wang, Maoshen Jia, Xinfeng Zhang:
Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments. 41
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/GuanLKXZTW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/GuanLKXZTW23
Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection. 42
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/LiSZLLWQHYS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/LiSZLLWQHYS23
Jingtan Li, Mengkai Sun, Zhonghao Zhao, Xingcan Li, Gaigai Li, Chen Wu, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy. 43
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MaWTZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MaWTZZ23
Le Ma, Xinda Wu, Ruiyuan Tang, Chongjun Zhong, Kejun Zhang:
YuYin: a multi-task learning model of multi-modal e-commerce background music recommendation. 44
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HuangWYHH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HuangWYHH23
Hao Huang, Lin Wang, Jichen Yang, Ying Hu, Liang He:
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision. 45
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KindtTBM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KindtTBM23
Stijn Kindt, Jenthe Thienpondt, Luca Becker, Nilesh Madhu:
Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios. 46
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ManoharJR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ManoharJR23
Kavya Manohar, A. R. Jayan, Rajeev Rajan:
Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling. 47
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/QianXY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/QianXY23
Zhaopeng Qian, Kejing Xiao, Chongchong Yu:
A survey of technologies for automatic Dysarthric speech recognition. 48
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ReghunathR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ReghunathR23
Lekshmi Chandrika Reghunath, Rajeev Rajan:
Predominant audio source separation in polyphonic music. 49
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/XueSTHYHX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/XueSTHYHX23
Huiwen Xue, Chenxin Sun, Mingcheng Tang, Chenrui Hu, Zhengqing Yuan, Min Huang, Zhongzhe Xiao:
Effective acoustic parameters for automatic classification of performed and synthesized Guzheng music. 50
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/GrumiauxL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/GrumiauxL23
Pierre-Amaury Grumiaux, Mathieu Lagrange:
Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model. 51
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/Suzuki23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/Suzuki23
Masahiro Suzuki:
Piano score rearrangement into multiple difficulty levels via notation-to-notation approach. 52
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/WangLXYYL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/WangLXYYL23
Jing Wang, Hanyue Liu, Liang Xu, Wenjing Yang, Weiming Yi, Fang Liu:
Lightweight target speaker separation network based on joint training. 53
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/KellermannMO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/KellermannMO23
Walter Kellermann, Rainer Martin, Nobutaka Ono:
Signal processing and machine learning for speech and audio in acoustic sensor networks. 54
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/ChinaevKE23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/ChinaevKE23
Aleksej Chinaev, Niklas Knaepper, Gerald Enzner:
Online distributed waveform-synchronization for acoustic sensor networks with dynamic topology. 55

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.