


IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 26
Volume 26, Number 1, January 2018
- Dianna Yee, A. Homayoun Kamkar-Parsi, Rainer Martin, Henning Puder: A Noise Reduction Postfilter for Binaurally Linked Single-Microphone Hearing Aids Utilizing a Nearby External Microphone. 5-18
- Tom Bäckström, Johannes Fischer: Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio. 19-30
- Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz, Björn W. Schuller: Semisupervised Autoencoders for Speech Emotion Recognition. 31-43
- Md. Sahidullah, Dennis Alexander Lehmann Thomsen, Rosa González Hautamäki, Tomi Kinnunen, Zheng-Hua Tan, Robert Parts, Martti Pitkänen: Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones. 44-56
- Gilles Degottex, Pierre Lanchantin, Mark J. F. Gales: A Log Domain Pulse Model for Parametric Speech Synthesis. 57-70
- Johannes Abel, Tim Fingscheidt: Artificial Speech Bandwidth Extension Using Deep Neural Networks for Wideband Spectral Envelope Estimation. 71-83
- Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari: Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks. 84-96
- Kristian Timm Andersen, Marc Moonen: Robust Speech-Distortion Weighted Interframe Wiener Filters for Single-Channel Noise Reduction. 97-107
- Chen-Yu Chiang: Cross-Dialect Adaptation Framework for Constructing Prosodic Models for Chinese Dialect Text-to-Speech Systems. 108-121
- Bingquan Liu, Zhen Xu, Chengjie Sun, Baoxun Wang, Xiaolong Wang, Derek F. Wong, Min Zhang: Content-Oriented User Modeling for Personalized Response Ranking in Chatbots. 122-133
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Lantian Li, Andrew Abel: Phonetic Temporal Neural Model for Language Identification. 134-144
- Soumitro Chakrabarty, Emanuël A. P. Habets: A Bayesian Approach to Informed Spatial Filtering With Robustness Against DOA Estimation Errors. 145-160
- Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang: An Information Distillation Framework for Extractive Summarization. 161-170
- Ma Jin, Yan Song, Ian McLoughlin, Li-Rong Dai: LID-Senones and Their Statistics for Language Identification. 171-183
- Zhehuai Chen, Jasha Droppo, Jinyu Li, Wayne Xiong: Progressive Joint Modeling in Unsupervised Single-Channel Overlapped Speech Recognition. 184-196
- Shivesh Ranjan, John H. L. Hansen: Curriculum Learning Based Approaches for Noise Robust Speaker Recognition. 197-210
Volume 26, Number 2, February 2018
- Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Tatsuya Kawahara, Hiroshi G. Okuno: Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms. 215-230
- Yu-Ping Ruan, Qian Chen, Zhen-Hua Ling: A Sequential Neural Encoder With Latent Structured Description for Modeling Sentences. 231-242
- Amelia Jane Gully, Helena Daffern, Damian T. Murphy: Diphthong Synthesis Using the Dynamic 3D Digital Waveguide Mesh. 243-255
- Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim: Improving Interpretability and Regularization in Deep Learning. 256-265
- Kehai Chen, Tiejun Zhao, Muyun Yang, Lemao Liu, Akihiro Tamura, Rui Wang, Masao Utiyama, Eiichiro Sumita: A Neural Approach to Source Dependence Based Context Model for Statistical Machine Translation. 266-280
- Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen: Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. 281-295
- Johan Sward, Hongbin Li, Andreas Jakobsson: Off-Grid Fundamental Frequency Estimation. 296-303
- Dylan Menzies, Marcos F. Simón Gálvez, Filippo Maria Fazi: A Low-Frequency Panning Method With Compensation for Head Rotation. 304-317
- Branimir Dropuljic, Igor Mijic, Davor Petrinovic, Tanja Jovanovic, Kresimir Cosic: Vocal Analysis of Acoustic Startle Responses. 318-329
- Philipp Aichinger, Martin Hagmüller, Berit Schneider-Stickler, Jean Schoentgen, Franz Pernkopf: Tracking of Multiple Fundamental Frequencies in Diplophonic Voices. 330-341
- Anastasios Alexandridis, Athanasios Mouchtaris: Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem. 342-356
- Robert Rehr, Timo Gerkmann: On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement. 357-366
- Sonia Djaziri Larbi, Gaël Mahé, Imen Marrakchi-Mezghani, Monia Turki, Meriem Jaïdane: Watermark-Driven Acoustic Echo Cancellation. 367-378
- Annamaria Mesaros, Toni Heittola, Emmanouil Benetos, Peter Foster, Mathieu Lagrange, Tuomas Virtanen, Mark D. Plumbley: Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. 379-393
- Cheng-Tao Chung, Lin-Shan Lee: Unsupervised Discovery of Structured Acoustic Tokens With Applications to Spoken Term Detection. 394-405
- Tobias May: Robust Speech Dereverberation With a Neural Network-Based Post-Filter That Exploits Multi-Conditional Training of Binaural Cues. 406-414
- Majid Mirbagheri, Les Atlas, Adrian K. C. Lee: Regression Factor Analysis With an Application to Continuous HRIR Measurement. 415-421
- Jen-Tzung Chien: Bayesian Nonparametric Learning for Hierarchical and Sparse Topics. 422-435
- Johannes Stahl, Pejman Mowlaee: A Pitch-Synchronous Simultaneous Detection-Estimation Framework for Speech Enhancement. 436-450
Volume 26, Number 3, March 2018
- César D. Salvador, Shuichi Sakamoto, Jorge Treviño, Yôiti Suzuki: Boundary Matching Filters for Spherical Microphone and Loudspeaker Arrays. 461-474
- Ahmed Hussen Abdelaziz: Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition. 475-484
- Satoru Emura: Residual Echo Reduction for Multichannel Acoustic Echo Cancelers With a Complex-Valued Residual Echo Estimate. 485-500
- Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark A. Hasegawa-Johnson: Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription. 501-514
- Mehdi Zohourian, Gerald Enzner, Rainer Martin: Binaural Speaker Localization Integrated Into an Adaptive Beamformer for Hearing Aids. 515-528
- Yong Xiang, Iynkaran Natgunanathan, Dezhong Peng, Guang Hua, Bo Liu: Spread Spectrum Audio Watermarking Using Multiple Orthogonal PN Sequences and Variable Embedding Strengths and Polarities. 529-539
- Chuanqi Tan, Furu Wei, Qingyu Zhou, Nan Yang, Bowen Du, Weifeng Lv, Ming Zhou: Context-Aware Answer Sentence Selection With Hierarchical Gated Recurrent Neural Networks. 540-549
- Jie Zhang, Sundeep Prabhakar Chepuri, Richard Christian Hendriks, Richard Heusdens: Microphone Subset Selection for MVDR Beamformer Based Noise Reduction. 550-563
- Syu-Siang Wang, Payton Lin, Yu Tsao, Jeih-Weih Hung, Borching Su: Suppression by Selecting Wavelets for Feature Compression in Distributed Speech Recognition. 564-579
- Yu Wang, Mike Brookes: Model-Based Speech Enhancement in the Modulation Domain. 580-594
- Christian Huemmer, Christian Hofmann, Roland Maas, Walter Kellermann: Estimating Parameters of Nonlinear Systems Using the Elitist Particle Filter Based on Evolutionary Strategies. 595-608
- Daniele Salvati, Carlo Drioli, Gian Luca Foresti: A Low-Complexity Robust Beamforming Using Diagonal Unloading for Acoustic Source Localization. 609-622
- Jinsong Su, Jiali Zeng, Deyi Xiong, Yang Liu, Mingxuan Wang, Jun Xie: A Hierarchy-to-Sequence Attentional Neural Machine Translation Model. 623-632
- Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-François Bonastre: A Unified Joint Model to Deal With Nuisance Variabilities in the i-Vector Space. 633-645
- Gregory Gelly, Jean-Luc Gauvain: Optimization of RNN-Based Speech Activity Detection. 646-656
- Maja Taseska, Emanuël A. P. Habets: Blind Source Separation of Moving Sources Using Sparsity-Based Source Detection and Tracking. 657-670
- Liang-Chih Yu, Jin Wang, K. Robert Lai, Xuejie Zhang: Refining Word Embeddings Using Intensity Scores for Sentiment Analysis. 671-681
- Yuval Dorfan, Axel Plinge, Gershon Hazan, Sharon Gannot: Distributed Expectation-Maximization Algorithm for Speaker Localization in Reverberant Environments. 682-695
Volume 26, Number 4, April 2018
- Zhili Tan, Man-Wai Mak, Brian Kan-Wing Mak: DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification. 700-712
- Ya-Jun Hu, Zhen-Hua Ling: Extracting Spectral Features Using Deep Autoencoders With Binary Distributed Hidden Units for Statistical Parametric Speech Synthesis. 713-724
- Bracha Laufer-Goldshtein, Ronen Talmon, Sharon Gannot: A Hybrid Approach for Speaker Tracking Based on TDOA and Data-Driven Models. 725-735
- Sandro Cumani, Pietro Laface: Speaker Recognition Using e-Vectors. 736-748
- Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang: Generalizing I-Vector Estimation for Rapid Speaker Recognition. 749-759
- Yaakov Buchris, Israel Cohen, Jacob Benesty: Frequency-Domain Design of Asymmetric Circular Differential Microphone Arrays. 760-773
- Jihui Zhang, Thushara D. Abhayapala, Wen Zhang, Prasanga N. Samarasinghe, Shouda Jiang: Active Noise Control Over Space: A Wave Domain Approach. 774-786
- Yi Luo, Zhuo Chen, Nima Mesgarani: Speaker-Independent Speech Separation With Deep Attractor Network. 787-796
- Neethu Mariam Joy, Sandeep Reddy Kothinti, Srinivasan Umesh: FMLLR Speaker Normalization With i-Vector: In Pseudo-FMLLR and Distillation Framework. 797-805
- Swati Chandna, Wenwu Wang: Bootstrap Averaging for Model-Based Source Separation in Reverberant Conditions. 806-819
- Zhili Tan, Man-Wai Mak, Brian Kan-Wing Mak, Yingke Zhu: Denoised Senone I-Vectors for Robust Speaker Verification. 820-830
- Kousuke Itakura, Yoshiaki Bando, Eita Nakamura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara: Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models. 831-846
Volume 26, Number 5, May 2018
- Youssef El Baba, Andreas Walther, Emanuël A. P. Habets: 3D Room Geometry Inference Based on Room Impulse Response Stacks. 857-872
- Qian Zhang, John H. L. Hansen: Language/Dialect Recognition Based on Unsupervised Deep Learning. 873-882
- Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai: Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. 883-894
- Marc Delcroix, Keisuke Kinoshita, Atsunori Ogawa, Christian Huemmer, Tomohiro Nakatani: Context Adaptive Neural Network Based Acoustic Models for Rapid Adaptation. 895-908
- Linh Thi Thuc Tran, Sven Erik Nordholm, Henning F. Schepker, Hai Huyen Dam, Simon Doclo: Two-Microphone Hearing Aids Using Prediction Error Method for Adaptive Feedback Control. 909-923
- Jiho Chang, Marton Marschall: Periphony-Lattice Mixed-Order Ambisonic Scheme for Spherical Microphone Arrays. 924-936
- Nikolaos Dionelis, Mike Brookes: Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering. 937-950
- Chengshi Zheng, Antoine Deleforge, Xiaodong Li, Walter Kellermann: Statistical Analysis of the Multichannel Wiener Filter Using a Bivariate Normal Distribution for Sample Covariance Matrices. 951-966
- Colin Vaz, Vikram Ramanarayanan, Shrikanth S. Narayanan: Acoustic Denoising Using Dictionary Learning With Spectral and Temporal Regularization. 967-980
- Lin Wang, Andrea Cavallaro: Pseudo-Determined Blind Source Separation for Ad-hoc Microphone Networks. 981-994
- Sandro Cumani, Pietro Laface: Scoring Heterogeneous Speaker Vectors Using Nonlinear Transformations and Tied PLDA Models. 995-1009
- Giuliano Bernardi, Toon van Waterschoot, Jan Wouters, Marc Moonen: Subjective and Objective Sound-Quality Evaluation of Adaptive Feedback Cancellation Algorithms. 1010-1024
Volume 26, Number 6, June 2018
- Hirokazu Kameoka, Takuya Higuchi, Mikihiro Tanaka, Li Li: Nonnegative Matrix Factorization With Basis Clustering Using Cepstral Distance Regularization. 1025-1036
- Jacob Donley, Christian H. Ritz, W. Bastiaan Kleijn: Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters. 1037-1051
- Sebastian Braun, Adam Kuklasinski, Ofer Schwartz, Oliver Thiergart, Emanuël A. P. Habets, Sharon Gannot, Simon Doclo, Jesper Jensen: Evaluation and Comparison of Late Reverberation Power Spectral Density Estimators. 1052-1067
- Elie-Laurent Benaroya, Nicolas Obin, Marco Liuni, Axel Roebel, Wilson Raumel, Sylvain Argentieri: Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization. 1068-1078
- Nathanaël Perraudin, Nicki Holighaus, Piotr Majdak, Péter Balázs: Inpainting of Long Audio Segments With Similarity Graphs. 1079-1090
- Paul Magron, Roland Badeau, Bertrand David: Model-Based STFT Phase Recovery for Audio Source Separation. 1091-1101
- Ina Kodrasi, Simon Doclo: Analysis of Eigenvalue Decomposition-Based Late Reverberation Power Spectral Density Estimation. 1102-1114
- Sebastian Braun, Emanuël A. P. Habets: Linear Prediction-Based Online Dereverberation and Noise Reduction Using Alternating Kalman Filters. 1115-1125
- Dhananjay Ram, Afsaneh Asaei, Hervé Bourlard: Sparse Subspace Modeling for Query by Example Spoken Term Detection. 1126-1139
- Martin Krawczyk-Becker, Timo Gerkmann: On Speech Enhancement Under PSD Uncertainty. 1140-1149
- Simon Leglaive, Roland Badeau, Gaël Richard: Student's t Source and Mixing Models for Multichannel Audio Source Separation. 1150-1164
Volume 26, Number 7, July 2018
- Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda: Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis. 1173-1180
- Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee: A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. 1181-1193
- Miguel Ángel del Agua, Adrià Giménez, Alberto Sanchís, Jorge Civera, Alfons Juan: Speaker-Adapted Confidence Measures for ASR Using Deep Bidirectional Recurrent Neural Networks. 1194-1202
- Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão: Mispronunciation Detection in Children's Reading of Sentences. 1203-1215
- Ljubisa Stankovic, Milos Brajovic: Analysis of the Reconstruction of Sparse Signals in the DCT Domain Applied to Audio Signals. 1216-1231
- João Felipe Santos, Tiago H. Falk: Speech Dereverberation With Context-Aware Recurrent Neural Networks. 1232-1242
- Michele Geronazzo, Simone Spagnol, Federico Avanzini: Do We Need Individual Head-Related Transfer Functions for Vertical Localization? The Case Study of a Spectral Notch Distance Metric. 1243-1256
- Daniel Marquardt, Simon Doclo: Interaural Coherence Preservation for Binaural Noise Reduction Using Partial Noise Estimation and Spectral Postfiltering. 1257-1270
- Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan, Jesper Jensen: Bias-Compensated Informed Sound Source Localization Using Relative Transfer Functions. 1271-1285
- Fei Tao, Carlos Busso: Gating Neural Network for Large Vocabulary Audiovisual Speech Recognition. 1286-1298
Volume 26, Number 8, August 2018
- Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry FitzGerald, Bryan Pardo: An Overview of Lead and Accompaniment Separation in Music. 1307-1335
- Chien-Yao Wang, Jia-Ching Wang, Andri Santoso, Chin-Chin Chiang, Chung-Hsien Wu: Sound Event Recognition Using Auditory-Receptive-Field Binary Pattern and Hierarchical-Diving Deep Belief Network. 1336-1351
- Liner Yang, Meishan Zhang, Yang Liu, Maosong Sun, Nan Yu, Guohong Fu: Joint POS Tagging and Dependence Parsing With Transition-Based Neural Networks. 1352-1358
- Kai Yu, Zijian Zhao, Xueyang Wu, Hongtao Lin, Xuan Liu: Rich Short Text Conversation Using Semantic-Key-Controlled Sequence Generation. 1359-1368
- Bernhard Lehner, Jan Schlüter, Gerhard Widmer: Online, Loudness-Invariant Vocal Detection in Mixed Music Signals. 1369-1380
- Simon Stone, Michael Marxen, Peter Birkholz: Construction and Evaluation of a Parametric One-Dimensional Vocal Tract Model. 1381-1392
- Tian Tan, Yanmin Qian, Hu Hu, Ying Zhou, Wen Ding, Kai Yu: Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition. 1393-1405
- Xin Wang, Shinji Takaki, Junichi Yamagishi: Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis. 1406-1419
- Cassia Valentini-Botinhao, Junichi Yamagishi: Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech. 1420-1433
- Andreas I. Koutrouvelis, Thomas W. Sherson, Richard Heusdens, Richard C. Hendriks: A Low-Cost Robust Distributed Linearly Constrained Beamformer for Wireless Acoustic Sensor Networks With Arbitrary Topology. 1434-1448
Volume 26, Number 9, September 2018
- Chih-Wei Wu, Christian Dittmar, Carl Southall, Richard Vogl, Gerhard Widmer, Jason Hockman, Meinard Müller, Alexander Lerch: A Review of Automatic Drum Transcription. 1457-1483
- Christine Evers, Patrick A. Naylor: Acoustic SLAM. 1484-1498
- Clement Laroche, Matthieu Kowalski, Hélène Papadopoulos, Gaël Richard: Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation. 1499-1511
- Julio J. Carabias-Orti, Joonas Nikunen, Tuomas Virtanen, Pedro Vera-Candeas: Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization. 1512-1527
- Meishan Zhang, Nan Yu, Guohong Fu: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging. 1528-1538
- Dylan Menzies, Filippo Maria Fazi: A Complex Panning Method for Near-Field Imaging. 1539-1548
- Abhinav Misra, John H. L. Hansen: Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification. 1549-1558
- Yukoh Wakabayashi, Takahiro Fukumori, Masato Nakayama, Takanobu Nishiura, Yoichi Yamashita: Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging. 1559-1569
- Szu-Wei Fu, Taowei Wang, Yu Tsao, Xugang Lu, Hisashi Kawai: End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks. 1570-1584
- Ke Xiao, Supin Wang, Mingxi Wan, Liang Wu: Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation. 1585-1593
- Abdullah Fahim, Prasanga N. Samarasinghe, Thushara D. Abhayapala: PSD Estimation and Source Separation in a Noisy Reverberant Environment Using a Spherical Microphone Array. 1594-1607
- Hongsen He, Jingdong Chen, Jacob Benesty, Tao Yang: Noise Robust Frequency-Domain Adaptive Blind Multichannel Identification With ℓp-Norm Constraint. 1608-1619
- Weiwei Zhang, Zhe Chen, Fuliang Yin, Qiaoling Zhang: Melody Extraction From Polyphonic Music Using Particle Filter and Dynamic Programming. 1620-1632
- Chunlei Zhang, Kazuhito Koishida, John H. L. Hansen: Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings. 1633-1644
- M. V. Achuth Rao, Prasanta Kumar Ghosh: PSFM - A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection. 1645-1657
- Manu Airaksinen, Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku: A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis. 1658-1670
- Gaël Mahé, Meriem Jaïdane: Perceptually Controlled Reshaping of Sound Histograms. 1671-1683
- Qinghua Huang, Lin Zhang, Yong Fang: Two-Step Spherical Harmonics ESPRIT-Type Algorithms and Performance Analysis. 1684-1697
Volume 26, Number 10, October 2018
- DeLiang Wang, Jitong Chen: Supervised Speech Separation Based on Deep Learning: An Overview. 1702-1726
- Rui Wang, Masao Utiyama, Andrew M. Finch, Lemao Liu, Kehai Chen, Eiichiro Sumita: Sentence Selection and Weighting for Neural Machine Translation Domain Adaptation. 1727-1741
- Faheem Khan, Ben P. Milner, Thomas Le Cornu: Using Visual Speech Information in Masking Methods for Audio Speaker Separation. 1742-1754
- Xiaofei Li, Sharon Gannot, Laurent Girin, Radu Horaud: Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function. 1755-1768
- Lutfi Kerem Senel, Ihsan Utlu, Veysel Yücesoy, Aykut Koç, Tolga Çukur: Semantic Structure and Interpretability of Word Embeddings. 1769-1779
- Yuma Koizumi, Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi, Yoichi Haneda: DNN-Based Source Enhancement to Increase Objective Sound Quality Assessment Score. 1780-1792
- Constantin Paleologu, Jacob Benesty, Silviu Ciochina: Linear System Identification Based on a Kronecker Product Decomposition. 1793-1808
- Feifei Xiong, Stefan Goetze, Birger Kollmeier, Bernd T. Meyer: Exploring Auditory-Inspired Acoustic Features for Room Acoustic Parameter Estimation From Monaural Speech. 1809-1820
- Gaël Le Lan, Delphine Charlet, Anthony Larcher, Sylvain Meignier: An Adaptive Method for Cross-Recording Speaker Diarization. 1821-1832
- Wei Xue, Alastair H. Moore, Mike Brookes, Patrick A. Naylor: Modulation-Domain Multichannel Kalman Filtering for Speech Enhancement. 1833-1847
- Kai Wu, Vaninirappuputhenpurayil Gopalan Reju, Andy W. H. Khong: Multisource DOA Estimation in a Reverberant Environment Using a Single Acoustic Vector Sensor. 1848-1859
- Jizhou Huang, Yaming Sun, Wei Zhang, Haifeng Wang, Ting Liu: Entity Highlight Generation as Statistical and Neural Machine Translation. 1860-1872
- Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura: Sequence-to-Sequence Models for Emphasis Speech Translation. 1873-1883
- Federico Fontana, Enrico Bozzo: Explicit Fixed-Point Computation of Nonlinear Delay-Free Loop Filter Networks. 1884-1896
- Simon Widmark: Causal IIR Audio Precompensator Filters Subject to Quadratic Constraints. 1897-1912
- Fiete Winter, Hagen Wierstorf, Christoph Hold, Frank Krüger, Alexander Raake, Sascha Spors: Colouration in Local Wave Field Synthesis. 1913-1924
- Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen: Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks. 1925-1939
Volume 26, Number 11, November 2018
- Hossein Hadian, Hossein Sameti, Daniel Povey, Sanjeev Khudanpur: Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR. 1949-1961
- Fabrice Katzberg, Radoslaw Mazur, Marco Maaß, Philipp Koch, Alfred Mertins: A Compressed Sensing Framework for Dynamic Sound-Field Measurements. 1962-1975
- Sundar Harshavardhan, Thippur V. Sreenivas, Chandra Sekhar Seelamantula: TDOA-Based Multiple Acoustic Source Localization Without Association Ambiguity. 1976-1990
- Reza Sahraeian, Dirk Van Compernolle: Cross-Entropy Training of DNN Ensemble Acoustic Models for Low-Resource ASR. 1991-2001
- Heinrich Dinkel, Yanmin Qian, Kai Yu: Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection. 2002-2014
- Jie Zhang, Richard Heusdens, Richard Christian Hendriks: Rate-Distributed Spatial Filtering Based Noise Reduction in Wireless Acoustic Sensor Networks. 2015-2026
- Michael Heck, Sakriani Sakti, Satoshi Nakamura: Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling. 2027-2042
- Shuai Nie, Shan Liang, Wenju Liu, Xueliang Zhang, Jianhua Tao: Deep Learning Based Speech Separation via NMF-Style Reconstructions. 2043-2055
- Harishchandra Dubey, Abhijeet Sangwan, John H. L. Hansen: Leveraging Frequency-Dependent Kernel and DIP-Based Clustering for Robust Speech Activity Detection in Naturalistic Audio Streams. 2056-2071
- Youngsoo Jang, Jiyeon Ham, Byung-Jun Lee, Kee-Eung Kim: Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention. 2072-2082
- Gellért Weisz, Pawel Budzianowski, Pei-Hao Su, Milica Gasic: Sample Efficient Deep Reinforcement Learning for Dialogue Systems With Large Action Spaces. 2083-2097
- Shoufeng Lin: Reverberation-Robust Localization of Speakers Using Distinct Speech Onsets and Multichannel Cross Correlations. 2098-2111
- Shamsiah Abidin, Roberto Togneri, Ferdous Ahmed Sohel: Spectrotemporal Analysis Using Local Binary Pattern Variants for Acoustic Scene Classification. 2112-2121
- Ning Ma, José A. González, Guy J. Brown: Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks. 2122-2131
- Shuangzhi Wu, Dongdong Zhang, Zhirui Zhang, Nan Yang, Mu Li, Ming Zhou: Dependency-to-Dependency Neural Machine Translation. 2132-2141
- Jingjing Xu, Hangfeng He, Xu Sun, Xuancheng Ren, Sujian Li: Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model. 2142-2152
- Steven Van Kuyk, W. Bastiaan Kleijn, Richard Christian Hendriks: An Evaluation of Intrusive Instrumental Intelligibility Metrics. 2153-2166
- Xi Ouyang, Kang Gu, Pan Zhou: Spatial Pyramid Pooling Mechanism in 3D Convolutional Network for Sentence-Level Classification. 2167-2179
- Brian McFee, Justin Salamon, Juan Pablo Bello: Adaptive Pooling Operators for Weakly Labeled Sound Event Detection. 2180-2193
- Isabel Barbancho, George Tzanetakis, Ana M. Barbancho, Lorenzo J. Tardón: Discrimination Between Ascending/Descending Pitch Arpeggios. 2194-2203
- Younggwan Kim, Myung Jong Kim, Jahyun Goo, Hoirin Kim: Learning Self-Informed Feature Contribution for Deep Learning-Based Acoustic Modeling. 2204-2214
- Mert Burkay Çöteli, Orhun Olgun, Hüseyin Hacihabiboglu: Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement. 2215-2229
- Junwei Bao, Yeyun Gong, Nan Duan, Ming Zhou, Tiejun Zhao: Question Generation With Doubly Adversarial Nets. 2230-2239
- Bing Bu, Changchun Bao, Mao-shen Jia: Design of a Planar First-Order Loudspeaker Array for Global Active Noise Control. 2240-2250
Volume 26, Number 12, December 2018
- Xing Wang, Zhaopeng Tu, Min Zhang: Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation. 2255-2266
- Yunxin Zhao, Mili Kuruvilla-Dugdale, Minguang Song: Structured Sparse Spectral Transforms and Structural Measures for Voice Conversion. 2267-2276
- Haniyeh Salehi, David Suelzle, Paula Folkeard, Vijay Parsa: Learning-Based Reference-Free Speech Quality Measures for Hearing Aid Applications. 2277-2288
- Gerald Enzner, Philipp Thüne: Bayesian MMSE Filtering of Noisy Speech by SNR Marginalization With Global PSD Priors. 2289-2304
- Gongping Huang, Jingdong Chen, Jacob Benesty: Insights Into Frequency-Invariant Beamforming With Concentric Circular Microphone Arrays. 2305-2318
- Shiqi Shen, Yun Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun: Zero-Shot Cross-Lingual Neural Headline Generation. 2319-2327
- Sudeep Surendran, T. Kishore Kumar: Oblique Projection and Cepstral Subtraction in Signal Subspace Speech Enhancement for Colored Noise Reduction. 2328-2340
- Qiang Li, Derek F. Wong, Lidia S. Chao, Muhua Zhu, Tong Xiao, Jingbo Zhu, Min Zhang: Linguistic Knowledge-Aware Neural Machine Translation. 2341-2354
- Wen Zhang, Christian Hofmann, Michael Buerger, Thushara Dheemantha Abhayapala, Walter Kellermann: Spatial Noise-Field Control With Online Secondary Path Modeling: A Wave-Domain Approach. 2355-2370
- Adrien Meynard, Bruno Torrésani: Spectral Analysis for Nonstationary Audio. 2371-2380
- Irene Martín-Morató, Maximo Cobos, Francesc J. Ferri: Adaptive Mid-Term Representations for Robust Audio Event Classification. 2381-2392
- Gergely Firtha, Péter Fiala, Frank Schultz, Sascha Spors: On the General Relation of Wave Field Synthesis and Spectral Division Method for Linear Arrays. 2393-2403
- Peter Birkholz, Simon Stone, Klaus Wolf, Dirk Plettemeier: Non-Invasive Silent Phoneme Recognition Using Microwave Signals. 2404-2411
- Wei-Wei Lin, Man-Wai Mak, Jen-Tzung Chien: Multisource I-Vectors Domain Adaptation Using Maximum Mean Discrepancy Based Autoencoders. 2412-2422
- Mohammed Abdel-Wahab, Carlos Busso: Domain Adversarial for Acoustic Emotion Recognition. 2423-2435
- Dalia El Badawy, Ivan Dokmanic: Direction of Arrival With One Microphone, a Few LEGOs, and Non-Negative Matrix Factorization. 2436-2446
- Hung-yi Lee, Pei-Hung Chung, Yen-Chen Wu, Tzu-Hsiang Lin, Tsung-Hsien Wen: Interactive Spoken Content Retrieval by Deep Reinforcement Learning. 2447-2459
- Samy Elshamy, Nilesh Madhu, Wouter Tirry, Tim Fingscheidt: DNN-Supported Speech Enhancement With Cepstral Estimation of Both Excitation and Envelope. 2460-2474
- Yu Bao, Huawei Chen: A Chance-Constrained Programming Approach to the Design of Robust Broadband Beamformers With Microphone Mismatches. 2475-2488
- Haizhou Li: Farewell Editorial. 2489
