


default search action
International Journal of Speech Technology, Volume 27
Volume 27, Number 1, March 2024
- Tebbi Hanane
, Maamar Hamadouche:
Multi-agent based Arabic speech synthesis. 1-17 - Shrikala Deshmukh
, Preeti Gupta
:
Application of probabilistic neural network for speech emotion recognition. 19-28 - Aparna Vyakaranam
, Tomas Maul, Bavani Ramayah
:
A review on speech emotion recognition for late deafened educators in online education. 29-52 - Latifa Iben Nasr
, Abir Masmoudi
, Lamia Hadrich Belguith:
Survey on Arabic speech emotion recognition. 53-68 - Kimiya Nourali, Elham Dolkhani
:
Scene text visual question answering by using YOLO and STN. 69-76 - B. G. Nagaraja, Thimmaraja Yadava G.
, Mohamed Anees
:
Advancements in encoded speech data by background noise suppression under uncontrolled environment. 77-84 - O. Homa Kesav, G. K. Rajini:
Correction to: Automated detection system for texture feature based classification on different image datasets using S-transform. 85 - Mohit Dua
, Bhavesh Bhagat, Shelza Dua:
An amalgamation of integrated features with DeepSpeech2 architecture and improved spell corrector for improving Gujarati language ASR system. 87-99 - Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala:
Stockwell-Transform based feature representation for detection and assessment of voice disorders. 101-119 - Merouane Bouzid
, Nacèra Meziane, Salah Eddine Cheraitia:
Multi-coder vector quantizer for transparent coding of wideband speech ISF parameters. 121-132 - Mohit Dua
, Bhavesh Bhagat, Shelza Dua, Nidhi Chakravarty
:
A review on Gujarati language based automatic speech recognition (ASR) systems. 133-156 - Assal A. M. Alqudah, Mohammad A. M. Alshraideh, Mohammad A. M. Abushariah, Ahmad A. S. Sharieh:
Modern Standard Arabic speech disorders corpus for digital speech processing applications. 157-170 - Albert Cryssiover
, Amalia Zahra:
Speech recognition model design for Sundanese language using WAV2VEC 2.0. 171-177 - Marek B. Trawicki
:
Automatic gender recognition and speaker identification of Rhesus Macaques (Macaca mulatta) using hidden Markov models (HMMs). 179-186 - P. Ashwini
, S. H. Bharathi
:
Continuous feature learning representation to XGBoost classifier on the aggregation of discriminative Features using DenseNet-121 architecture and ResNet 18 architectures towards Apraxia Recognition in the Child Speech Therapy. 187-199 - Chengyong Yang, Xiukang Yu, Sheng Huang:
Conditional Denoising Diffusion Implicit Model for Speech Enhancement. 201-209 - Omayma Mahmoudi
, Mouncef Filali Bouami
, Mohamed Benchat:
Speech recognition based on the transformer's multi-head attention in Arabic. 211-223 - Nidhi Chakravarty
, Mohit Dua
:
Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection. 225-237 - N. Aishwarya
, Kanwaljeet Kaur
, Karthik Seemakurthy:
A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning. 239-254 - Antor Mahamudul Hashan
, Chaganov Roman Dmitrievich, Melnikov Alexander Valerievich
, Dorokh Danila Vasilyevich, Khlebnikov Nikolai Alexandrovich
, Boris Andreevich Bredikhin
:
Hyperkinetic Dysarthria voice abnormalities: a neural network solution for text translation. 255-265 - Haidy H. Mustafa
, Nagy Ramadan Darwish, Hesham A. Hefny:
Automatic Speech Emotion Recognition: a Systematic Literature Review. 267-285 - Hossam Boulal, Mohamed Hamidi
, Mustapha Abarkan, Jamal Barkani:
Amazigh CNN speech recognition system based on Mel spectrogram feature extraction method. 287-296 - Retraction Note: Computer vision for facial analysis using human-computer interaction models. 297
Volume 27, Number 2, June 2024
- Shilin Wang, Haixin Guan
, Shuang Wei, Yanhua Long
:
Improving low-complexity and real-time DeepFilterNet2 for personalized speech enhancement. 299-306 - Retraction Note: Speaker identification using hybrid neural network support vector machine classifier. 307
- B. G. Nagaraja, Thimmaraja Yadava G.
, Prashanth Kabballi, Chandrashekar M. Patil:
VAD system under uncontrolled environment: A solution for strengthening the noise robustness using MMSE-SPZC. 309-317 - Yongyan Yang:
Feature fusion: research on emotion recognition in English speech. 319-327 - Abdelkbir Ouisaadane
, Said Safi, Miloud Frikel:
An experiment of Moroccan dialect speech recognition in noisy environments using PocketSphinx. 329-339 - Mohammed Hamzah Abed
, Dávid Sztahó:
Effect of identical twins on deep speaker embeddings based forensic voice comparison. 341-351 - Arijul Haque, Krothapalli Sreenivasa Rao:
Speech emotion recognition with transfer learning and multi-condition training for noisy environments. 353-365 - Youyuan Zhang, Sashank Gondala, Thiago Fraga-Silva, Christophe Van Gysel
:
Server-side rescoring of spoken entity-centric knowledge queries for virtual assistants. 367-375 - Qian Shen, Mengxi Guo, YiDa Huang, Jianfen Ma:
Attentional multi-feature fusion for spoofing-aware speaker verification. 377-387 - Maysa Khalil
, Mohammad Azzeh
:
Fake news detection models using the largest social media ground-truth dataset (TruthSeeker). 389-404 - Mohamed Abdelkarim Remmide
, Fatima Boumahdi, Imane Rebeh Ammar Aouchiche, Amina Guendouz, Narhimene Boustia:
A robust approach to authorship verification using siamese deep learning: application in phishing email detection. 405-412 - Meriem Lounis, Bilal Dendani
, Halima Bahi
:
Anomaly detection with a variational autoencoder for Arabic mispronunciation detection. 413-424 - Shaik Mulla Shabber
, Mohan Bansal
:
Temporal feature-based approaches for enhancing phoneme boundary detection and masking in speech. 425-436 - Abderrahmane Louni
, Leila Rizoug, Abderrahim Belmadani:
A quantal model for Algerian vowel features identification using formants and subglottal resonances. 437-445 - Joan L. Imbwaga, Nagaratna B. Chittaragi, Shashidhar G. Koolagudi:
Automatic hate speech detection in audio using machine learning algorithms. 447-469 - Abdul Malik Abbasi
, Bisma Butt, Illahi Bux Gopang, Ahlam Khan, Kiran Naz, Dure Shehwar:
Analyzing acoustic patterns of vowel sounds produced by native Rangri speakers. 471-481 - Soumeya Belabbas
, Djamel Addou, Sid-Ahmed Selouani:
Pathological voice classification system based on CNN-BiLSTM network using speech enhancement and multi-stream approach. 483-502 - (Withdrawn) RETRACTED ARTICLE: Research on pronunciation accuracy detection of English Chinese consecutive interpretation in English intelligent speech translation terminal. 503
- (Withdrawn) RETRACTED ARTICLE: Construction of voice access clustering model for online shopping user groups based on electronic communication data mining algorithm. 505
- (Withdrawn) RETRACTED ARTICLE: Research on life prediction method of rolling bearing based on deep learning and voice interaction technology. 507
- (Withdrawn) RETRACTED ARTICLE: Speech fault recognition method of music intelligent player based on communication feature analysis. 509
- (Withdrawn) RETRACTED ARTICLE: Ice detection and voice alarm of wind turbine blades based on belief network. 511
- (Withdrawn) RETRACTED ARTICLE: Accurate recognition of heterogeneous features in super resolution image visualization based on voice remote control system. 513
- (Withdrawn) RETRACTED ARTICLE: Dialect recognition from Telugu speech utterances using spectral and prosodic features. 515
- (Withdrawn) RETRACTED ARTICLE: Deep learning based cardiovascular disease diagnosis system from heartbeat sound. 517
- (Withdrawn) RETRACTED ARTICLE: Wearable sensor based acoustic gait analysis using phase transition-based optimization algorithm on IoT. 519
Volume 27, Number 3, September 2024
- Mohammad Azzeh, Bushra Alhijawi
, Abedrahman Tabbaza, Omar Alabboshi, Nancy Hamdan, Dareen Jaser:
Arabic cyberbullying detection system using convolutional neural network and multi-head attention. 521-537 - B. G. Nagaraja, Thimmaraja Yadava G.
, K. Harshitha:
Noise robust speech encoding system in challenging acoustic conditions. 539-549 - R. Ramesh, V. B. Prahaladhan, P. Nithish, K. Mohanaprasad
:
Speech emotion recognition using the novel SwinEmoNet (Shifted Window Transformer Emotion Network). 551-568 - Kotha Manohar
, E. Logashanmugam:
ADMRF: Elucidation of deep feature extraction and adaptive deep Markov random fields with improved heuristic algorithm for speech emotion recognition. 569-597 - Veena Karjigi
, S. Roopa, H. M. Chandrashekar:
Investigation of different time-frequency representations for detection of fricatives. 599-611 - Pedro Nascimento
, João C. Ferreira
, Fernando Batista
:
Automatic transcription system for parliamentary debates in the context of assembly of the republic of Portugal. 613-635 - M. Balasubrahmanyam, R. S. Valarmathi:
An intelligent speech enhancement model using enhanced heuristic-based residual convolutional neural network with encoder-decoder architecture. 637-656 - V. Shibina
, T. M. Thasleema
:
A hybrid approach to detecting Parkinson's disease using spectrogram and deep learning CNN-LSTM network. 657-671 - Mohammed M. Nasef
, Amr A. Elshall, Amr M. Sauber:
ArabRecognizer: modern standard Arabic speech recognition inspired by DeepSpeech2 utilizing Franco-Arabic. 673-686 - Abdulqahar Mukhtar Abubakar
, Deepa Gupta, Susmitha Vekkot
:
Development of a diacritic-aware large vocabulary automatic speech recognition for Hausa language. 687-700 - Kurma Venkata Keerthana Sai, Rompicharla Thanmayee Krishna, Kodali Radha
, Dhulipalla Venkata Rao, Abdul Muneera:
Automated ASD detection in children from raw speech using customized STFT-CNN model. 701-716 - Malay Kumar Majhi, Sujan Kumar Saha
:
An automatic speech recognition system in Odia language using attention mechanism and data augmentation. 717-728 - Abhijit Barman
, Diganta Saha, Alok Ranjan Pal:
Bengali reduplication generation with finite-state transducers (FSTs). 729-737 - Sahar Farazi, Yasser Shekofteh
:
Voice pathology detection on spontaneous speech data using deep learning models. 739-751 - K. Venkatesh Sharma
, Pramod Reddy Ayiluri
, Rakesh Betala, P. Jagdish Kumar, K. Shirisha Reddy
:
Enhancing query relevance: leveraging SBERT and cosine similarity for optimal information retrieval. 753-763 - Luis Lugo, Valentin Vielzeuf:
Efficiency-oriented approaches for self-supervised speech representation learning. 765-779 - Suresh Veesa
, Madhusudan Singh:
Implicit processing of linear prediction residual for replay attack detection. 781-791 - Joan L. Imbwaga, Nagaratna B. Chittaragi, Shashidhar G. Koolagudi:
Explainable hate speech detection using LIME. 793-815 - D. Thiripurasundari, Kishor Bhangale, V. Aashritha, Sisira Mondreti, Mohanaprasad Kothandaraman
:
Speech emotion recognition for human-computer interaction. 817-830 - Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala:
Epoch extraction in real-world scenario. 831-845
Volume 27, Number 4, December 2024
- Marie Kunesová
, Zbynek Zajíc
, Lubos Smídl
, Martin Karafiát
:
Comparison of wav2vec 2.0 models on three speech processing tasks. 847-859 - Vitalii Fishchuk, Daniel Braun
:
Robustness of generative AI detection: adversarial attacks on black-box neural text detectors. 861-874 - Pantid Chantangphol
, Theerat Sakdejayont, Tawunrat Chalothorn:
Enhancing spoken term detection with deep acoustic word embeddings and cross-modal matching techniques. 875-886 - Irene Morazzoni, Vincenzo Scotti
, Roberto Tedesco
:
Def2Vec: you shall know a word by its definition. 887-899 - Nils Constantin Hellwig
, Jakob Fehle, Markus Bink, Thomas Schmidt, Christian Wolff:
Exploring Twitter discourse with BERTopic: topic modeling of tweets related to the major German parties during the 2021 German federal election. 901-921 - Nicolas Ballier
, Taylor B. Arnold
, Adrien Méli, Tori Thurston, Jean-Baptiste Yunès
:
Whisper for L2 speech scoring. 923-934 - Kristina Schaaff
, Tim Schlippe, Lorenz Mindner:
Classification of human- and AI-generated texts for different languages and domains. 935-956 - Lucia Ormaechea Grijalba, Nikos Tsourakis
:
Automatic text simplification for French: model fine-tuning for simplicity assessment and simpler text generation. 957-976 - S. Roopa, Veena Karjigi, H. M. Chandrashekar:
Analyzing fricative confusions in healthy and pathological speech using modified S-transform. 977-985 - M. R. Prasad
, Sharana Basavana Gowda
, Manjunath B. Talawar
, N. Jagadisha
:
Integrated noise suppression techniques for enhancing voice activity detection in degraded environments. 987-995 - Ouardia Abdelli
, Fatiha Merazka:
Deep learning for speech denoising with improved Wiener approach. 997-1012 - Manish Tiwari, Deepak Kumar Verma:
Enhanced text-independent speaker recognition using MFCC, Bi-LSTM, and CNN-based noise removal techniques. 1013-1026 - Kabir Garba, Taiwo Kolajo
, Joshua B. Agbogun:
A transformer-based approach to Nigerian Pidgin text generation. 1027-1037 - Ohidujjaman
, Mahmudul Hasan, Shiming Zhang, Mohammad Nurul Huda, Mohammad Shorif Uddin:
Spectral analysis of bone-conducted speech using modified linear prediction. 1039-1053 - Fahmida Khanom
, Shuvo Biswas
, Mohammad Shorif Uddin, Rafid Mostafiz
:
XEMLPD: an explainable ensemble machine learning approach for Parkinson disease diagnosis with optimized features. 1055-1083 - Ohidujjaman
, Mahmudul Hasan, Shiming Zhang, Mohammad Nurul Huda, Mohammad Shorif Uddin:
Ill-condition enhancement for BC speech using RMC method. 1085-1092 - Dominik Wagner, Ilja Baumann, Tobias Bocklet:
Generative adversarial networks for whispered to voiced speech conversion: a comparative study. 1093-1110 - Darong Chen, Guangguang Yang, Guangyong Wei, Fahad Anwaar, Jiaxin Yang, Wenxiao Dong, Jiafeng Zhang:
Sub-layer feature fusion applied to transformer model for automatic speech recognition. 1111-1120 - Meryam Telmem, Naouar Laaidi, Youssef Ghanou, Sanae Hamiane, Hassan Satori:
Comparative study of CNN, LSTM and hybrid CNN-LSTM model in amazigh speech recognition using spectrogram feature extraction and different gender and age dataset. 1121-1133 - Oindrila Banerjee
, D. Govind
:
Investigating the role of electroencephalogram based rhythmic bands: applications in imagined speech classification and epileptic seizure severity detection. 1135-1147 - Soumia Chafi, Mustapha Kabil
, Abdessamad Kamouss:
Distributed CV classification with attention mechanisms. 1149-1157

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.