International Journal of Speech Technology, Volume 25
Volume 25, Number 1, March 2022
- Wided Bakari, Mahmoud Neji:
A novel semantic and logical-based approach integrating RTE technique in the Arabic question-answering. 1-17
- Mohamed Morchid:
Bidirectional internal memory gate recurrent neural networks for spoken language understanding. 19-27
- Rim Laatar, Chafik Aloulou, Lamia Hadrich Belguith:
Towards a historical dictionary for Arabic language. 29-41
- Eiman Alsharhan, Allan Ramsay, Hanady Ahmed:
Evaluating the effect of using different transcription schemes in building a speech recognition system for Arabic. 43-56
- Ricky Mohanty, Bandi Kumar Mallik, Sandeep Singh Solanki:
Normalized approximate descent used for spike based automatic bird species recognition system. 57-65
- Ankit Kumar, Rajesh Kumar Aggarwal:
Hindi speech recognition using time delay neural network acoustic modeling with i-vector adaptation. 67-78
- Fady K. Fahmy, Hazem M. Abbas, Mahmoud I. Khalil:
Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture. 79-88
- Abir Masmoudi, Chafik Aloulou, Abdel Ghader Sidi Abdellahi, Lamia Hadrich Belguith:
Automatic diacritization of Tunisian dialect text using SMT model. 89-104
- Aakshi Mittal, Mohit Dua:
Automatic speaker verification systems and spoof detection techniques: review and analysis. 105-134
- Song-Il Mun, Chol-Jin Han, Hye-Song Hong:
Exploiting variable length segments with coarticulation effect in online speech recognition based on deep bidirectional recurrent neural network and context-sensitive segment. 135-146
- Sreehari Vanajavilas Ravindran, Leena Mary:
Automatic short utterance speaker recognition using stationary wavelet coefficients of pitch synchronised LP residual. 147-161
- Achraf Benba, Imane Laaqira, Abdelilah Jilbab, Ahmed Hammouch:
Using novel method: Real Cepstral Discrete Cosine Transform, for detecting Parkinson from multiple system atrophy, other neurological diseases and healthy cases using voice analysis. 163-172
- Bidhan Barai, Tapas Chakraborty, Nibaran Das, Subhadip Basu, Mita Nasipuri:
Closed-set speaker identification using VQ and GMM based models. 173-196
- Pavan Raju Kammili, B. H. V. S. Ramakrishnam Raju, A. Sri Krishna:
Handling emotional speech: a prosody based data augmentation technique for improving neutral speech trained ASR systems. 197-204
- Hyok-Chol Ri, Chol Kim, Mok-Ran Jo:
A method for constructing Korean spontaneous spoken language corpus based on an imitation of abbreviated and transformed particles. 205-210
- Lee-Chung Kwek, Alan Wee-Chiat Tan, Heng-Siong Lim, Cheah Heng Tan, Khaled A. Alaghbari:
Sparse representation and reproduction of speech signals in complex Fourier basis. 211-217
- Nikunj Tahilramani, Ninad Bhatt:
Information hiding in proposed 10.6 kbps CS-ACELP based speech codec using Quantization Index Modulation. 219-230
- Gonzalo D. Sad, Lucas D. Terissi, Juan Carlos Gómez:
Complementary models for audio-visual speech classification. 231-249
- Tiantian Tang, Yanhua Long, Yijie Li, Jiaen Liang:
Acoustic domain mismatch compensation in bird audio detection. 251-260
- Jiangyu Han, Yan Shi, Yanhua Long, Jiaen Liang:
Exploring single channel speech separation for short-time text-dependent speaker verification. 261-268
- Lallouani Bouchakour, Mohamed Debyeche:
Noise-robust speech recognition in mobile network based on convolution neural networks. 269-277
- Khaled M. Abdelwahab, Saied M. Abd El-atty, Ayman M. Brisha, Fathi E. Abd El-Samie:
Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD. 279-288
- Emilia Parada-Cabaleiro, Anton Batliner, Alice Baird, Björn W. Schuller:
Correction to: The perception of emotional cues by children in artificial background noise. 289
Volume 25, Number 2, June 2022
- (Withdrawn) Big Data Analytics integrated AAC Framework for English language teaching. 291-304
- Yubin Liu, C. B. Sivaparthipan, Achyut Shankar:
Human-computer interaction based visual feedback system for augmentative and alternative communication. 305-314
- Ping Zhang, K. Deepa Thilak, Renjith V. Ravi:
Big data analytics and augmentative and alternative communication in EFL teaching. 315-329
- Wei Li, Xiaoli Qiu, Yang Li, Jing Ji, Xinxin Liu, Shuanzhu Li:
Towards a novel machine learning approach to support augmentative and alternative communication (AAC). 331-341
- Min Wang, BalaAnand Muthu, C. B. Sivaparthipan:
Smart assistance to dyslexia students using artificial intelligence based augmentative alternative communication. 343-353
- Xiang Lan, Zhongwang Cao, Le Yu:
Analyzing the mental states of the sports student based on augmentative communication with human-computer interaction. 355-365
- Wenjuan Hu, Premalatha R, R. S. Aiswarya:
Physical education system and training framework based on human-computer interaction for augmentative and alternative communication. 367-377
- (Withdrawn) Computer vision for facial analysis using human-computer interaction models. 379-389
- Man Liu:
English speech emotion recognition method based on speech recognition. 391-398
- Shanshan Yang, Ding Liu:
Automatic annotation method of VR speech corpus based on artificial intelligence. 399-407
- Ran Qian, Sudhakar Sengan, Sapna Juneja:
English language teaching based on big data analytics in augmentative and alternative communication system. 409-420
- Yanmei Huang, Qiang Mei, Mulan Hu, Thanjai Vadivel, A. Daison Raj:
A voice-assisted intelligent software architecture based on deep game network. 421-433
- K. Meenakshi, G. Maragatham:
AdvIris: a hybrid approach to detecting adversarial iris examples using wavelet transform. 435-441
- Khaled Lounnas, Mourad Abbas, Mohamed Lichouri, Mohamed Hamidi, Hassan Satori, Hocine Teffahi:
Enhancement of spoken digits recognition for under-resourced languages: case of Algerian and Moroccan dialects. 443-455
- Lakshmi Srinivas Dendukuri, Jakeer Hussain Shaik:
Emotional speech analysis and classification using variational mode decomposition. 457-469
- Mohamed Monir, Mona Kareem, Sami M. El-Dolil, Adel A. Saleeb, Adel S. El-Fishawy, Mohamed Abd-Elsalam Nassar, Mohamed A. Zein Eldin, Fathi E. Abd El-Samie:
Cancelable speaker identification based on cepstral coefficients and comb filters. 471-492
- Hao Wu, Linkai Luo, Hong Peng, Wei Wen:
A method of multi-models fusion for speaker recognition. 493-498
- Gautam Chakraborty, Mridusmita Sharma, Navajit Saikia, Kandarpa Kumar Sarma:
Soft-computation based speech recognition system for Sylheti language. 499-509
- Pradeep Tiwari, Anand D. Darji:
Pertinent feature selection techniques for automatic emotion recognition in stressed speech. 511-526
- Girish Gidaye, Jagannath H. Nirmal, Kadria Ezzine, Mondher Frikha:
Unified wavelet-based framework for evaluation of voice impairment. 527-548
- Girish Gidaye, Jagannath H. Nirmal, Kadria Ezzine, Mondher Frikha:
Correction to: Unified wavelet-based framework for evaluation of voice impairment. 549
Volume 25, Number 3, September 2022
- D. Bhavana, K. Kishore Kumar, D. Ravi Tej:
Infrared and visible image fusion using latent low rank technique for surveillance applications. 551-560
- Basavoju Harish, Mulpuri Santhi Sri Rukmini, Kosaraju Sivani:
Design of MAC unit for digital filters in signal processing and communication. 561-565
- P. Ramakrishna, K. Hari Kishore:
A low power reconfigurable ADC for bioimpedance monitoring system. 567-574
- (Withdrawn) Audio fingerprint analysis for speech processing using deep learning method. 575-581
- Rohit Lamba, Tarun Gulati, Hadeel Fahad Alharbi, Anurag Jain:
A hybrid system for Parkinson's disease diagnosis using machine learning techniques. 583-593
- A. Vijayarani, G. G. Lakshmi Priya:
Salient object detection based on adaptive recalibration technique through deep network. 595-604
- (Withdrawn) Nonlinear acoustic noise cancellation based automatic speech recognition system (NANC-ASR) with convolutional neural networks. 605-613
- (Withdrawn) Drought Prediction and Analysis of Water level based on satellite images Using Deep Convolutional Neural Network. 615-623
- (Withdrawn) Detecting adversarial attacks on audio-visual speech recognition using deep learning method. 625-631
- Niveditha V. R., Senthilnathan Palaniappan, K. Naresh, Chinmaya Kumar Nayak, B. Swapna:
High speed low area decimation filter for hearing aid application. 633-639
- (Withdrawn) An adaptive speech signal processing for COVID-19 detection using deep learning approach. 641-649
- Hamsa A. Abdullah, Raya K. Mohammed:
FPGA-based modified chaotic system for speech transmission. 651-657
- Zinah Abdulridha Abutiheen, Enas Ali Mohammed, Mohsin Hasan Hussein:
Behavior analysis in Arabic social media. 659-666
- Long Shi:
Application of big data language recognition technology and GPU parallel computing in English teaching visualization system. 667-677
- Samia Abd El-Moneim, Eman Abd El-Mordy, Mohamed Abd-Elsalam Nassar, Moawad I. Dessouky, Nabil A. Ismail, Adel S. El-Fishawy, Sami A. El-Dolil, Ibrahim M. El-Dokany, Fathi E. Abd El-Samie:
Performance enhancement of text-independent speaker recognition in noisy and reverberation conditions using Radon transform with deep learning. 679-687
- Samia Abd El-Moneim, Mohamed Abd-Elsalam Nassar, Moawad I. Dessouky, Nabil A. Ismail, Adel S. El-Fishawy, Fathi E. Abd El-Samie:
Cancellable template generation for speaker recognition based on spectrogram patch selection and deep convolutional neural networks. 689-696
- Sunil Kumar Koduri, Kishore Kumar Tappeta:
Discrete cosine transform-based data hiding for speech bandwidth extension. 697-706
- Tulika Jha, Ramisetty Kavya, J. Jabez Christopher, Vasan Arunachalam:
Machine learning techniques for speech emotion recognition using paralinguistic acoustic features. 707-725
- Anshul Kumar, Ankit Kumar Jain:
Emotion detection in psychological texts by fine-tuning BERT using emotion-cause pair extraction. 727-743
- Rahul Kumar Jaiswal, Sreenivasa Reddy Yeduri, Linga Reddy Cenkeramaddi:
Single-channel speech enhancement using implicit Wiener filter for high-quality speech communication. 745-758
- Chinmay Maiti, Bibhas Chandra Dhara:
A blind audio watermarking based on singular value decomposition and quantization. 759-771
- Vijay M. Sardar, Manisha L. Jadhav, Saurabh H. Deshmukh:
Timbre features with MEDIAN values for compensating intra-speaker variability in speaker identification of whispering sound. 773-782
Volume 25, Number 4, December 2022
- Praseetha V. M., P. P. Joby:
Speech emotion recognition using data augmentation. 783-792
- S. Anjali Devi, S. Sivakumar:
An efficient contextual glove feature extraction model on large textual databases. 793-802
- Shaikh Abdul Waheed, P. Sheik Abdul Khader, A. Abdul Azeez Khan, K. Javubar Sathick:
Feature extraction from behavioral styles of children for prediction of severity of stuttering using historical stuttering data. 803-815
- Yi Jiang, Erli Cheng, Yonghao Li, Yali Zhang:
Construction of complex environment speech signal communication system based on 5G and AI driven feature extraction techniques. 817-830
- V. Srinivasarao, Umesh Ghanekar:
A new double backward distributive weighted adaptive filtering approach for speech quality improvement. 831-836
- Ashok Kumar Konduru, J. L. Mazher Iqbal:
Handling high dimensional features by ensemble learning for emotion identification from speech signal. 837-851
- Xiao Ye, Xin Lv:
Data analysis framework for visual interactive product design under the background of cloud social speech environment. 853-862
- Kunyu Li, Xunxiang Li:
AI driven human-computer interaction design framework of virtual environment based on comprehensive semantic data analysis with feature extraction. 863-877
- Zong-Peng Kuo, Joy Iong-Zong Chen:
To deploy trained speech with DNN-LSTM framework for controlling a smart wheeled-robot in limited learning circumstance. 879-891
- Haidong Xu:
Intelligent automobile auxiliary propagation system based on speech recognition and AI driven feature extraction techniques. 893-905
- Dinesh Kumar Anguraj, J. Anitha, S. John Justin Thangaraj, L. Ramesh, Seetha Rama Krishna, D. Mythrayee:
Analysis of influencing features with spectral feature extraction and multi-class classification using deep neural network for speech recognition system. 907-920
- Lu Yang:
HTK-based speech recognition and corpus-based English vocabulary online guiding system. 921-931
- A. Kishore Kumar, Shefali Waldekar, Md. Sahidullah, Goutam Kumar Saha:
Robust acoustic domain identification with its application to speaker diarization. 933-945
- Rahul Kumar Jaiswal, Rajesh Kumar Dubey:
Non-intrusive speech quality assessment using context-aware neural networks. 947-965
- Yagnavajjula Madhu Keerthana, K. Sreenivasa Rao, Pabitra Mitra:
Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features. 967-973
- Tiemin Mei, Guorong He, Yandong Zhao, Jihan Dong:
Blind identification of the inverse of SIMO system and deconvolution with Kalman filter. 975-986
- Xuefei Wang, Yanhua Long, Dongxing Xu:
Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition. 987-995
- Adnan Gutub:
Integrity verification of Holy Quran verses recitation via incomplete watermarking authentication. 997-1011
- Chol-Jin Han, Un-Chol Ri, Song-Il Mun, Kang-Song Jang, Song-Yun Han:
An end-to-end TTS model with pronunciation predictor. 1013-1024
- Luciana Albuquerque, António J. S. Teixeira, Catarina Oliveira, Daniela Figueiredo:
Age and vowel classification improvement by the inclusion of vowel dynamic features. 1025-1040
- (Withdrawn) Speaker identification using hybrid neural network support vector machine classifier. 1041-1053