APSIPA 2017: Kuala Lumpur, Malaysia
- 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017. IEEE 2017, ISBN 978-1-5386-1542-3
- Chin-Hui Lee:
Keynote speech 1: An integrated deep learning approach to acoustic signal pre-processing and acoustic modeling with applications to robust automatic speech recognition. v-viii - Yan Chen, Chih-Yu Wang:
Tutorial 1: Sequential decision making: Theories and applications. ix-xii - Binh Trans:
Online learning in the Asia Pacific region. - Jie Yan, Lei Xie, Guangsen Wang, Zhong-Hua Fu:
A segmental DNN/i-vector approach for digit-prompted speaker verification. 1-5 - Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
Raw waveform-based speech enhancement by fully convolutional networks. 6-12 - Jian-Jiun Ding, Shiang-Chih Hua, Ronald Y. Chang, Yih-Cherng Lee:
Generalized atom and dictionary design and compressive sensing for vocal signal expansion. 13-18 - Chien-Yao Wang, Andri Santoso, Jia-Ching Wang:
Acoustic scene classification using self-determination convolutional neural network. 19-22 - I-Hsiang Wang, Jian-Jiun Ding, Hung-Wei Hsu:
Prediction techniques for wavelet based 1-D signal compression. 23-26 - Xiaoming Zhang, Hidetaka Aoki, Akiko Sato, Mohd Amin Abd Majid:
An empirical study on performance optimization at district cooling plant of Universiti Teknologi PETRONAS. 27-32 - Tomoya Sakai, Shun Ogawa, Hiroki Kuhara:
Sequential decomposition of 2D apparent motion fields based on low-rank and sparse approximation. 33-38 - Ettikan Kandasamy Karuppiah:
Internet of Things: Trend, technologies, and evolution. 37-38 - Lounell B. Gueta, Akiko Sato:
Classifying road surface conditions using vibration signals. 39-43 - Ryosuke Kawami, Hidetomo Kataoka, Daichi Kitahara, Akira Hirabayashi, Takashi Ijiri, Shigeharu Shimamura, Hiroshi Kikuchi, Tomoo Ushio:
Fast high-quality three-dimensional reconstruction from compressive observation of phased array weather radar. 44-49 - Akie Sakiyama, Yuichi Tanaka:
Graph reduction method using localization operator and its application to pyramid transform. 50-55 - Vui Ann Shim, Miaolong Yuan, Boon Hwa Tan:
Automatic object searching by a mobile robot with single RGB-D camera. 56-62 - Yan Wu, Ruohan Wang, Yong Ling Tay, Clarice Jiaying Wong:
Investigation on the roles of human and robot in collaborative storytelling. 63-68 - Gayane Shalunts, Gerhard Backfried, Helmy Syakh Alam:
Sentiment analysis in Indonesian and French by SentiSAIL. 69-75 - Luis Fernando D'Haro, Andreea I. Niculescu, Caixia Cai, Suraj Nair, Rafael E. Banchs, Alois C. Knoll, Haizhou Li:
An integrated framework for multimodal human-robot interaction. 76-82 - Andreea I. Niculescu, Luis Fernando D'Haro, Rafael E. Banchs:
When industrial robots become social: On the design and evaluation of a multimodal interface for welding robots. 83-89 - Xiao-Zhi Zhang, Ya Li, Bingo Wing-Kuen Ling, Chao Song, Kok Lay Teo:
Spread spectrum compressed sensing magnetic resonance imaging via fractional Fourier transform. 90-93 - Yi-Ping Bao, Yan-Na Zhang, Yu-E. Song, Bing-Zhao Li, Pei Dang:
Nonuniform sampling theorems for random signals in the offset linear canonical transform domain. 94-99 - Yi-Qian Wang, Bing-Zhao Li, Qi-Yuan Cheng:
The fractional Fourier transform on graphs. 105-110 - Aykut Koç, Haldun M. Özaktas, Burak Bartan, Erhan Gundogdu, Tolga Çukur:
Digital computation of fractional Fourier and linear canonical transforms and sparse image representation. 111-117 - Iman Tabatabaei Ardekani, Xiao Zhang, Hamid R. Sharifzadeh, Jari P. Kaipio:
Maximum a posteriori adjustment of adaptive transversal filters in active noise control. 118-123 - Masato Nakayama, Takanobu Nishiura:
Synchronized amplitude-and-frequency modulation for a parametric loudspeaker. 130-135 - Tomoki Murata, Yoshinobu Kajikawa, Seiji Miyoshi:
Statistical-mechanical analysis of the FXLMS algorithm for multiple-channel active noise control. 136-139 - Michael Anthony, Cheng-Yuan Chang, Sen M. Kuo:
Active noise control for muffler. 140-144 - Nan Chen, Changchun Bao, Xianyun Wang:
Speech enhancement based on binaural cues. 145-148 - Yan Yang, Changchun Bao, Xianyun Wang:
Codebook-driven speech enhancement using DNN and harmonic emphasis. 149-154 - Xin Wang, Jun Du, Yannan Wang:
A maximum likelihood approach to deep neural network based speech dereverberation. 155-158 - Tohari Ahmad, Burhanudin Rasyid:
SCFT: Sector-based cancelable fingerprint template. 156-160 - Xiao-Lei Zhang:
Speech separation by cost-sensitive deep learning. 159-162 - Shasha Xia, Hao Li, Xueliang Zhang:
Using optimal ratio mask as training target for supervised speech separation. 163-166 - Minghui Dong, Zhengchen Zhang, Huaiping Ming:
Representing raw linguistic information in Chinese text-to-speech system. 167-170 - Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
An end-to-end neural network approach to story segmentation. 171-176 - Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng:
Deep speaker verification: Do we need end to end? 177-181 - Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Hiroyasu Ando, Kaoru Hiramatsu, Kunio Kashino:
Non-native speech conversion with consistency-aware recursive network and generative adversarial network. 182-188 - Sivanagaraja Tatinati, Mun Kit Ho, Andy W. H. Khong, Yubo Wang:
End-to-end speech emotion recognition using multi-scale convolution networks. 189-192 - Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki:
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification. 193-202 - Anu Aryal, Shoko Imaizumi, Takahiko Horiuchi, Hitoshi Kiya:
Integrated algorithm for block-permutation-based encryption with reversible data hiding. 203-208 - Simying Ong, KokSheik Wong, Kiyoshi Tanaka:
Redesigning data hiding: Interpolation-based scrambling-embedding method. 209-213 - KuanYew Tan, KokSheik Wong, Simying Ong, Kiyoshi Tanaka:
Rewritable data insertion in encrypted JPEG using coefficient prediction method. 214-219 - Koichi Ito, Takehisa Okano, Takafumi Aoki:
Recent advances in biometric security: A case study of liveness detection in face recognition. 220-227 - Meng Yang, Nanning Zheng, Fei Wang, Ce Zhu:
A new bilateral filter for post-removing the noise of synthesis view in 3D video. 228-231 - Hongsheng Liu, Baozhu Guo, Zhizhong Fu, Xiaofeng Li:
A new active contour model based on complexity of textures for segmentation of natural image. 232-236 - Yifan Zhang, Ting Wang, Renjie He, Mingyi He:
Subpixel mapping of hyperspectral images with hybrid endmember library and optimized abundances. 237-241 - Yifan Zhang, Tuo Zhao, Mingyi He:
Hyperspectral and multispectral image fusion using local spatial-spectral dictionary pair. 242-246 - Cho-Ying Wu, Jian-Jiun Ding:
A fast non-convex regularizer for low rank matrix completion. 247-250 - Chia-Wei Wang, Tzu-Chieh Yang, Sheng-Ho Chiang, Tsaipei Wang:
Identifying and filling occlusion holes on planar surfaces for 3-D scene editing. 251-254 - Wisarut Chantara, Yo-Sung Ho:
Initial depth estimation using EPIs and structure tensor. 255-258 - Guiqing He, Siyuan Xing, Dandan Dong, Ximei Zhao:
Panchromatic and multi-spectral image fusion method based on two-step sparse representation and wavelet transform. 259-262 - Yuma Kinoshita, Taichi Yoshida, Sayaka Shiota, Hitoshi Kiya:
Pseudo multi-exposure fusion using a single image. 263-269 - Wen-Nung Lie, Chih-Hao Hu, Yi-Kai Chen, Jui-Chiu Chiang:
Multi-layer background sprite model for 2D-to-3D video conversion. 270-274 - Chao Zhang, Ce Zhu, Yipeng Liu, Hongdiao Wen, Zhengtao Wang:
Image ordinal estimation: Classification and regression benefit each other. 275-278 - Yusaku Akiyoshi, Taichi Sumi, Yoshimitsu Kuroki:
Dictionary design and disparity interpolation on distributed compressed sensing for light field image. 279-282 - Kwan-Jung Oh, Minsik Park, Jinwoong Kim:
Digital hologram data representation method. 283-286 - Yufei Zhao, Zhizhong Fu, Jin Xu, Linghua Mao:
Image fusion algorithm based on gradient similarity filter. 287-291 - Manoj Ramanathan, Wei-Yun Yau, Eam Khwang Teoh, Nadia Magnenat-Thalmann:
Pose-invariant kinematic features for action recognition. 292-299 - Tingtian Li, Daniel Pak-Kong Lun:
Salient object detection using array images. 300-303 - Jia Du, Wei Xiong, Wenyu Chen, Jierong Cheng, Ying Gu:
Accurate subset selection for pose estimation from uncertain points and lines. 304-308 - Xin Rong Soh, Vishnu Monn Baskaran, Adamu Muhammad Buhari, Raphael C.-W. Phan:
A real time micro-expression detection system with LBP-TOP on a many-core processor. 309-315 - Ryo Miyagi, Masaki Aono:
Sliced voxel representations with LSTM and CNN for 3D shape recognition. 320-323 - Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan:
Localization of harmonic source using a single moving sensor of known trajectory. 324-328 - Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan:
Grid-free compressive beamforming using a single moving sensor of known trajectory. 329-332 - Suraj Kumar Nayak, Karan Pande, Pratyush Kumar Patnaik, Shikshya Nayak, Shankar J. Patel, Arfat Anis, Anilesh Dey, Kunal Pal:
Understanding the effect of cannabis abuse on the ANS and cardiac physiology of the Indian women paddy-field workers using RR interval and ECG signal analyses. 333-341 - Phuttapong Sertsi, Surasak Boonkla, Vataya Chunwijitra, Nattapong Kurpukdee, Chai Wutiwiwatchai:
Robust voice activity detection based on LSTM recurrent neural networks and modulation spectrum. 342-346 - Yu-Siang Huang, Szu-Yu Chou, Yi-Hsuan Yang:
Music thumbnailing via neural attention modeling of music emotion. 347-350 - Shohei Mori, Hideo Saito:
Augmented visualization: Observing as desired. 351-356 - Kazuhisa Yamagishi:
QoE-estimation models for video streaming services. 357-363 - Kazuo Sugimoto, Robert A. Cohen, Dong Tian, Anthony Vetro:
Trends in efficient representation of 3D point clouds. 364-369 - Yohei Kawaguchi, Ryoichi Takashima, Takashi Endo, Masahito Togami:
Time-domain subsampling and reconstruction for microphone array. 370-374 - Yiqi Tew, Tiong Yew Tang, Yoon-Ket Lee:
A study on enhanced educational platform with adaptive sensing devices using IoT features. 375-379 - Yoon-Ket Lee, Jay Ming Lim, Kok Seng Eu, Yeh Huann Goh, Yiqi Tew:
Real time image processing based obstacle avoidance and navigation system for autonomous wheelchair application. 380-385 - Jian Han Lim, Eng Yeow Teh, Ming Han Geh, Chern Hong Lim:
Automated classroom monitoring with connected visioning system. 386-393 - Xin Li, Xueting Wei, Wei Zhou, Zhemin Duan:
Techniques for overheating detection and sensor allocation in a real dual-core processor. 394-400 - Jun Li, Keng Peng Tee, Lawrence Chen, Kong-Wah Wan, Wei-Yun Yau:
A perception system for robot arms to convey objects to in-car passengers. 401-408 - Yi Feng, Zhifeng Huang, Yun Zhang:
Motion planning of a 6-Dofs robot arm for bandaging nursing task. 409-413 - Jiadong Wang, Wenjuan Ouyang, Wenchao Gao, Qinyuan Ren:
Locomotion control of a serpentine crawling robot inspired by central pattern generators. 414-419 - Nicola Catenacci Volpi, Yan Wu, Dimitri Ognibene:
Towards event-based MCTS for autonomous cars. 420-427 - Yuya Chiba, Takashi Nose, Akinori Ito:
Analysis of efficient multimodal features for estimating user's willingness to talk: Comparison of human-machine and human-human dialog. 428-431 - Xia Bai, Jiatong Han, Juan Zhao:
Sparse-based disturbance cancellation approach for passive radar. 432-436 - Juan Zhao, Xia Bai:
An improved orthogonal matching pursuit based on randomly enhanced adaptive subspace pursuit. 437-441 - Shiori Mikami, Arata Kawamura, Youji Iiguni:
Residual drum sound estimation for RPCA singing voice extraction. 442-446 - Hyeonggwon Kim, Yoonsik Choe:
Background subtraction via truncated nuclear norm minimization. 447-451 - Yohei Kawaguchi, Sandra Ramaswami, Ryoichi Takashima, Takashi Endo, Rintaro Ikeshita:
Sub-Nyquist non-uniform sampling for low-cost sound monitoring. 452-456 - Valiantsin Belyi, Woon-Seng Gan:
Psychoacoustic subband active noise control algorithm. 457-463 - Shun Hirose, Yoshinobu Kajikawa:
Effectiveness of headrest ANC system with virtual sensing technique for factory noise. 464-468 - Dong-Yuan Shi, Chuang Shi, Woon-Seng Gan:
Effect of the audio amplifier's distortion on feedforward active noise control. 469-473 - Caixia Lu, Feiran Yang, Jun Yang:
A frequency-domain adaptive feedback cancellation algorithm based on convex combination. 474-477 - Kouei Yamaoka, Nobutaka Ono, Shoji Makino, Takeshi Yamada:
Abnormal sound detection by two microphones using virtual microphone technique. 478-482 - Feng Bao, Waleed H. Abdulla:
Signal power estimation based on convex optimization for speech enhancement. 483-487 - Yanhui Tu, Jun Du, Lei Sun, Chin-Hui Lee:
LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement. 488-491 - Zexin Liu, Heather T. Ma, Fei Chen:
A new data-driven band-weighting function for predicting the intelligibility of noise-suppressed speech. 492-496 - Miao Zhang, Yixiang Chen, Lantian Li, Dong Wang:
Speaker recognition with cough, laugh and "Wei". 497-501 - Hosana Kamiyama, Atsushi Ando, Satoshi Kobashikawa, Yushi Aono:
Robust children and adults speech identification and confidence measure based on DNN posteriorgram. 502-505 - Feng Li, Huihui Bai, Yao Zhao:
Visual attention guided eye movements for 360 degree images. 506-511 - Cairong Xing, Anhong Wang, Suyue Li, Peihao Li, Jing Zhang:
Random aliasing modulation with decision-directed demodulation. 512-515 - Chang Duan, Yuhuan Shen, Yingying Zhang, Shuai Wang, Ce Zhu, Meng Yang:
Enhancing wedgelet-based depth modeling in 3D-HEVC. 516-519 - Xiaoqiang Cao, Ce Zhu, Minjie Yang, Yongbing Lin, Jianhua Zheng:
A new intra prediction method based on consistent luminance changes. 520-523 - Szu-Wei Fu, Jian-Jiun Ding, Ying-Wun Huang, Ching-Wen Hsiao, Hsin-Hui Chen:
Collagen image compression using the JPEG-based predictive lossless coding scheme. 524-533 - Sze-Teng Liong, KokSheik Wong:
Micro-expression recognition using apex frame with phase information. 534-537 - Jierong Cheng, Wei Xiong, Jia Du, Wenyu Chen, Ying Gu:
Detection of meaningful line segment configurations. 538-541 - Jinyoung Jang, Dong-Won Shin, Yo-Sung Ho:
Disparity map refinement method using coarse-to-fine image segmentation. 542-545 - Dong-Won Shin, Yo-Sung Ho:
Local patch descriptor using deep convolutional generative adversarial network for loop closure detection in SLAM. 546-549 - Chen Chen, Shangwen Li, Xiang Fu, Yuzhuo Ren, Yueru Chen, C.-C. Jay Kuo:
Exploring confusing scene classes for the places dataset: Insights and solutions. 550-558 - Nirmesh J. Shah, Hemant A. Patil:
On the convergence of INCA algorithm. 559-562 - Maulik C. Madhavi, Hemant A. Patil:
Combining evidences from detection sources for query-by-example spoken term detection. 563-568 - Yuanjun Zhao, Roberto Togneri, Victor Sreeram:
Compressed high dimensional features for speaker spoofing detection. 569-572 - Vishnu Vidyadhara Raju Vegesna, Hari Krishna Vydana, Suryakanth V. Gangashetty, Anil Kumar Vuppala:
Importance of non-uniform prosody modification for speech recognition in emotion conditions. 573-576 - Chitralekha Gupta, Haizhou Li, Ye Wang:
Perceptual evaluation of singing quality. 577-586 - Kishin Migimatsu, Takuya Wakazono, Isao T. Tokuda:
Experimental study on source-filter interaction using physical model of the vocal folds. 587-590 - Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao, Hsin-Min Wang:
Fast locally linear embedding algorithm for exemplar-based voice conversion. 591-595 - Shengke Lin, Takashi Tsunakawa, Masafumi Nishida, Masafumi Nishimura:
DNN-based feature transformation for speech recognition using throat microphone. 596-599 - Hitoshi Yamamoto, Koji Okabe, Takafumi Koshinaka:
Robust i-vector extraction tightly coupled with voice activity detection using deep neural networks. 600-604 - Chen-Yen Lai, Yu-Wen Lo, Yih-Liang Shen, Tai-Shih Chi:
Plastic multi-resolution auditory model based neural network for speech enhancement. 605-609 - Kazuho Morikawa, Tomoki Toda:
Electrolaryngeal speech modification towards singing aid system for laryngectomees. 610-613 - Peixin Chen, Wu Guo, Qingnan Wang, Yan Song:
Topic classification based on distributed document representation and latent topic information. 614-617 - Michael Hentschel, Atsunori Ogawa, Marc Delcroix, Tomohiro Nakatani, Yuji Matsumoto:
Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs. 618-621 - Junfeng Hou, Shiliang Zhang, Li-Rong Dai, Hui Jiang:
Feedforward sequential memory networks based encoder-decoder model for machine translation. 622-625 - Yu Chen, Yanting Chen, Hua Lin, Jie Hou, Yutong Xing, Jianwu Dang:
A study of high level tone in standard Chinese produced by prelingually deaf adults. 626-629 - Hao Zhang, Nan Yan, Lan Wang, Manwa L. Ng:
Energy distribution analysis and nonlinear dynamical analysis of phonation in patients with Parkinson's disease. 630-635 - Chuanying Niu, Jinsong Zhang, Xuesong Yang, Yanlu Xie:
A study on landmark detection based on CTC and its application to pronunciation error detection. 636-640 - Chin-Hong Shih, Bi-Cheng Yan, Shih-Hung Liu, Berlin Chen:
Investigating Siamese LSTM networks for text categorization. 641-646 - Yu Chen, Jie Hou, Yutong Xing, Yanting Chen, Hua Lin, Jianwu Dang:
The acoustic characteristics of tone 3 in standard Chinese produced by prelingually deaf adults. 647-650 - Nina Zhou, Xuancong Wang, AiTi Aw:
Dynamic boundary detection for speech translation. 651-656 - Hung-Wei Hsu, Jian-Jiun Ding:
FasterMDNet: Learning model adaptation by RNN in tracking-by-detection based visual tracking. 657-660 - Zheng-Teng Zhang, Chia-Hung Yeh, Li-Wei Kang, Min-Hui Lin:
Efficient CTU-based intra frame coding for HEVC based on deep learning. 661-664 - Zun-Ci Lee, Raphael C.-W. Phan, Su-Wei Tan, Kuan-Heng Lee:
Multimodal decomposition for enhanced subtle emotion recognition. 665-671 - Jing-Ming Guo, S. Sankarasrinivasan:
Enhanced block truncation coding image using digital multitone screen. 672-676 - Gayane Shalunts, Martin Cerman, Daniel Albertini:
Detection of sculpted faces on building facades. 677-685 - Yueru Chen, Pranav Aggarwal, Jongmoo Choi, C.-C. Jay Kuo:
A deep learning approach to drone monitoring. 686-691 - Guan-Ting Lin, Patrisia Sherryl Santoso, Che-Tsung Lin, Chia-Chi Tsai, Jiun-In Guo:
Stop line detection and distance measurement for road intersection based on deep learning neural network. 692-695 - Takuya Araki, Yuichi Nakamura:
Future trend of deep learning frameworks - From the perspective of big data analytics and HPC. 696-703 - Nam Kyun Kim, Jiwon Lee, Hun Kyu Ha, Geon Woo Lee, Jung Hyuk Lee, Hong Kook Kim:
Speech emotion recognition based on multi-task learning using a convolutional neural network. 704-707 - Jonghee Kim, Jinsu Kim, Seokeon Choi, Muhammad Abul Hasan, Changick Kim:
Robust template matching using scale-adaptive deep convolutional features. 708-711 - Sung-Phil Kim, Jae-Hwan Kang, Young Chang Jo, Ian Oakley:
Development of a multi-modal personal authentication interface. 712-715 - Shohei Ogai, Toshihisa Tanaka:
A drag-and-drop type human computer interaction technique based on electrooculogram. 716-720 - Jaeyoung Shin, Klaus-Robert Müller, Han-Jeong Hwang:
Hybrid EEG-NIRS brain-computer interface under eyes-closed condition. 721-723 - Sunghan Lee, Hohyun Cho, Sung Chan Jun:
Simultaneous bio-signal measurement system for multiple users - development and validation. 724-727 - Xiaoling Wu, Shuhua Gao, Dong-Yan Huang, Cheng Xiang:
Voichap: A standalone real-time voice change application on iOS platform. 728-732 - Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Shipeng Xu:
Free linguistic and speech resources for Tibetan. 733-736 - Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng:
A multilingual language processing tool for Uyghur, Kazak and Kirghiz. 737-740 - Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng, Guanyu Li, Gegeentana:
Language resource construction for Mongolian. 741-744 - Ying Shi, Askar Hamdullah, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng:
A free Kazakh speech database and a speech recognition baseline. 745-748 - Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen:
AP17-OLR challenge: Data, plan, and baseline. 749-753 - Yeun Lok Lin, Ngai-Fong Law, Chi-Wai Do:
Portable vision screenings system. 754-759 - Kin-On Cheng, Ngai-Fong Law, Wan-Chi Siu:
Compressing population DNA sequences using multiple reference sequences. 760-764 - Yun-Xia Liu, Yang Yang, Yuehui Chen:
Lung sound classification based on Hilbert-Huang transform features and multilayer perceptron network. 765-768 - Yun-Xia Liu, Yang Yang, Yuehui Chen:
Automatic detection of circulating tumor cells based on microscopic images. 769-773 - Shengyan Li, Bin Li, Shixiong Zhang, Hong Fu, Wai-Lun Lo, Jie Yu, Cindy H. P. Sit, Ruimin Li:
A markerless visual-motor tracking system for behavior monitoring in DCD assessment. 774-777 - Lei Wang, Zexin Liu, Fei Chen:
Perceptual roles of temporal and segmentation cues in single-channel noise reduction processing. 778-785 - Alan Kan:
Improving speech intelligibility for bilateral cochlear implant users using Wiener filters and its impact on cognitive load. 786-792 - Jing Chen, Zhen Fu, Xiuyong Ding, Jiping Wu, Xihong Wu:
Electrically-evoked frequency following responses (EFFRs) and electrically-evoked auditory brainstem responses (EABRs) in guinea pigs. 793-802 - Rebecca E. Millman, Michael A. Stone, Chin-Tuan Tan:
Objective neurophysiological assessment for sound quality perception by hearing-impaired listeners. 803-807 - Syu-Siang Wang, Yu Tsao, Hsiao-Lan Sharon Wang, Ying-Hui Lai, Lieber Po-Hung Li:
A deep learning based noise reduction approach to improve speech intelligibility for cochlear implant recipients in the presence of competing speech noise. 808-812 - Chih-Chiang Chen, Shang-Ho Lawrence Tsai, Yuan-Pei Lin, Chia-Hua Lin:
Resource allocation and minimum rate for precoded non-orthogonal multiple access. 813-817 - Tzu-Chiao Lin, See-May Phoong:
MSE-optimized CP-based CFO estimation in OFDM systems over multipath channels. 818-822 - Y.-W. Peter Hong, An-An Lee, Yu-An Chen:
Successive MMSE group decoding and max-min power control for uplink multicell NOMA systems under pilot contamination. 823-831 - Syu-Siang Long, Pei-Yun Tsai, Yuan-Hao Huang, I-Wei Lai:
Trellis coded generalized spatial modulation with spatial multiplexing. 832-837 - Chia-Yang Mei, Wan-Jen Huang:
Low-complexity zero-forcing detector for large-scale MIMO-OFDM systems. 838-841 - Yanhong Wu, Xiaolong Li, Yao Zhao, Rongrong Ni:
A new detector for JPEG decompressed bitmap identification. 842-845 - Omer Hemida, Yaoran Huo, Fan Chen, Hongjie He:
Block-DCT based alterable-coding restorable fragile watermarking scheme with superior localization. 846-851 - Kenta Iida, Hitoshi Kiya:
Robust image identification without any visible information for double-compressed JPEG images. 852-857 - Tatsuya Chuman, Kenta Iida, Hitoshi Kiya:
Image manipulation on social media for encryption-then-compression systems. 858-863 - Liuying Sun, Anthony T. S. Ho, Zhe Xia, Jiageng Chen, Xuzhe Huang, Yidan Zhang:
Detection and classification of malicious patterns in network traffic using Benford's law. 864-872 - John Håkon Husøy:
On the selection and design of filter banks in normalised subband adaptive filters (NSAF). 877-883 - Li Su:
Between homomorphic signal processing and deep neural networks: Constructing deep algorithms for polyphonic music transcription. 884-891 - Soichiro Aoki, Hiroki Tanji, Takahiro Murakami:
Array shape calibration using near field pilot sources with unknown distance. 892-896 - Xiangyuan Li, Cheng Cai, Jinrong He:
Density-based multi-manifold ISOMAP for data classification. 897-903 - Tomoya Wada, Toshihisa Tanaka:
Doubly adaptive kernel filtering. 904-909 - Kenzo Yamamoto, Kenji Suyama:
Active enumeration of local minima for IIR filter design using PSO. 910-917 - Chung-Nan Lee, Sheng-Wei Chu:
A fairness aware and resource reuse algorithm for LTE layered video multicast service. 918-925 - Wei Zhang, Lixia Hao, Yanlu Xie, Jinsong Zhang:
A study on quantitative computation for prosodic strength of Mandarin speech. 926-930 - Huiyong Li, Zihui Luo, Julan Xie, Jun Li:
Joint estimation of signal and mutual coupling parameters based on spatially spread polarization sensitive array. 931-937 - Shuichi Ohno, M. Rizwan Tariq, Masaaki Nagahara:
Min-max IIR filter design for feedback quantizers. 938-942 - Ayano Nakai, Kazunori Hayashi:
Diffusion LMS using consensus propagation. 943-948 - Bandhit Suksiri, Masahiro Fukumoto:
Enhanced array manifold matrices for L-shaped microphone array-based 2-D DOA estimation. 955-960 - Chisa Kodama, Kunihito Kato, Satoshi Tamura, Satoru Hayamizu:
Swallowing function evaluation using deep-learning-based acoustic signal processing. 961-964 - Xiaobai Chen, Jinlong Xu, Zhiyi Yu:
A fast and energy efficient FPGA-based system for real-time object tracking. 965-968 - Yuuki Saito, Akira Tanaka:
Optimal kernel in kernel regression problems with autocorrelation prior. 969-972 - Tatsuya Yokota, Hidekata Hontani:
An efficient method for adapting step-size parameters of primal-dual hybrid gradient method in application to total variation regularization. 973-979 - Surasak Boonkla, Masashi Unoki, Chai Wutiwiwatchai, Stanislav S. Makhanov:
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments. 980-986 - Andre McDonald, Anton van Wyk:
Construction of semi-Markov ergodic maps with selectable spectral characteristics via the solution of the inverse eigenvalue problem. 987-993 - Yujia Lu, Kazunori Hayashi:
A new pool control method for Boolean compressed sensing based adaptive group testing. 994-999 - Samad S. Kolahi, Bashar Barmada, Keysha Mudaliar:
Defence mechanisms evaluation against RA flood attacks for Linux-victim node. 1000-1005 - Tatsuya Kawahara:
Automatic meeting transcription system for the Japanese parliament (Diet). 1006-1010 - Zhenzhen Wang, Jingjing Meng, Tan Yu, Junsong Yuan:
Common visual pattern discovery and search. 1011-1018 - Anthony Kuh, Muhammad Sharif Uddin, Phyllis Ng:
Online unsupervised kernel learning algorithms. 1019-1025 - Shiuan Huang, Hsueh-Ming Hang:
Multi-query image retrieval using CNN and SIFT features. 1026-1034 - Lilei Zheng, Ying Zhang, Vrizlynn L. L. Thing:
Understanding multi-layer perceptrons on spatial image steganalysis features. 1035-1039 - Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng:
Cross-lingual speaker verification with deep feature learning. 1040-1044 - Shinya Takamaeda-Yamazaki, Kodai Ueyoshi, Kota Ando, Ryota Uematsu, Kazutoshi Hirose, Masayuki Ikebe, Tetsuya Asai, Masato Motomura:
Accelerating deep learning by binarized hardware. 1045-1051 - Hao Xu, Yueru Chen, Ruiyuan Lin, C.-C. Jay Kuo:
Understanding CNN via deep features analysis. 1052-1060 - Yuan-Fu Li, Chia-Chi Tsai, Yi-Ting Lai, Jiun-In Guo:
A multiple-lane vehicle tracking method for forward collision warning system applications. 1061-1064 - Mehrdad Babazadeh, Sokratis Kartakis, Julie A. McCann:
Highly-distributed sensor processing using IoT for critical infrastructure monitoring. 1065-1074 - Kuan-Chung Wang, Yoga Dwi Pranata, Jia-Ching Wang:
Automatic vehicle classification using center strengthened convolutional neural network. 1075-1078 - Cong Lai, Wen Luo, Shiqiang Chen, Qinhua Li, Qingyu Yang, Hongbin Sun, Nanning Zheng:
Zynq-based full HD around view monitor system for intelligent vehicle. 1079-1082 - Huan-Rui Chang, Hsueh-Ming Hang:
Wide angle virtual view synthesis using two-by-two Kinect V2. 1083-1091 - Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla:
Memory-augmented Chinese-Uyghur neural machine translation. 1092-1096 - Elok Cahyaningtyas, Dhany Arifianto:
Development of under-resourced Bahasa Indonesia speech corpus. 1097-1101 - Anocha Rugchatjaroen, Sittipong Saychum, Keiichiro Oura, Keiichi Tokuda:
Generalization of Thai tone contour in HMM-based speech synthesis. 1102-1105 - Aijun Li, Gongping Wang:
The longitudinal development of focus duration of Korean Chinese learners. 1106-1114 - Hankiz Yilahun, Aynur Nurtay, Askar Hamdulla:
Patterns of vowels in Uyghur Tri-syllabic words. 1115-1122 - Ismail M. El-Badawy, Ashraf M. Aziz, Zaid Bin Omar, M. B. Malarvili:
Correlation between different DNA period-3 signals: An analytical study for exons prediction. 1123-1128 - Nam H. Le, Khang N. Nguyen, Hien M. Nguyen:
Comparison analysis of ICA versus MCA-KSVD blind source separation on task-related fMRI data. 1129-1135 - Yudai Suzuki, Keigo Kawaji, Amit R. Patel, Satoshi Tamura, Satoru Hayamizu:
Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning. 1136-1139 - Satoshi Ito:
Compressed sensing reconstruction of MR phase-varied images using multi-scale complex sparsifying transform. 1140-1143 - Masara Yamashita, Tasuku Miura, Shoichi Matsunaga:
Distinction between healthy individuals and patients with confident abnormal respiration. 1144-1147 - Jie Chen, Yun Ni, Junhui Hou, Lap-Pui Chau:
Light field scene flow with occlusion regularization. 1148-1151 - Yu Zhou, Sam Kwong, Junhui Hou:
Single image superresolution by multiple geometrical regressors. 1152-1155 - Chia-Chun Hsu, Jian-Jiun Ding, Yih-Cherng Lee:
Efficient edge-oriented based image interpolation algorithm for non-integer scaling factor. 1156-1159 - Wei-Ting Lu, Chien-Wei Lin, Chih-Hung Kuo, Ying-Chan Tung:
Image super-resolution based on error compensation with convolutional neural network. 1160-1163 - Qinhui Fan, Hongsheng Liu, Zhizhong Fu, Xiaofeng Li:
Exemplar-based image inpainting based on pixel inhomogeneity factor. 1164-1168 - Yusuke Sugawara, Sayaka Shiota, Hitoshi Kiya:
A parallel computation algorithm for super-resolution methods using convolutional neural networks. 1169-1173 - Guiqing He, Dandan Dong, Siyuan Xing, Ximei Zhao:
Infrared and visible image fusion based on innovation feature simultaneous decomposition. 1174-1177 - Takuro Yamaguchi, Masaaki Ikehara:
Joint bilateral based image denoising using multi-sized 2D hard threshold. 1178-1181 - ShuMin Liu, Jiaxuan Zhang, Jiajia Chen:
Multi-focus image fusion using Gaussian filter and dynamic programming. 1182-1185 - Hyewon Song, Doyoung Kim, Hyuck-Joo Kwon, Sanghoon Lee:
Natural scene statistics based publication classification algorithm using convolutional neural network. 1186-1189 - LieLin Pang, KokSheik Wong, Sze-Teng Liong:
Data embedding in scalable coded video. 1190-1194 - Kaavya Sriskandaraja, Gajan Suthokumar, Vidhyasaharan Sethu, Eliathamby Ambikairajah:
Investigating the use of scattering coefficients for replay attack detection. 1195-1198 - Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi:
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room. 1199-1204 - KokSheik Wong, Hitoshi Kiya:
Reversible data hiding for compression-friendly image encryption method. 1205-1209 - Kai Liu, Xuan Li, Qiong Zhang, Xiangui Kang:
Multi-channel neural network for steganalysis. 1210-1213 - Qingnan Wang, Wu Guo, Peixin Chen, Yan Song:
Tibetan-Mandarin bilingual speech recognition based on end-to-end framework. 1214-1217 - Huang Chen, Shiliang Zhang, Junfeng Hou, Lirong Dai:
Learning the number of nodes in DNNs with activation mask. 1218-1221 - Hiromitsu Nishizaki:
Data augmentation and feature extraction using variational autoencoder for acoustic modeling. 1222-1227 - Hitoshi Ito, Aiko Hagiwara, Manon Ichiki, Takeshi Mishima, Shoei Sato, Akio Kobayashi:
End-to-end speech recognition for languages with ideographic characters. 1228-1232 - Yuki Yasui, Nakamasa Inoue, Koji Iwano, Koichi Shinoda:
Multimodal speech recognition using mouth images from depth camera. 1233-1236 - Yen-Ting Lin, Chen-Yu Chiang:
Deep learning-based speaking rate-dependent hierarchical prosodic model for Mandarin TTS. 1237-1242 - Akira Sasou:
Automatic identification of pathological voice quality based on the GRBAS categorization. 1243-1247 - Kengo Ohta, Rikito Marumoto, Ryota Nishimura, Norihide Kitaoka:
Selecting type of response for chat-like spoken dialogue systems based on acoustic features of user utterances. 1248-1252 - Katsuki Inoue, Sunao Hara, Masanobu Abe, Nobukatsu Hojo, Yusuke Ijima:
An investigation to transplant emotional expressions in DNN-based TTS synthesis. 1253-1258 - Gaku Kotani, Daisuke Saito, Nobuaki Minematsu:
Voice conversion based on deep neural networks for time-variant linear transformations. 1259-1262 - Hiroto Ashikawa, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa:
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations. 1263-1267 - Yanfeng Lu, Chenyu Yang, Minghui Dong:
Word level prosody prediction using large audiobook dataset. 1268-1273 - Patrick Lumban Tobing, Hirokazu Kameoka, Tomoki Toda:
Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling. 1274-1277 - Hyungjun Lim, Younggwan Kim, Yoonhoe Kim, Hoirin Kim:
CNN-based bottleneck feature for noise robust query-by-example spoken term detection. 1278-1281 - Chun-Ting Huang, Yueru Chen, Ruiyuan Lin, C.-C. Jay Kuo:
Age/gender classification with whole-component convolutional neural networks (WC-CNN). 1282-1285 - Jing Zhang, Yuchao Dai, Fatih Porikli, Mingyi He:
Multi-scale salient object detection with pyramid spatial pooling. 1286-1291 - Zeng Peng, Cheng Cai:
An effective segmentation algorithm of apple watercore disease region using fully convolutional neural networks. 1292-1299 - Pei Chee Yong, Kit Yan Chan, Sven Nordholm:
Utilizing neural network and critical band processing for speech enhancement. 1300-1303 - Conggui Liu, Nakamasa Inoue, Koichi Shinoda:
A unified network for multi-speaker speech recognition with multi-channel recordings. 1304-1307 - Shinnosuke Takamichi:
Modulation spectrum-based speech parameter trajectory smoothing for DNN-based speech synthesis using FFT spectra. 1308-1311 - Takeshi Hori, Kazuyuki Nakamura, Shigeki Sagayama:
Music chord recognition from audio data using bidirectional encoder-decoder LSTMs. 1312-1315 - Keisuke Imoto, Nobutaka Ono, Masahiro Niitsuma, Yoichi Yamashita:
Online sound structure analysis based on generative model of acoustic feature sequences. 1316-1321 - Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in Georgian inspired by distinctive feature theory. 1322-1327 - Sunao Hara, Asako Hatakeyama, Shota Kobayashi, Masanobu Abe:
Sound sensing using smartphones as a crowdsourcing approach. 1328-1333 - Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda:
An investigation of recurrent neural network for daily activity recognition using multi-modal signals. 1334-1340 - Tatsuya Komatsu, Masahiro Tani, Takahiro Toizumi, Chaitanya Narisetty, Masanori Kato, Yumi Arai, Osamu Hoshuyama, Yuzo Senda, Reishi Kondo:
An acoustic monitoring system and its field trials. 1341-1346 - Tin Lay Nwe, Tran Huy Dat, Bin Ma:
Convolutional neural network with multi-task learning scheme for acoustic scene classification. 1347-1350 - Danqing Luo, Yuexian Zou, Dongyan Huang:
Speech emotion recognition via ensembling neural networks. 1351-1355 - Yuanchao Li, Carlos Toshinori Ishi, Nigel G. Ward, Koji Inoue, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara:
Emotion recognition by combining prosody and sentiment analysis for expressing reactive emotion by humanoid robot. 1356-1359 - Ming Li, Luting Wang, Zhicheng Xu, Danwei Cai:
Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization. 1360-1363 - Rafael E. Banchs:
On the construction of more human-like chatbots: Affect and emotion analysis of movie dialogue data. 1364-1367 - Wan Ding, Dong-Yan Huang, Zhuo Chen, Xinguo Yu, Weisi Lin:
Facial action recognition using very deep networks for highly imbalanced class distribution. 1368-1372 - Felix Albu, Linh Thi Thuc Tran, Sven Nordholm:
A combined variable step size strategy for two microphones acoustic feedback cancellation using proportionate algorithms. 1373-1377 - Feiran Yang, Jun Yang:
A fast affine projection algorithm based on a modified Toeplitz matrix. 1378-1381 - Ryo Takehara, Arata Kawamura, Youji Iiguni:
Impulsive noise suppression using interpolated zero phase signal. 1382-1389 - Hala As'ad, Martin Bouchard, A. Homayoun Kamkar-Parsi:
Binaural beamforming with spatial cues preservation for hearing aids in real-life complex acoustic environments. 1390-1399 - Kiyoshi Nishikawa, Kan Okubo, Yuta Katori, Nobunao Takeuchi:
Application of mean-shift clustering for removing flux trapping noise from geomagnetic field signals measured using HTS-SQUID magnetometers. 1400-1405 - Mohammad Mogharen Askarin, KokSheik Wong, Raphael C.-W. Phan:
Reduced contact lifting of latent fingerprint. 1406-1410 - Kong-Yik Chee, Zhe Jin, Wun-She Yap, Bok-Min Goi:
Two-dimensional winner-takes-all hashing in template protection based on fingerprint and voice feature level fusion. 1411-1419 - Tatsunori Itakura, Toshihisa Tanaka:
Epileptic focus localization based on bivariate empirical mode decomposition and entropy. 1426-1429 - Anand Kumar Mukhopadhyay, Indrajit Chakrabarti, Mrigank Sharad:
Real-time digitized neural-spike storage scheme in multiple channels for biomedical applications. 1430-1435 - Jae Woong Soh, Hyun-Seung Lee, Nam Ik Cho:
An image compression algorithm based on the Karhunen Loève transform. 1436-1439 - Ji-Sang Bae, Jong-Ok Kim:
A rail detection algorithm based on pair particles filtering. 1440-1443 - Eunpil Park, Jae-Young Sim:
Gradient-based contrast enhancement and color correction for underwater images. 1444-1447 - Ji-Eun Lee, Min-Joo Kang, Je-Won Kang:
Ensemble of binary tree structured deep convolutional network for image classification. 1448-1451 - Bee Lim, Kyoung Mu Lee:
Deep recurrent resnet for video super-resolution. 1452-1455 - Chih-Yuan Lo, Yu-Wei Hua, Wei-Chuan Yu, Yu-Min Chuang:
Functional verification and performance testing for OpenAirInterface (OAI) eNodeB. 1456-1459 - Toshiyuki Shizuoka, Osamu Takyu, Mai Ohta, Takeo Fujii:
Multiband hierarchical ad hoc network with wireless environment recognition. 1464-1469 - Wen-Ping Lai, Yong-Hsiang Wang:
On the performance impact of virtual link types to 5G networking. 1470-1474 - Po-Chiang Lin, Sheng-Lun Huang, Xin-Yuan Li:
Teaching and learning next generation mobile communication networks through open source OpenAirInterface testbeds. 1475-1478 - Hongshen Tang, Rongrong Ni, Yao Zhao, Xiaolong Li:
Detection of various image operations based on CNN. 1479-1485 - Minoru Kuribayashi, Takahiro Ueda, Nobuo Funabiki:
Secure data management system with traceability against internal leakage. 1486-1494 - Ahmad Akmal Aminuddin Mohd Kamal, Keiichi Iwamura, Hyunho Kang:
Searchable encryption of image based on secret sharing scheme. 1495-1503 - Hoang-Quoc Nguyen-Son, Ngoc-Dung T. Tieu, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen:
Identifying computer-generated text using statistical analysis. 1504-1511 - Weiwei Sun, Jiantao Zhou:
Image origin identification for online social networks (OSNs). 1512-1515 - Jongheui Hong, Wonjoon Song:
Delta-modulated cross-correlation method for delay estimation on source localization. 1516-1519 - Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of how to design control parameters for statistical voice timbre control. 1520-1523 - Decha Moungsri, Tomoki Koriyama, Takao Kobayashi:
Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features. 1524-1527 - Nirmesh J. Shah, Pramod B. Bachhav, Hemant A. Patil:
A novel filtering-based F0 estimation algorithm with an application to voice conversion. 1528-1531 - Ming-Hsiang Su, Chung-Hsien Wu, Kun-Yi Huang, Qian-Bei Hong, Hsin-Min Wang:
Personality trait perception from speech signals using multiresolution analysis and convolutional neural networks. 1532-1536 - Berrak Sisman, Haizhou Li, Kay Chen Tan:
Transformation of prosody in voice conversion. 1537-1546 - Karthika Vijayan, Minghui Dong, Haizhou Li:
A dual alignment scheme for improved speech-to-singing voice conversion. 1547-1555 - Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda:
Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives. 1556-1564 - Masato Obara, Munehiro Moriya, Ryota Konno, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh:
Acceleration for query-by-example using posteriorgram of deep neural network. 1565-1569 - Zhi Hao Lim, Xiaohai Tian, Wei Rao, Eng Siong Chng:
An investigation of spectral feature partitioning for replay attacks detection. 1570-1573 - Hanwu Sun, Kong-Aik Lee, Trung Hieu Nguyen, Bin Ma, Haizhou Li:
I2R-NUS submission to oriental language recognition AP16-OL7 challenge. 1574-1578 - Yue Chen, Yanlu Xie, Jinsong Zhang:
A comparison study of information contributions of phonemic contrasts in Mandarin. 1579-1582 - Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng:
Enhanced neural machine translation by learning from draft. 1583-1587 - Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling. 1588-1591 - Ryoichi Takashima, Yohei Kawaguchi, Qinghua Sun, Takashi Sumiyoshi, Masahito Togami:
An application of noise-robust speech translation using asynchronous smart devices. 1592-1595 - Zhiping Zeng, Haihua Xu, Tze Yuang Chong, Eng Siong Chng, Haizhou Li:
Improving N-gram language modeling for code-switching speech recognition. 1596-1601 - Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng:
Topic embedding of sentences for story segmentation. 1602-1607 - Xing Wei, Jingping Chen, Wei Wang, Yanlu Xie, Jinsong Zhang:
A study of automatic annotation of PETs with articulatory features. 1608-1612 - Shumin An, Zhenhua Ling, Lirong Dai:
Emotional statistical parametric speech synthesis using LSTM-RNNs. 1613-1616 - Shogo Hara, Hiromitsu Nishizaki:
Acoustic modeling with a shared phoneme set for multilingual speech recognition without code-switching. 1617-1620 - Narumi Mae, Yoshiki Mitsui, Shoji Makino, Daichi Kitamura, Nobutaka Ono, Takeshi Yamada, Hiroshi Saruwatari:
Sound source localization using binaural difference for hose-shaped rescue robot. 1621-1627 - Xiaowei Jiang, Shuai Wang, Xu Xiang, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. 1628-1632 - Linh Thi Thuc Tran, Henning F. Schepker, Simon Doclo, Hai Huyen Dam, Sven E. Nordholm:
Adaptive feedback control using improved variable step-size affine projection algorithm for hearing aids. 1633-1640 - Effrosyni Paschou, Fabian Esqueda, Vesa Välimäki, John Mourjopoulos:
Modeling and measuring a Moog voltage-controlled filter. 1641-1647 - Kun-Yi Huang, Chung-Hsien Wu, Ming-Hsiang Su, Chia-Hui Chou:
Mood disorder identification using deep bottleneck features of elicited speech. 1648-1652 - Siying Liu, Karianto Leman:
Handling small motions without differential approximation. 1653-1656 - Ramanpreet Singh Pahwa, Tian-Tsong Ng, Minh N. Do:
Tracking objects using 3D object proposals. 1657-1660 - Sameer Khan, Suet-Peng Yong:
A deep learning architecture for classifying medical images of anatomy object. 1661-1668 - Chern Hong Lim, Kam Meng Goh:
Fuzzy qualitative approach for micro-expression recognition. 1669-1674 - Tai-En Wu, Chia-Chi Tsai, Jiun-In Guo:
LiDAR/camera sensor fusion technology for pedestrian detection. 1675-1678 - Jundai Sun, Mao-shen Jia, Changchun Bao:
Multiple source localization by using energy weighted single source zone detection. 1679-1683 - Shahab Pasha, Jacob Donley, Christian H. Ritz:
Blind speaker counting in highly reverberant environments by clustering coherence features. 1684-1687 - Yuexian Zou, Rongzhi Gu, Disong Wang, Aimin Jiang, Christian H. Ritz:
Learning a robust DOA estimation model with acoustic vector sensor cues. 1688-1691 - Zhong-Hua Fu, Lei Xie, Peng Li, Jiaen Liang:
Frequency-invariant differential microphone array design in the STFT domain. 1692-1695 - Shahab Pasha, Christian H. Ritz, Yue Xian Zou:
Spatial multi-channel linear prediction for dereverberation of ad-hoc microphones. 1696-1700 - Suguru Hirokawa, Shin Kurihara, Hisakazu Kikuchi:
Distributed video coding based on compressive sensing and intra-predictive coding. 1701-1706 - Savath Saypadith, Watchara Ruangsang, Supavadee Aramvith:
Optimized human detection on the embedded computer vision system. 1707-1711 - Yanlong Gao, Yan Feng:
Classification of spectral compressive hyperspectral images using morphological profiles. 1712-1718 - Yongfei Zhang, Rui Fan, Chao Zhang, Gang Wang, Zhe Li:
SIMD acceleration for HEVC encoding on DSP. 1719-1725 - Seishi Takamura, Atsushi Shimizu:
Efficient video coding using rigid object tracking. 1726-1729 - Soo Hyun Bae, In Kyu Choi, Hyung Yong Kim, Kang Hyun Lee, Nam Soo Kim:
Overlapping acoustic event classification based on joint training with source separation. 1730-1734 - In Kyu Choi, Soo Hyun Bae, Sung Jun Cheon, Won-Ik Cho, Nam Soo Kim:
Weakly labeled acoustic event detection using local detector and global classifier. 1735-1738 - Gen Takahashi, Takeshi Yamada, Nobutaka Ono, Shoji Makino:
Performance evaluation of acoustic scene classification using DNN-GMM and frame-concatenated acoustic features. 1739-1743 - Nattapong Kurpukdee, Tomoki Koriyama, Takao Kobayashi, Sawit Kasuriya, Chai Wutiwiwatchai, Poonlap Lamsrichan:
Speech emotion recognition using convolutional long short-term memory neural network and support vector machines. 1744-1749 - Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank. 1750-1755 - Samer Jammal, Tammam Tillo, Jimin Xiao:
Multi-resolution for disparity estimation with convolutional neural networks. 1756-1761 - Han-Ul Kim, Chang-Su Kim:
PGT: Proposal-guided object tracking. 1762-1767 - Vien Gia An, Chul Lee:
Single-shot high dynamic range imaging via deep convolutional neural network. 1768-1772 - Eu-Tteum Baek, Yo-Sung Ho:
Stereo matching using relative total variation and entropy. 1773-1776 - Ying Gu, Mark D. Rice, Wei Xiong, Liyuan Li:
A new approach for image segmentation with shape priors based on the Potts model. 1777-1782 - Ryo Hayakawa, Kazunori Hayashi:
Binary vector reconstruction via discreteness-aware approximate message passing. 1783-1789 - Kotaro Kihara, Toshihiko Nishimura, Takeo Ohgane, Yasutaka Ogawa:
Signal detection with belief propagation in Faster-than-Nyquist signaling. 1790-1794 - Akihide David Shigyo, Koji Ishibashi:
QR-decomposed generalized belief propagation with smart message reduction for low-complexity MIMO signal detection. 1795-1799 - Takumi Takahashi, Shinsuke Ibi, Seiichi Sampei:
Design of adaptively scaled belief in large MIMO detection for higher-order modulation. 1800-1805 - Shunsuke Imai, Osamu Takyu, Fumihito Sasamori, Shiro Handa:
A study of monitoring system for radio leak with massive radio sensors. 1806-1810 - Kosuke Shimizu, Taizo Suzuki:
Cube-based encryption connected prior to motion JPEG standard. 1811-1814 - Kazuya Kawai, Junya Yamada, Hidekata Hontani, Tatsuya Yokota, Muneyuki Sakata, Yuichi Kimura:
A robust PET image reconstruction using constrained non-negative matrix factorization. 1815-1818 - Fairoza Amira Binti Hamzah, Taichi Yoshida, Masahiro Iwahashi:
Four-dimensional image compression with region of interest based on non-separable double lifting integer wavelet transform. 1819-1823 - Satoshi Nagayama, Shogo Muramatsu, Hiroyoshi Yamada, Yuuichi Sugiyama:
Millimeter wave radar image denoising with complex nonseparable oversampled lapped transform. 1824-1829 - Yusuke Nomura, Ryutaro Ogawa, Seisuke Kyochi, Taizo Suzuki:
Multiscale directional transforms based on cosine-sine modulated filter banks for sparse directional image representation. 1830-1834