Wenwu Wang 0001
Person information
- affiliation: University of Surrey, Guildford, UK
Other persons with the same name
- Wenwu Wang 0002 — Qufu Normal University, Qufu, Shandong, China
- Wenwu Wang 0003 — Xidian University, Xi'an, China
- Wenwu Wang 0004 — Wuhan University, Wuhan, China
- Wenwu Wang 0005 — Harbin Institute of Technology, Harbin, China
- Wenwu Wang 0006 — Chinese Academy of Sciences, Institute of Microelectronics, Beijing, China
- Wenwu Wang 0007 — Sichuan University, School of Mechanical Engineering, Chengdu, China
- Wenwu Wang 0008 — Wuhan University of Science and Technology, School of Information Science and Engineering, China
- Wenwu Wang 0009 — Chinese Academy of Sciences, Institute of Microelectronics, Beijing, China
- Wenwu Wang 0010 — Guangxi University of Science and Technology, China
2020 – today
- 2025
- [j117]Wujiang Zhu, Xinyuan Zhou, Shiyong Lan, Wenwu Wang, Zhiang Hou, Yao Ren, Tianyi Pan:
A dual branch graph neural network based spatial interpolation method for traffic data inference in unobserved locations. Inf. Fusion 114: 102703 (2025)
- 2024
- [j116]Kanghao Li, Shuguo Yang, Li Zhao, Wenwu Wang:
Weakly labeled sound event detection with a capsule-transformer model. Digit. Signal Process. 146: 104347 (2024) - [j115]Zijin Li, Wenwu Wang, Kejun Zhang, Mengyao Zhu:
Guest editorial: AI for computational audition - sound and music processing. EURASIP J. Audio Speech Music. Process. 2024(1): 44 (2024) - [j114]Wei Ma, Yao Li, Shiyong Lan, Wenwu Wang, Weikang Huang, Wujiang Zhu:
Semantic-aware normalizing flow with feature fusion for image anomaly detection. Neurocomputing 590: 127728 (2024) - [j113]Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework. IEEE Signal Process. Lett. 31: 2520-2524 (2024) - [j112]Yuanbo Hou, Bo Kang, Andrew Mitchell, Wenwu Wang, Jian Kang, Dick Botteldooren:
Cooperative Scene-Event Modelling for Acoustic Scene Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 68-82 (2024) - [j111]Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2871-2883 (2024) - [j110]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3311-3323 (2024) - [j109]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3339-3354 (2024) - [j108]Sara Atito Ali Ahmed, Muhammad Awais, Wenwu Wang, Mark D. Plumbley, Josef Kittler:
ASiT: Local-Global Audio Spectrogram Vision Transformer for Event Classification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3684-3693 (2024) - [j107]Shengxi Li, Xuelong Li, Leonardo Chiariglione, Jiebo Luo, Wenwu Wang, Zhengyuan Yang, Danilo P. Mandic, Hamido Fujita:
Introduction to the Special Issue on AI-Generated Content for Multimedia. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6809-6813 (2024) - [j106]Fatemeh Nazarieh, Zhenhua Feng, Muhammad Awais, Wenwu Wang, Josef Kittler:
A Survey of Cross-Modal Visual Content Generation. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6814-6832 (2024) - [j105]Feng Zhan, Wenwu Wang, Qian Chen, Yina Guo, Lidan He, Lili Wang:
Three-Direction Fusion for Accurate Volumetric Liver and Tumor Segmentation. IEEE J. Biomed. Health Informatics 28(4): 2175-2186 (2024) - [j104]Yang Liu, Yong Xu, Peipei Wu, Wenwu Wang:
Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. IEEE Trans. Multim. 26: 2544-2559 (2024) - [j103]Zhaogeng Liu, Jielong Yang, Xionghu Zhong, Wenwu Wang, Hechang Chen, Yi Chang:
A Novel Composite Graph Neural Network. IEEE Trans. Neural Networks Learn. Syst. 35(10): 13411-13425 (2024) - [c209]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning Temporal Resolution in Spectrogram for Audio Classification. AAAI 2024: 13873-13881 - [c208]Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. ACL (Findings) 2024: 16212-16226 - [c207]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. EUSIPCO 2024: 1-5 - [c206]Jinzheng Zhao, Xinyuan Qian, Yong Xu, Haohe Liu, Yin Cao, Davide Berghi, Wenwu Wang:
Text-Queried Target Sound Event Localization. EUSIPCO 2024: 261-265 - [c205]John-Joseph Brady, Yuhui Luo, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Regime Learning for Differentiable Particle Filters. FUSION 2024: 1-6 - [c204]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. ICASSP 2024: 581-585 - [c203]Yuanbo Hou, Qiaoqiao Ren, Siyang Song, Yuxin Song, Wenwu Wang, Dick Botteldooren:
Multi-Level Graph Learning For Audio Event Classification And Human-Perceived Annoyance Rating Prediction. ICASSP 2024: 716-720 - [c202]Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
AudioSR: Versatile Audio Super-Resolution at Scale. ICASSP 2024: 1076-1080 - [c201]Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. ICASSP 2024: 1271-1275 - [c200]Haiyan Lan, Qiaoxi Zhu, Jian Guan, Yuming Wei, Wenwu Wang:
Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection under Domain Shift. ICASSP 2024: 7670-7674 - [c199]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. ICASSP 2024: 8421-8425 - [c198]Kunkun SongGong, Pufen Zhang, Xiongwei Zhang, Meng Sun, Wenwu Wang:
Multi-Speaker Localization in the Circular Harmonic Domain on Small Aperture Microphone Arrays Using Deep Convolutional Networks. ICASSP 2024: 8586-8590 - [c197]Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson:
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection. ICASSP 2024: 8816-8820 - [c196]Xuenan Xu, Arshdeep Singh, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning. MLSP 2024: 1-6 - [c195]Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. MLSP 2024: 1-6 - [c194]Seyed Ahmad Soleymani, Mohammad Shojafar, Chuan Heng Foh, Shidrokh Goudarzi, Wenwu Wang:
Secure Target-Tracking by UAVs in O-RAN Environment. IFIP Networking 2024: 204-212 - [i115]Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. CoRR abs/2404.17806 (2024) - [i114]Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo:
ComposerX: Multi-Agent Symbolic Music Composition with LLMs. CoRR abs/2404.18081 (2024) - [i113]Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound. CoRR abs/2405.00233 (2024) - [i112]John-Joseph Brady, Yuhui Luo, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Regime Learning for Differentiable Particle Filters. CoRR abs/2405.04865 (2024) - [i111]Yuanbo Hou, Qiaoqiao Ren, Andrew Mitchell, Wenwu Wang, Jian Kang, Tony Belpaeme, Dick Botteldooren:
Soundscape Captioning using Sound Affective Quality Network and Large Language Model. CoRR abs/2406.05914 (2024) - [i110]Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Zero-Shot Audio Captioning Using Soft and Hard Prompts. CoRR abs/2406.06295 (2024) - [i109]Meng Cui, Xubo Liu, Haohe Liu, Jinzheng Zhao, Daoliang Li, Wenwu Wang:
Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review. CoRR abs/2406.17800 (2024) - [i108]Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. CoRR abs/2406.18187 (2024) - [i107]Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. CoRR abs/2406.18847 (2024) - [i106]Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Improving Audio Generation with Visual Enhanced Caption. CoRR abs/2407.04416 (2024) - [i105]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Xubo Liu, Wenbo Wang, Shuhan Qi, Kejia Zhang, Jianyuan Sun, Wenwu Wang:
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining. CoRR abs/2407.04936 (2024) - [i104]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. CoRR abs/2407.11745 (2024) - [i103]Xuenan Xu, Haohe Liu, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Efficient Audio Captioning with Encoder-Level Knowledge Distillation. CoRR abs/2407.14329 (2024) - [i102]Yi Yuan, Xubo Liu, Haohe Liu, Mark D. Plumbley, Wenwu Wang:
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. CoRR abs/2409.07614 (2024) - [i101]John-Joseph Brady, Yuhui Luo, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Differentiable Interacting Multiple Model Particle Filtering. CoRR abs/2410.00620 (2024)
- 2023
- [j102]Mukunthan Tharmakulasingam, Wenwu Wang, Michael Kerby, Roberto La Ragione, Anil Fernando:
TransAMR: An Interpretable Transformer Model for Accurate Prediction of Antimicrobial Resistance Using Antibiotic Administration Data. IEEE Access 11: 75337-75350 (2023) - [j101]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection. EURASIP J. Audio Speech Music. Process. 2023(1): 42 (2023) - [j100]Yina Guo, Ting Liu, Xiaofei Zhang, Anhong Wang, Wenwu Wang:
End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network. Knowl. Based Syst. 277: 110837 (2023) - [j99]Jing Dong, Kai Wu, Chang Liu, Xue Mei, Wenwu Wang:
Discriminative analysis dictionary learning with adaptively ordinal locality preserving. Neural Networks 165: 298-309 (2023) - [j98]Liming Shi, Xinheng Wang, Limin Yu, Wenwu Wang, Zhi Wang, Muddesar Iqbal, Charalampos C. Tsimenidis, Shahid Mumtaz:
A long-range aerial acoustic communication scheme. Phys. Commun. 60: 102135 (2023) - [j97]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Graph Attention for Automated Audio Captioning. IEEE Signal Process. Lett. 30: 413-417 (2023) - [j96]Yuanbo Hou, Siyang Song, Chuang Yu, Wenwu Wang, Dick Botteldooren:
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification. IEEE Signal Process. Lett. 30: 1382-1386 (2023) - [j95]Shidrokh Goudarzi, Seyed Ahmad Soleymani, Wenwu Wang, Pei Xiao:
UAV-Enabled Mobile Edge Computing for Resource Allocation Using Cooperative Evolutionary Computation. IEEE Trans. Aerosp. Electron. Syst. 59(5): 5134-5147 (2023) - [j94]Yi Li, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
U-Shaped Transformer With Frequency-Band Aware Attention for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1511-1521 (2023) - [j93]Yiming Zhang, Hong Yu, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma, Yuan Dong:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023) - [j92]Weitao Yuan, Shengbei Wang, Jianming Wang, Masashi Unoki, Wenwu Wang:
Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3206-3220 (2023) - [j91]Cheng Xue, Xionghu Zhong, Minjie Cai, Hao Chen, Wenwu Wang:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. IEEE Trans. Multim. 25: 418-429 (2023) - [c193]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. AAAI 2023: 12916-12923 - [c192]Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. EMNLP 2023: 2523-2540 - [c191]Özkan Çayli, Xubo Liu, Volkan Kiliç, Wenwu Wang:
Knowledge Distillation for Efficient Audio-Visual Video Captioning. EUSIPCO 2023: 745-749 - [c190]Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Wenwu Wang:
Enhancing Audio Retrieval with Attention-based Encoder for Audio Feature Representation. EUSIPCO 2023: 755-759 - [c189]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. EUSIPCO 2023: 765-769 - [c188]Bowei Pu, Shiyong Lan, Wenwu Wang, Caiying Yang, Wei Pan, Hongyu Yang, Wei Ma:
GanNeXt: A New Convolutional GAN for Anomaly Detection. ICANN (3) 2023: 39-49 - [c187]Xinyuan Zhou, Shiyong Lan, Wenwu Wang, Xinyang Li, Siyuan Zhou, Hongyu Yang:
Visual-Haptic-Kinesthetic Object Recognition with Multimodal Transformer. ICANN (7) 2023: 233-245 - [c186]Piaoyang Li, Shiyong Lan, Shipeng Sun, Wenwu Wang, Yongyang Gao, Yongyu Yang, Guangyu Yu:
Siamese Network Based on MLP and Multi-head Cross Attention for Visual Object Tracking. ICANN (10) 2023: 420-431 - [c185]Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang:
Time-Weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection. ICASSP 2023: 1-5 - [c184]Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection Using Audio Representation with Machine ID Based Contrastive Learning Pretraining. ICASSP 2023: 1-5 - [c183]Yuanbo Hou, Yun Wang, Wenwu Wang, Dick Botteldooren:
GCT: Gated Contextual Transformer for Sequential Audio Tagging. ICASSP 2023: 1-5 - [c182]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5 - [c181]Weitao Yuan, Yuren Bian, Shengbei Wang, Masashi Unoki, Wenwu Wang:
An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification. ICASSP 2023: 1-5 - [c180]Xiaoxiao Yin, Shiyong Lan, Weikang Huang, Yitong Ma, Wenwu Wang, Hongyu Yang, Yilin Zheng:
DLAHSD: Dynamic Label Adopted In Auxiliary Head for SAR Detection. ICIP 2023: 3434-3438 - [c179]Wei Ma, Shiyong Lan, Weikang Huang, Wenwu Wang, Hongyu Yang, Yitong Ma, Yongjie Ma:
A Semantics-Aware Normalizing Flow Model for Anomaly Detection. ICME 2023: 2207-2212 - [c178]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. ICML 2023: 21450-21474 - [c177]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. INTERSPEECH 2023: 276-280 - [c176]Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren:
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning. INTERSPEECH 2023: 331-335 - [c175]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. INTERSPEECH 2023: 2838-2842 - [c174]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. INTERSPEECH 2023: 3799-3803 - [c173]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. INTERSPEECH 2023: 4164-4168 - [c172]Wenhan Li, Xiongjie Chen, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Differentiable Bootstrap Particle Filters for Regime-Switching Models. SSP 2023: 200-204 - [i100]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. CoRR abs/2301.12503 (2023) - [i99]Wenhan Li, Xiongjie Chen, Wenwu Wang, Víctor Elvira, Yunpeng Li:
Differentiable Bootstrap Particle Filters for Regime-Switching Models. CoRR abs/2302.10319 (2023) - [i98]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study. CoRR abs/2303.03857 (2023) - [i97]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. CoRR abs/2303.17395 (2023) - [i96]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Graph Attention for Automated Audio Captioning. CoRR abs/2304.03586 (2023) - [i95]Jian Guan, Feiyang Xiao, Youde Liu, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection using Audio Representation with Machine ID based Contrastive Learning Pretraining. CoRR abs/2304.03588 (2023) - [i94]Jian Guan, Youde Liu, Qiaoxi Zhu, Tieran Zheng, Jiqing Han, Wenwu Wang:
Time-weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection. CoRR abs/2305.03328 (2023) - [i93]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7. CoRR abs/2305.15905 (2023) - [i92]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. CoRR abs/2305.17719 (2023) - [i91]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. CoRR abs/2305.18753 (2023) - [i90]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang:
Text-Driven Foley Sound Generation With Latent Diffusion Model. CoRR abs/2306.10359 (2023) - [i89]Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023) - [i88]Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023) - [i87]Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023) - [i86]Jinbo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang:
META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection. CoRR abs/2308.08847 (2023) - [i85]Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren:
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning. CoRR abs/2308.11980 (2023) - [i84]Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller:
Sparks of Large Audio Models: A Survey and Outlook. CoRR abs/2308.12792 (2023) - [i83]Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang:
Multimodal Fish Feeding Intensity Assessment in Aquaculture. CoRR abs/2309.05058 (2023) - [i82]Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley:
AudioSR: Versatile Audio Super-resolution at Scale. CoRR abs/2309.07314 (2023) - [i81]Haiyan Lan, Qiaoxi Zhu, Jian Guan, Yuming Wei, Wenwu Wang:
Hierarchical Metadata Information Constrained Self-Supervised Learning for Anomalous Sound Detection Under Domain Shift. CoRR abs/2309.07498 (2023) - [i80]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. CoRR abs/2309.08051 (2023) - [i79]Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang:
Synth-AC: Enhancing Audio Captioning with Synthetic Supervision. CoRR abs/2309.09705 (2023) - [i78]Jinzheng Zhao, Yong Xu, Xinyuan Qian, Wenwu Wang:
Audio Visual Speaker Localization from EgoCentric Views. CoRR abs/2309.16308 (2023) - [i77]Yuanbo Hou, Siyang Song, Chuang Yu, Wenwu Wang, Dick Botteldooren:
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification. CoRR abs/2310.03889 (2023) - [i76]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023) - [i75]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection. CoRR abs/2310.08950 (2023) - [i74]Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. CoRR abs/2310.14173 (2023) - [i73]Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang:
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions. CoRR abs/2310.14778 (2023) - [i72]Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson:
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection. CoRR abs/2312.09034 (2023) - [i71]Yuanbo Hou, Qiaoqiao Ren, Siyang Song, Yuxin Song, Wenwu Wang, Dick Botteldooren:
Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction. CoRR abs/2312.09952 (2023)
- 2022
- [j90]Ting Liu, Wenwu Wang, Xiaofei Zhang, Yina Guo:
One to multiple mapping dual learning: Learning multiple signals from one mixture. Digit. Signal Process. 129: 103686 (2022) - [j89]Haitao Li, Shuguo Yang, Wenwu Wang:
Improved capsule routing for weakly labeled sound event detection. EURASIP J. Audio Speech Music. Process. 2022(1): 5 (2022) - [j88]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated audio captioning: an overview of recent progress and new challenges. EURASIP J. Audio Speech Music. Process. 2022(1): 26 (2022) - [j87]Jian Guan, Jiabei Liu, Pengming Feng, Wenwu Wang:
Multiscale Deep Neural Network With Two-Stage Loss for SAR Target Recognition With Small Training Set. IEEE Geosci. Remote. Sens. Lett. 19: 1-5 (2022) - [j86]Jing Dong, Liu Yang, Chang Liu, Wei Cheng, Wenwu Wang:
Support vector machine embedding discriminative dictionary pair learning for pattern classification. Neural Networks 155: 498-511 (2022) - [j85]Lin Dong, Jifeng Qi, Baoshu Yin, Hai Zhi, Delei Li, Shuguo Yang, Wenwu Wang, Hong Cai, Bowen Xie:
Reconstruction of Subsurface Salinity Structure in the South China Sea Using Satellite Observations: A LightGBM-Based Deep Forest Method. Remote. Sens. 14(14): 3494 (2022) - [j84]Arash Shilandari, Hossein Marvi, Hossein Khosravi, Wenwu Wang:
Speech emotion recognition using data augmentation method by cycle-generative adversarial networks. Signal Image Video Process. 16(7): 1955-1962 (2022) - [j83]Feiyang Xiao, Jian Guan, Haiyan Lan, Qiaoxi Zhu, Wenwu Wang:
Local Information Assisted Attention-Free Decoder for Audio Captioning. IEEE Signal Process. Lett. 29: 1604-1608 (2022) - [j82]Kunkun SongGong, Wenwu Wang, Huawei Chen:
Acoustic Source Localization in the Circular Harmonic Domain Using Deep Learning Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2475-2491 (2022) - [c171]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022 - [c170]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Device Environmental Sound Classification. DCASE 2022 - [c169]Dongchao Yang, Helin Wang, Wenwu Wang, Yuexian Zou:
A Mixed Supervised Learning Framework For Target Sound Detection. DCASE 2022 - [c168]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. EUSIPCO 2022: 772-776 - [c167]Jinzheng Zhao, Peipei Wu, Shidrokh Goudarzi, Xubo Liu, Jianyuan Sun, Yong Xu, Wenwu Wang:
Visually Assisted Self-supervised Audio Speaker Localization and Tracking. EUSIPCO 2022: 787-791 - [c166]Özkan Çayli, Volkan Kiliç, Aytug Onan, Wenwu Wang:
Auxiliary Classifier based Residual RNN for Image Captioning. EUSIPCO 2022: 1126-1130 - [c165]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. EUSIPCO 2022: 1145-1149 - [c164]Özge Taylan Moral, Volkan Kiliç, Aytug Onan, Wenwu Wang:
Automated Image Captioning with Multi-layer Gated Recurrent Unit. EUSIPCO 2022: 1160-1164 - [c163]Wenbo Wang, Jian Guan, Xinyi Che, Wenwu Wang:
MS-MLP: Multi-scale Sampling MLP for ECG Classification. EUSIPCO 2022: 1288-1292 - [c162]Shidrokh Goudarzi, Wenwu Wang, Pei Xiao, Lyudmila Mihaylova, Simon J. Godsill:
UAV-enabled Edge Computing for Optimal Task Distribution in Target Tracking. FUSION 2022: 1-7 - [c161]Tassadaq Hussain, Wenwu Wang, Nidhal Bouaynaya, Hassan M. Fathallah-Shaykh, Lyudmila Mihaylova:
Deep Learning for Audio Visual Emotion Recognition. FUSION 2022: 1-8 - [c160]Weikang Huang, Shiyong Lan, Wenwu Wang, Xuedong Yuan, Hongyu Yang, Piaoyang Li, Wei Ma:
Face Super-Resolution with Spatial Attention Guided by Multiscale Receptive-Field Features. ICANN (1) 2022: 145-157 - [c159]Caiyin Yang, Shiyong Lan, Weikang Huang, Wenwu Wang, Guoliang Liu, Hongyu Yang, Wei Ma, Piaoyang Li:
A Transformer-Based GAN for Anomaly Detection. ICANN (2) 2022: 345-357 - [c158]Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual Learning Framework for Few-Shot Sound Event Detection. ICASSP 2022: 811-815 - [c157]Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection Using Spectral-Temporal Information Fusion. ICASSP 2022: 816-820 - [c156]Jinzheng Zhao, Peipei Wu, Xubo Liu, Yong Xu, Lyudmila Mihaylova, Simon J. Godsill, Wenwu Wang:
Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter. ICASSP 2022: 5068-5072 - [c155]Peipei Wu, Jinzheng Zhao, Shidrokh Goudarzi, Wenwu Wang:
Partial Arithmetic Consensus based Distributed Intensity Particle Flow SMC-PHD Filter for Multi-Target Tracking. ICASSP 2022: 5078-5082 - [c154]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning Via Adversarial Training. ICASSP 2022: 8882-8886 - [c153]Shiyong Lan, Yitong Ma, Weikang Huang, Wenwu Wang, Hongyu Yang, Pyang Li:
DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting. ICML 2022: 11906-11917 - [c152]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. INTERSPEECH 2022: 1511-1515 - [c151]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805 - [c150]Jinzheng Zhao, Peipei Wu, Xubo Liu, Shidrokh Goudarzi, Haohe Liu, Yong Xu, Wenwu Wang:
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. INTERSPEECH 2022: 3704-3708 - [c149]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. INTERSPEECH 2022: 4142-4146 - [c148]Meng Cui, Xubo Liu, Jinzheng Zhao, Jianyuan Sun, Guoping Lian, Tao Chen, Mark D. Plumbley, Daoliang Li, Wenwu Wang:
Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm. MLSP 2022: 1-6 - [c147]Buddhiprabha Erabadda, Gosala Kulupana, Thanuja Mallikarachchi, Wenwu Wang, Anil Fernando:
A Hybrid Approach to Blind Video Quality Prediction of User Generated Content. PCS 2022: 307-311 - [i70]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Haiyan Lan, Wenwu Wang:
Local Information Assisted Attention-free Decoder for Audio Captioning. CoRR abs/2201.03217 (2022) - [i69]Youde Liu, Jian Guan, Qiaoxi Zhu, Wenwu Wang:
Anomalous Sound Detection using Spectral-Temporal Information Fusion. CoRR abs/2201.05510 (2022) - [i68]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. CoRR abs/2203.02838 (2022) - [i67]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. CoRR abs/2203.03436 (2022) - [i66]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022) - [i65]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. CoRR abs/2203.15537 (2022) - [i64]Dongchao Yang, Helin Wang, Yuexian Zou, Wenwu Wang:
A Two-student Learning Framework for Mixed Supervised Target Sound Detection. CoRR abs/2204.02088 (2022) - [i63]Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. CoRR abs/2204.02143 (2022) - [i62]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated Audio Captioning: an Overview of Recent Progress and New Challenges. CoRR abs/2205.05949 (2022) - [i61]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022) - [i60]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022) - [i59]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022) - [i58]Arshdeep Singh, James A. King, Xubo Liu, Wenwu Wang, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2208.01555 (2022) - [i57]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022) - [i56]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022) - [i55]Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features. CoRR abs/2210.05037 (2022) - [i54]Yuanbo Hou, Yun Wang, Wenwu Wang, Dick Botteldooren:
GCT: Gated Contextual Transformer for Sequential Audio Tagging. CoRR abs/2210.12541 (2022) - [i53]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. CoRR abs/2210.15088 (2022) - [i52]Yuanbo Hou, Siyang Song, Chuang Yu, Yuxin Song, Wenwu Wang, Dick Botteldooren:
Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification. CoRR abs/2210.15366 (2022) - [i51]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022) - [i50]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022) - [i49]Sara Atito, Muhammad Awais, Wenwu Wang, Mark D. Plumbley, Josef Kittler:
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation. CoRR abs/2211.13189 (2022) - [i48]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. CoRR abs/2212.02033 (2022) - [i47]Yaozong Mo, Chaofeng Li, Wenqi Ren, Shaopeng Shang, Wenwu Wang, Xiao-Jun Wu:
Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN. CoRR abs/2212.12116 (2022) - 2021
- [j81]Bin Li, Lucas Rencker, Jing Dong, Yuhui Luo, Mark D. Plumbley, Wenwu Wang:
Sparse Analysis Model Based Dictionary Learning for Signal Declipping. IEEE J. Sel. Top. Signal Process. 15(1): 25-36 (2021) - [j80]Yang Xian, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
A Multi-Scale Feature Recalibration Network for End-to-End Single Channel Speech Enhancement. IEEE J. Sel. Top. Signal Process. 15(1): 143-155 (2021) - [j79]Yang Xian, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
Convolutional fusion network for monaural speech enhancement. Neural Networks 143: 97-107 (2021) - [j78]Jielong Yang, Xionghu Zhong, Weiguang Chen, Wenwu Wang:
Multiple Acoustic Source Localization in Microphone Array Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 334-347 (2021) - [j77]Weitao Yuan, Bofei Dong, Shengbei Wang, Masashi Unoki, Wenwu Wang:
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 807-822 (2021) - [j76]Kunkun SongGong, Huawei Chen, Wenwu Wang:
Indoor Multi-Speaker Localization Based on Bayesian Nonparametrics in the Circular Harmonic Domain. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1864-1880 (2021) - [j75]Yulong Huang, Chao Xue, Fengchi Zhu, Wenwu Wang, Yonggang Zhang, Jonathon A. Chambers:
Adaptive Recursive Decentralized Cooperative Localization for Multirobot Systems With Time-Varying Measurement Accuracy. IEEE Trans. Instrum. Meas. 70: 1-25 (2021) - [j74]Shanliang Zhu, Yu Zhao, Yanjie Zhang, Qingling Li, Wenwu Wang, Shuguo Yang:
Short-Term Traffic Flow Prediction With Wavelet and Multi-Dimensional Taylor Network Model. IEEE Trans. Intell. Transp. Syst. 22(5): 3203-3208 (2021) - [c146]Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. DCASE 2021: 196-200 - [c145]Turab Iqbal, Yin Cao, Andrew Bailey, Mark D. Plumbley, Wenwu Wang:
ARCA23K: An Audio Dataset for Investigating Open-Set Label Noise. DCASE 2021: 201-205 - [c144]Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning. DCASE 2021: 206-210 - [c143]Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. DCASE 2021: 211-215 - [c142]Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark D. Plumbley:
An Audio-Based Deep Learning Framework For BBC Television Programme Classification. EUSIPCO 2021: 56-60 - [c141]Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-Local Attention Framework for Weakly Labelled Audio Tagging. ICASSP 2021: 351-355 - [c140]Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang:
Enhancing Audio Augmentation Methods with Consistency Learning. ICASSP 2021: 646-650 - [c139]Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. ICASSP 2021: 885-889 - [c138]Jian Guan, Wenbo Wang, Pengming Feng, Xinxin Wang, Wenwu Wang:
Low-Dimensional Denoising Embedding Transformer for ECG Classification. ICASSP 2021: 1285-1289 - [c137]Shuoyang Li, Yuhui Luo, Jonathon A. Chambers, Wenwu Wang:
Dimension Selected Subspace Clustering. ICASSP 2021: 3195-3199 - [c136]Jingshu Zhang, Mark D. Plumbley, Wenwu Wang:
Weighted Magnitude-Phase Loss for Speech Dereverberation. ICASSP 2021: 5794-5798 - [c135]Chao Liu, Xiaodong Yang, Dading Chong, Wenwu Wang, Liang Li:
Enhancing Alzheimer's Disease Diagnosis via Hierarchical 3D-FCN with Multi-Modal Features. ICIP 2021: 304-308 - [c134]Shiyong Lan, Jin Li, Shipeng Sun, Xin Lai, Wenwu Wang:
Robust Visual Object Tracking with Spatiotemporal Regularisation and Discriminative Occlusion Deformation. ICIP 2021: 1879-1883 - [c133]Guoliang Liu, Shiyong Lan, Ting Zhang, Weikang Huang, Wenwu Wang:
SAGAN: Skip-Attention GAN For Anomaly Detection. ICIP 2021: 2468-2472 - [c132]Helin Wang, Yuexian Zou, Wenwu Wang:
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification. Interspeech 2021: 551-555 - [c131]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction. Interspeech 2021: 3041-3045 - [c130]Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. MLSP 2021: 1-6 - [d1]Turab Iqbal, Yin Cao, Andrew Bailey, Mark D. Plumbley, Wenwu Wang:
ARCA23K. Zenodo, 2021 - [i46]Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-local Attention Framework for Weakly Labelled Audio Tagging. CoRR abs/2102.01931 (2021) - [i45]Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang:
Enhancing Audio Augmentation Methods with Consistency Learning. CoRR abs/2102.05151 (2021) - [i44]Feiyang Xiao, Jian Guan, Qiuqiang Kong, Wenwu Wang:
Time-domain Speech Enhancement with Generative Adversarial Learning. CoRR abs/2103.16149 (2021) - [i43]Helin Wang, Yuexian Zou, Wenwu Wang:
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification. CoRR abs/2103.16858 (2021) - [i42]Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark D. Plumbley:
An Audio-Based Deep Learning Framework For BBC Television Programme Classification. CoRR abs/2104.01161 (2021) - [i41]Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. CoRR abs/2107.09817 (2021) - [i40]Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. CoRR abs/2107.09990 (2021) - [i39]Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. CoRR abs/2107.09998 (2021) - [i38]Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning. CoRR abs/2108.02752 (2021) - [i37]Turab Iqbal, Yin Cao, Andrew Bailey, Mark D. Plumbley, Wenwu Wang:
ARCA23K: An audio dataset for investigating open-set label noise. CoRR abs/2109.09227 (2021) - [i36]Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual learning framework for Few-shot Sound Event Detection. CoRR abs/2110.04474 (2021) - [i35]Ting Liu, Wenwu Wang, Xiaofei Zhang, Zhenyin Gong, Yina Guo:
One to Multiple Mapping Dual Learning: Learning Multiple Sources from One Mixed Signal. CoRR abs/2110.06568 (2021) - [i34]Yina Guo, Xiaofei Zhang, Zhenying Gong, Anhong Wang, Wenwu Wang:
End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network. CoRR abs/2110.06634 (2021) - [i33]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning via Adversarial Training. CoRR abs/2110.06691 (2021) - 2020
- [j73]Jing Dong, Zhichao Xue, Wenwu Wang:
Robust PCA Using Nonconvex Rank Approximation and Sparse Regularizer. Circuits Syst. Signal Process. 39(6): 3086-3104 (2020) - [j72]Shili Peng, Qinghua Hu, Jianwu Dang, Wenwu Wang:
Optimal feasible step-size based working set selection for large scale SVMs training. Neurocomputing 407: 366-375 (2020) - [j71]Mi He, Yongjian Nian, Luping Xu, Lihong Qiao, Wenwu Wang:
Adaptive Separation of Respiratory and Heartbeat Signals among Multiple People Based on Empirical Wavelet Transform Using UWB Radar. Sensors 20(17): 4913 (2020) - [j70]Dongli Xu, Jian Guan, Pengming Feng, Wenwu Wang:
Association Loss for Visual Object Detection. IEEE Signal Process. Lett. 27: 1435-1439 (2020) - [j69]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Modeling Label Dependencies for Audio Tagging With Graph Convolutional Network. IEEE Signal Process. Lett. 27: 1560-1564 (2020) - [j68]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2450-2460 (2020) - [j67]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley:
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2880-2894 (2020) - [j66]Yina Guo, Xiangning Zhao, Jianyu Li, Anhong Wang, Wenwu Wang:
Blind Multiple-Input Multiple-Output Image Phase Retrieval. IEEE Trans. Ind. Electron. 67(3): 2220-2230 (2020) - [j65]Yina Guo, Jianguo Chen, Xiaowen Ren, Anhong Wang, Wenwu Wang:
Joint Raindrop and Haze Removal From a Single Image. IEEE Trans. Image Process. 29: 9508-9519 (2020) - [j64]Yang Liu, Volkan Kiliç, Jian Guan, Wenwu Wang:
Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. IEEE Trans. Multim. 22(4): 934-948 (2020) - [c129]Liming Shi, Limin Yu, Kaizhu Huang, Xu Zhu, Zhi Wang, Xiaofei Li, Wenwu Wang, Xinheng Wang:
A Covert Ultrasonic Phone-to-Phone Communication Scheme. CollaborateCom (1) 2020: 36-48 - [c128]Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley:
Event-Independent Network for Polyphonic Sound Event Localization and Detection. DCASE 2020: 11-15 - [c127]Saeid Safavi, Turab Iqbal, Wenwu Wang, Philip Coleman, Mark D. Plumbley:
Open-Window: A Sound Event Dataset for Window State Detection and Recognition. DCASE 2020: 185-189 - [c126]Yang Xian, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
Multi-Scale Residual Convolutional Encoder Decoder with Bidirectional Long Short-Term Memory for Single Channel Speech Enhancement. EUSIPCO 2020: 431-435 - [c125]Qiuqiang Kong, Yuxuan Wang, Xuchen Song, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis. ICASSP 2020: 101-105 - [c124]Sixin Hong, Yuexian Zou, Wenwu Wang, Meng Cao:
Weakly Labelled Audio Tagging Via Convolutional Networks with Spatial and Channel-Wise Attention. ICASSP 2020: 296-300 - [c123]Turab Iqbal, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Learning With Out-of-Distribution Data for Audio Classification. ICASSP 2020: 636-640 - [c122]Jian Guan, Jiabei Liu, Jianguo Sun, Pengming Feng, Tong Shuai, Wenwu Wang:
Meta Metric Learning for Highly Imbalanced Aerial Scene Classification. ICASSP 2020: 4047-4051 - [c121]Takahiro Murakami, Wenwu Wang:
An Analytical Solution to Jacobsen Estimator for Windowed Signals. ICASSP 2020: 5950-5954 - [c120]Sixin Hong, Yuexian Zou, Wenwu Wang:
Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging. INTERSPEECH 2020: 816-820 - [c119]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Environmental Sound Classification with Parallel Temporal-Spectral Attention. INTERSPEECH 2020: 821-825 - [i32]Qiuqiang Kong, Yuxuan Wang, Xuchen Song, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Source separation with weakly labelled data: An approach to computational auditory scene analysis. CoRR abs/2002.02065 (2020) - [i31]Turab Iqbal, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Learning with Out-of-Distribution Data for Audio Classification. CoRR abs/2002.04683 (2020) - [i30]Weitao Yuan, Bofei Dong, Shengbei Wang, Masashi Unoki, Wenwu Wang:
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation. CoRR abs/2008.00816 (2020) - [i29]Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley:
Event-Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2010.00140 (2020) - [i28]Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2010.13092 (2020)
2010 – 2019
- 2019
- [j63]Yina Guo, Tao Wang, Jianyu Li, Anhong Wang, Wenwu Wang:
Multiple Input Single Output Phase Retrieval. Circuits Syst. Signal Process. 38(8): 3818-3840 (2019) - [j62]Yang Sun, Yang Xian, Wenwu Wang, Syed Mohsen Naqvi:
Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network. IEEE J. Sel. Top. Signal Process. 13(2): 359-369 (2019) - [j61]Yang Chen, Wenwu Wang, Zhe Wang, Bingyin Xia:
A Source Counting Method Using Acoustic Vector Sensor Based on Sparse Modeling of DOA Histogram. IEEE Signal Process. Lett. 26(1): 69-73 (2019) - [j60]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
A Skip Attention Mechanism for Monaural Singing Voice Separation. IEEE Signal Process. Lett. 26(10): 1481-1485 (2019) - [j59]Qingju Liu, Philip J. B. Jackson, Wenwu Wang:
A Speech Synthesis Approach for High Quality Speech Separation and Generation. IEEE Signal Process. Lett. 26(12): 1872-1876 (2019) - [j58]Yang Sun, Wenwu Wang, Jonathon A. Chambers, Syed Mohsen Naqvi:
Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 27(1): 125-139 (2019) - [j57]Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 777-787 (2019) - [j56]Qiuqiang Kong, Changsong Yu, Yong Xu, Turab Iqbal, Wenwu Wang, Mark D. Plumbley:
Weakly Labelled AudioSet Tagging With Attention Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1791-1802 (2019) - [j55]Luca Remaggi, Philip J. B. Jackson, Wenwu Wang:
Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2263-2277 (2019) - [j54]Lucas Rencker, Francis R. Bach, Wenwu Wang, Mark D. Plumbley:
Sparse Recovery and Dictionary Learning From Nonlinear Compressive Measurements. IEEE Trans. Signal Process. 67(21): 5659-5670 (2019) - [c118]Junyi Peng, Rongzhi Gu, Yuexian Zou, Wenwu Wang:
Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification. APSIPA 2019: 314-319 - [c117]Yin Cao, Qiuqiang Kong, Turab Iqbal, Fengyan An, Wenwu Wang, Mark D. Plumbley:
Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy. DCASE 2019: 30-34 - [c116]Shuoyang Li, Yuantao Gu, Yuhui Luo, Jonathon A. Chambers, Wenwu Wang:
Enhanced Streaming Based Subspace Clustering Applied to Acoustic Scene Data Clustering. ICASSP 2019: 11-15 - [c115]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
Proximal Deep Recurrent Neural Network for Monaural Singing Voice Separation. ICASSP 2019: 286-290 - [c114]Qiuqiang Kong, Yong Xu, Turab Iqbal, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Acoustic Scene Generation with Conditional Samplernn. ICASSP 2019: 925-929 - [c113]Yuexian Zou, Yi Wang, Wenjie Guan, Wenwu Wang:
Semantic Super-resolution for Extremely Low-resolution Vehicle License Plate. ICASSP 2019: 3772-3776 - [c112]Yang Liu, Qinghua Hu, Yuexian Zou, Wenwu Wang:
Labelled Non-zero Particle Flow for SMC-PHD Filtering. ICASSP 2019: 5197-5201 - [c111]Yan Tang, Trevor J. Cox, Bruno M. Fazenda, Qingju Liu, Wenwu Wang:
Background Adaptation for Improved Listening Experience in Broadcasting. ICASSP 2019: 8008-8012 - [c110]Christian Kroos, Oliver Bones, Yin Cao, Lara Harris, Philip J. B. Jackson, William J. Davies, Wenwu Wang, Trevor J. Cox, Mark D. Plumbley:
Generalisation in Environmental Sound Classification: The 'Making Sense of Sounds' Data Set and Challenge. ICASSP 2019: 8082-8086 - [c109]Jun Wang, Shengchen Li, Wenwu Wang:
SVD-Based Channel Pruning for Convolutional Neural Network in Acoustic Scene Classification Model. ICME Workshops 2019: 390-395 - [c108]Yang Sun, Yang Xian, Wenwu Wang, Syed Mohsen Naqvi:
Single-Channel Speech Enhancement with Sequentially Trained DNN System. ICSPCS 2019: 1-6 - [c107]Yang Xian, Yang Sun, Wenwu Wang, Syed Mohsen Naqvi:
Monaural Speech Enhancement Based On Two Stage Long Short-Term Memory Networks. ICSPCS 2019: 1-5 - [c106]Qiuqiang Kong, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. IJCAI 2019: 2747-2753 - [c105]Dading Chong, Yuexian Zou, Wenwu Wang:
Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification. MMM (2) 2019: 157-168 - [i27]Qiuqiang Kong, Changsong Yu, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Weakly labelled AudioSet Classification with Attention Neural Networks. CoRR abs/1903.00765 (2019) - [i26]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems. CoRR abs/1904.03476 (2019) - [i25]Yin Cao, Qiuqiang Kong, Turab Iqbal, Fengyan An, Wenwu Wang, Mark D. Plumbley:
Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy. CoRR abs/1905.00268 (2019) - [i24]Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. CoRR abs/1906.07552 (2019) - [i23]Luca Remaggi, Philip J. B. Jackson, Wenwu Wang:
Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation. CoRR abs/1910.02127 (2019) - [i22]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data with CNN-Transformer and Automatic Threshold Optimization. CoRR abs/1912.04761 (2019) - [i21]Helin Wang, Yuexian Zou, Dading Chong, Wenwu Wang:
Learning discriminative and robust time-frequency representations for environmental sound classification. CoRR abs/1912.06808 (2019) - [i20]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley:
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. CoRR abs/1912.10211 (2019) - 2018
- [j53]Disong Wang, Yuexian Zou, Wenwu Wang:
Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor. J. Frankl. Inst. 355(4): 1692-1709 (2018) - [j52]Fanglin Gu, Shan Wang, Wenwu Wang:
Standard-independent I/Q imbalance estimation and compensation scheme in OFDM. Frontiers Inf. Technol. Electron. Eng. 19(3): 388-397 (2018) - [j51]Jianqing Liang, Qinghua Hu, Pengfei Zhu, Wenwu Wang:
Efficient multi-modal geometric mean metric learning. Pattern Recognit. 75: 188-198 (2018) - [j50]Josef Kittler, Cemre Zor, Ioannis Kaloskampis, Yulia Hicks, Wenwu Wang:
Error sensitivity analysis of Delta divergence - a novel measure for classifier incongruence detection. Pattern Recognit. 77: 30-44 (2018) - [j49]Syed Zubair, Naveed Ishtiaq Chaudhary, Zeshan Aslam Khan, Wenwu Wang:
Momentum fractional LMS for power signal parameter estimation. Signal Process. 142: 441-449 (2018) - [j48]Jian Guan, Xuan Wang, Pengming Feng, Jing Dong, Jonathon A. Chambers, Zoe Lin Jiang, Wenwu Wang:
Polynomial dictionary learning algorithms in sparse representations. Signal Process. 142: 492-503 (2018) - [j47]Yina Guo, Anhong Wang, Wenwu Wang:
Multi-source phase retrieval from multi-channel phaseless STFT measurements. Signal Process. 144: 36-40 (2018) - [j46]Yan Tang, Qingju Liu, Wenwu Wang, Trevor J. Cox:
A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones. Speech Commun. 96: 116-128 (2018) - [j45]Jing Dong, Zhichao Xue, Jian Guan, Zi-Fa Han, Wenwu Wang:
Low rank matrix completion using truncated nuclear norm and sparse regularizer. Signal Process. Image Commun. 68: 76-87 (2018) - [j44]Jiayang Wang, Gen Li, Lucas Rencker, Wenwu Wang, Yuantao Gu:
An RIP-Based Performance Guarantee of Covariance-Assisted Matching Pursuit. IEEE Signal Process. Lett. 25(6): 828-832 (2018) - [j43]Swati Chandna, Wenwu Wang:
Bootstrap Averaging for Model-Based Source Separation in Reverberant Conditions. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 806-819 (2018) - [j42]Yi Wang, Yuexian Zou, Wenwu Wang:
Manifold-Based Visual Object Counting. IEEE Trans. Image Process. 27(7): 3248-3263 (2018) - [j41]Qingju Liu, Wenwu Wang, Teofilo de Campos, Philip J. B. Jackson, Adrian Hilton:
Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion. IEEE Trans. Multim. 20(7): 1767-1780 (2018) - [c104]Saeid Safavi, Andy Pearce, Wenwu Wang, Mark D. Plumbley:
Predicting the perceived level of reverberation using machine learning. ACSSC 2018: 27-30 - [c103]Qingju Liu, Wenwu Wang, Philip J. B. Jackson, Saeid Safavi:
A Performance Evaluation of Several Deep Neural Networks for Reverberant Speech Separation. ACSSC 2018: 689-693 - [c102]Turab Iqbal, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
General-purpose audio tagging from noisy labels using convolutional neural networks. DCASE 2018: 212-216 - [c101]Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline. DCASE 2018: 217-221 - [c100]Yang Sun, Wenwu Wang, Jonathon A. Chambers, Syed Mohsen Naqvi:
Enhanced Time-Frequency Masking by Using Neural Networks for Monaural Source Separation in Reverberant Room Environments. EUSIPCO 2018: 1647-1651 - [c99]Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. EUSIPCO 2018: 2255-2259 - [c98]Shuoyang Li, Wenwu Wang:
Randomly Sketched Sparse Subspace Clustering for Acoustic Scene Clustering. EUSIPCO 2018: 2489-2493 - [c97]Alfredo Zermini, Qiuqiang Kong, Yong Xu, Mark D. Plumbley, Wenwu Wang:
Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks. LVA/ICA 2018: 361-371 - [c96]Lucas Rencker, Francis R. Bach, Wenwu Wang, Mark D. Plumbley:
Consistent Dictionary Learning for Signal Declipping. LVA/ICA 2018: 446-455 - [c95]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network. ICASSP 2018: 121-125 - [c94]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set Classification with Attention Model: A Probabilistic Perspective. ICASSP 2018: 316-320 - [c93]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data. ICASSP 2018: 321-325 - [c92]Qingju Liu, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Philip Coleman:
Iterative Deep Neural Networks for Speaker-Independent Binaural Blind Speech Separation. ICASSP 2018: 541-545 - [c91]Qiang Huang, Philip J. B. Jackson, Mark D. Plumbley, Wenwu Wang:
Synthesis of Images by Two-Stage Generative Adversarial Networks. ICASSP 2018: 1593-1597 - [c90]Viet Hung Tran, Wenwu Wang, Yuhui Luo, Jonathon A. Chambers:
Bayesian Inference for Multi-Line Spectra in Linear Sensor Array. ICASSP 2018: 4254-4258 - [c89]Yang Liu, Adrian Hilton, Jonathon A. Chambers, Yuxin Zhao, Wenwu Wang:
Non-Zero Diffusion Particle Flow SMC-PHD Filter for Audio-Visual Multi-Speaker Tracking. ICASSP 2018: 4304-4308 - [c88]Josef Kittler, Ioannis Kaloskampis, Cemre Zor, Yang Xu, Yulia Hicks, Wenwu Wang:
Intelligent Signal Processing Mechanisms for Nuanced Anomaly Detection in Action Audio-Visual Data Streams. ICASSP 2018: 6563-6567 - [c87]Jianchao Gao, Hong Shi, Wenwu Wang:
Spatially Regularized Low Rank Tensor Optimization for Visual Data Completion. ICIP 2018: 1822-1826 - [c86]Xiaohu Zhang, Yuexian Zou, Wenwu Wang:
LD-CNN: A Lightweight Dilated Convolutional Neural Network for Environmental Sound Classification. ICPR 2018: 373-378 - [c85]Huan Zhang, Hong Shi, Wenwu Wang:
Cascade Deep Networks for Sparse Linear Inverse Problems. ICPR 2018: 812-817 - [i19]Turab Iqbal, Wenwu Wang:
Approximate Message Passing for Underdetermined Audio Source Separation. CoRR abs/1802.00380 (2018) - [i18]Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. CoRR abs/1804.04715 (2018) - [i17]Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. CoRR abs/1806.04699 (2018) - [i16]Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge baseline with convolutional neural networks. CoRR abs/1808.00773 (2018) - [i15]Viet Hung Tran, Wenwu Wang:
Bayesian inference for PCA and MUSIC algorithms with unknown number of sources. CoRR abs/1809.10168 (2018) - [i14]Yang Liu, Wenwu Wang, Volkan Kilic:
Intensity Particle Flow SMC-PHD Filter For Audio Speaker Tracking. CoRR abs/1812.01570 (2018) - 2017
- [j40]Fanglin Gu, Hang Zhang, Wenwu Wang, Shan Wang:
An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model. Circuits Syst. Signal Process. 36(7): 2697-2726 (2017) - [j39]Jian Guan, Xuan Wang, Wenwu Wang, Lei Huang:
Sparse Blind Speech Deconvolution with Dynamic Range Regularization and Indicator Function. Circuits Syst. Signal Process. 36(10): 4145-4160 (2017) - [j38]Jing Dong, Zi-Fa Han, Yuxin Zhao, Wenwu Wang, Ales Procházka, Jonathon A. Chambers:
Sparse analysis model based multiplicative noise removal with enhanced regularization. Signal Process. 137: 160-176 (2017) - [j37]Luca Remaggi, Philip J. B. Jackson, Philip Coleman, Wenwu Wang:
Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods. IEEE ACM Trans. Audio Speech Lang. Process. 25(2): 296-309 (2017) - [j36]Andreas Franck, Wenwu Wang, Filippo Maria Fazi:
Sparse ℓ1-Optimal Multiloudspeaker Panning and Its Relation to Vector Base Amplitude Panning. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 996-1010 (2017) - [j35]Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1230-1241 (2017) - [j34]Pengming Feng, Wenwu Wang, Satnam Singh Dlay, Syed Mohsen Naqvi, Jonathon A. Chambers:
Social Force Model-Based MCMC-OCSVM Particle PHD Filter for Multiple Human Tracking. IEEE Trans. Multim. 19(4): 725-739 (2017) - [j33]Jianqing Liang, Qinghua Hu, Wenwu Wang, Yahong Han:
Semisupervised Online Multikernel Similarity Learning for Image Retrieval. IEEE Trans. Multim. 19(5): 1077-1089 (2017) - [c84]Qingju Liu, Wenwu Wang, Philip J. B. Jackson, Yan Tang:
A perceptually-weighted deep neural network for monaural speech enhancement in various background noise conditions. EUSIPCO 2017: 1270-1274 - [c83]Mingyang Chen, Wenwu Wang, Mark Barnard, Jonathon A. Chambers:
Wideband DoA estimation based on joint optimisation of array and spatial sparsity. EUSIPCO 2017: 2106-2110 - [c82]Lucas Rencker, Wenwu Wang, Mark D. Plumbley:
Multivariate iterative hard thresholding for sparse decomposition with flexible sparsity patterns. EUSIPCO 2017: 2156-2160 - [c81]Yang Liu, Wenwu Wang, Jonathon A. Chambers, Volkan Kilic, Adrian Hilton:
Particle Flow SMC-PHD Filter for Audio-Visual Multi-speaker Tracking. LVA/ICA 2017: 344-353 - [c80]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint detection-classification model for audio tagging of weakly labelled data. ICASSP 2017: 641-645 - [c79]Ronan Hamon, Valentin Emiya, Lucas Rencker, Wenwu Wang, Mark D. Plumbley:
Assessment of musical noise using localization of isolated peaks in time-frequency domain. ICASSP 2017: 696-700 - [c78]Qiang Huang, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Mark D. Plumbley:
Fast tagging of natural sounds using marginal co-regularization. ICASSP 2017: 2991-2995 - [c77]Yang Liu, Wenwu Wang, Yuxin Zhao:
Particle flow for sequential Monte Carlo implementation of probability hypothesis density. ICASSP 2017: 4371-4375 - [c76]Lucas Rencker, Wenwu Wang, Mark D. Plumbley:
A greedy algorithm with learned statistics for sparse signal reconstruction. ICASSP 2017: 4775-4779 - [c75]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional gated recurrent neural network incorporating spatial features for audio tagging. IJCNN 2017: 3461-3466 - [c74]Jian Guan, Xuan Wang, Pengming Feng, Jing Dong, Wenwu Wang:
Matrix of Polynomials Model Based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling. INTERSPEECH 2017: 3068-3072 - [c73]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. INTERSPEECH 2017: 3083-3087 - [c72]Alfredo Zermini, Qingju Liu, Yong Xu, Mark D. Plumbley, Dave Betts, Wenwu Wang:
Binaural and log-power spectra features with deep neural networks for speech-noise separation. MMSP 2017: 1-6 - [c71]Jian Guan, Xuan Wang, Shuhan Qi, Jing Dong, Wenwu Wang:
Blind Speech Deconvolution via Pretrained Polynomial Dictionary and Sparse Representation. PCM (1) 2017: 411-420 - [c70]Jian Guan, Xuan Wang, Zongxia Xie, Shuhan Qi, Wenwu Wang:
Joint L1-L2 Regularisation for Blind Speech Deconvolution. PCM (1) 2017: 834-843 - [i13]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging. CoRR abs/1702.07787 (2017) - [i12]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. CoRR abs/1703.06052 (2017) - [i11]Jian Guan, Xuan Wang, Pengming Feng, Jing Dong, Wenwu Wang:
Matrix of Polynomials Model based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling. CoRR abs/1705.08660 (2017) - [i10]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey-cvssp system for DCASE2017 challenge task4. CoRR abs/1709.00551 (2017) - [i9]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-scale weakly supervised audio classification using gated convolutional neural network. CoRR abs/1710.00343 (2017) - [i8]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set classification with attention model: A probabilistic perspective. CoRR abs/1711.00927 (2017) - [i7]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint separation-classification model for sound event detection of weakly labelled data. CoRR abs/1711.03037 (2017) - 2016
- [j32]Fanglin Gu, Hang Zhang, Wenwu Wang, Chunlin Xiong:
A Promising Technique for Blind Identification: The Generic Statistics. Circuits Syst. Signal Process. 35(7): 2544-2562 (2016) - [j31]Yang Yu, Wenwu Wang, Peng Han:
Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks. EURASIP J. Audio Speech Music. Process. 2016: 7 (2016) - [j30]Mark Barnard, Wenwu Wang:
Audio head pose estimation using the direct to reverberant speech ratio. Speech Commun. 85: 98-108 (2016) - [j29]Pengming Feng, Wenwu Wang, Syed Mohsen Naqvi, Jonathon A. Chambers:
Adaptive Retrodiction Particle PHD Filter for Multiple Human Tracking. IEEE Signal Process. Lett. 23(11): 1592-1596 (2016) - [j28]Volkan Kilic, Mark Barnard, Wenwu Wang, Adrian Hilton, Josef Kittler:
Mean-Shift and Sparse Sampling-Based SMC-PHD Filtering for Audio Informed Visual Speaker Tracking. IEEE Trans. Multim. 18(12): 2417-2431 (2016) - [j27]Jing Dong, Wenwu Wang, Wei Dai, Mark D. Plumbley, Zi-Fa Han, Jonathon A. Chambers:
Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning. IEEE Trans. Signal Process. 64(2): 417-431 (2016) - [c69]Qiuqiang Kong, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Deep Neural Network Baseline for DCASE Challenge 2016. DCASE 2016: 50-54 - [c68]Yong Xu, Qiang Huang, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Fully DNN-Based Multi-Label Regression for Audio Tagging. DCASE 2016: 105-109 - [c67]Yong Xu, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Hierarchical Learning for DNN-Based Acoustic Scene Classification. DCASE 2016: 110-114 - [c66]Qingju Liu, Teófilo Emídio de Campos, Wenwu Wang, Adrian Hilton:
Identity association using PHD filters in multiple head tracking with depth sensors. ICASSP 2016: 1506-1510 - [c65]Pengming Feng, Wenwu Wang, Syed Mohsen Naqvi, Satnam Singh Dlay, Jonathon A. Chambers:
Social force model aided robust particle PHD filter for multiple human tracking. ICASSP 2016: 4398-4402 - [c64]Qingju Liu, Yan Tang, Philip J. B. Jackson, Wenwu Wang:
Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm. INTERSPEECH 2016: 140-144 - [c63]Fanglin Gu, Shan Wang, Jibo Wei, Wenwu Wang:
Higher-Order Circularity Based I/Q Imbalance Compensation in Direct-Conversion Receivers. VTC Fall 2016: 1-6 - [i6]Yong Xu, Qiang Huang, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Fully DNN-based Multi-label regression for audio tagging. CoRR abs/1606.07695 (2016) - [i5]Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging. CoRR abs/1607.03681 (2016) - [i4]Yong Xu, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Hierarchical learning for DNN-based acoustic scene classification. CoRR abs/1607.03682 (2016) - [i3]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Detection-Classification Model for Audio Tagging of Weakly Labelled Data. CoRR abs/1610.01797 (2016) - [i2]Luca Remaggi, Philip J. B. Jackson, Philip Coleman, Wenwu Wang:
Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods. CoRR abs/1610.05653 (2016) - 2015
- [j26]Di Wu, Yuxin Zhao, Wenwu Wang, Yanling Hao:
Cosparsity-based Stagewise Matching Pursuit algorithm for reconstruction of the cosparse signals. EURASIP J. Adv. Signal Process. 2015: 101 (2015) - [j25]Xiaoyi Chen, Wenwu Wang, Yingmin Wang, Xionghu Zhong, Atiyeh Alinaghi:
Reverberant speech separation with probabilistic time-frequency masking for B-format recordings. Speech Commun. 68: 41-54 (2015) - [j24]Volkan Kilic, Mark Barnard, Wenwu Wang, Josef Kittler:
Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering. IEEE Trans. Multim. 17(2): 186-200 (2015) - [j23]Lei Zhao, Qinghua Hu, Wenwu Wang:
Heterogeneous Feature Selection With Multi-Modal Deep Neural Networks and Sparse Group LASSO. IEEE Trans. Multim. 17(11): 1936-1948 (2015) - [c62]Qingju Liu, Wenwu Wang, Philip J. B. Jackson, Trevor J. Cox:
A source separation evaluation method in object-based spatial audio. EUSIPCO 2015: 1088-1092 - [c61]Pengming Feng, Miao Yu, Syed Mohsen Naqvi, Wenwu Wang, Jonathon A. Chambers:
A Robust student's-t distribution PHD filter with OCSVM updating for multiple human tracking. EUSIPCO 2015: 2396-2400 - [c60]Hatem Deif, Wenwu Wang, Lu Gan, Saadat Alhashmi:
A local discontinuity based approach for monaural singing voice separation from accompanying music with multi-stage non-negative matrix factorization. GlobalSIP 2015: 93-97 - [c59]Jian Guan, Jing Dong, Xuan Wang, Wenwu Wang:
A Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling. LVA/ICA 2015: 211-218 - [c58]Luca Remaggi, Philip J. B. Jackson, Wenwu Wang, Jonathon A. Chambers:
A 3D model for room boundary estimation. ICASSP 2015: 514-518 - [c57]Qingju Liu, Teofilo de Campos, Wenwu Wang, Philip J. B. Jackson, Adrian Hilton:
Person Tracking Using Audio and Depth Cues. ICCV Workshops 2015: 709-717 - [c56]Yang Yu, Wenwu Wang, Jian Luo, Pengming Feng:
Localization based stereo speech separation using deep networks. DSP 2015: 153-157 - [c55]Jing Dong, Wenwu Wang, Jonathon A. Chambers:
Audio super-resolution using analysis dictionary learning. DSP 2015: 604-608 - [c54]Pengming Feng, Wenwu Wang, Syed Mohsen Naqvi, Jonathon A. Chambers:
A robust PHD filter with deep learning updating for multiple human tracking. DSP 2015: 1227-1231 - [c53]Volkan Kilic, Mark Barnard, Wenwu Wang, Adrian Hilton, Josef Kittler:
Audio informed visual speaker tracking with SMC-PHD filter. ICME 2015: 1-6 - [c52]Hatem Deif, Derry Fitzgerald, Wenwu Wang, Lu Gan:
Separation of vocals from monaural music recordings using diagonal median filters and practical time-frequency parameters. ISSPIT 2015: 163-167 - [c51]Shahrzad Shapoori, Saeid Sanei, Wenwu Wang:
Blind source separation of medial temporal discharges via partial dictionary learning. MLSP 2015: 1-5 - [c50]Shahrzad Shapoori, Saeid Sanei, Wenwu Wang:
A novel approach for detection of medial temporal discharges using blind source separation incorporating dictionary look up. NER 2015: 894-897 - 2014
- [j22]Fanglin Gu, Hang Zhang, Wenwu Wang, Desheng Zhu:
PARAFAC-Based Blind Identification of Underdetermined Mixtures Using Gaussian Mixture Model. Circuits Syst. Signal Process. 33(6): 1841-1857 (2014) - [j21]Bertrand Rivet, Wenwu Wang, Syed M. Naqvi, Jonathon A. Chambers:
Audiovisual Speech Source Separation: An overview of key methodologies. IEEE Signal Process. Mag. 31(3): 125-134 (2014) - [j20]Atiyeh Alinaghi, Philip J. B. Jackson, Qingju Liu, Wenwu Wang:
Joint Mixing Vector and Binaural Model Based Stereo Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 22(9): 1434-1448 (2014) - [j19]Mark Barnard, Peter Koniusz, Wenwu Wang, Josef Kittler, Syed Mohsen Naqvi, Jonathon A. Chambers:
Robust Multi-Speaker Tracking via Dictionary Learning and Identity Modeling. IEEE Trans. Multim. 16(3): 864-880 (2014) - [j18]Qingju Liu, Andrew J. Aubrey, Wenwu Wang:
Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection. IEEE Trans. Multim. 16(6): 1610-1623 (2014) - [c49]Volkan Kilic, Xionghu Zhong, Mark Barnard, Wenwu Wang, Josef Kittler:
Audio-visual tracking of a variable number of speakers with a random finite set approach. FUSION 2014: 1-7 - [c48]Xionghu Zhong, Wenwu Wang, Syed Mohsen Naqvi, Engsiong Chng:
A Bayesian performance bound for time-delay of arrival based acoustic source tracking in a reverberant environment. FUSION 2014: 1-8 - [c47]Syed Zubair, Wenwu Wang, Jonathon A. Chambers:
Discriminative tensor dictionaries and sparsity for speaker identification. HSCMA 2014: 37-41 - [c46]Jing Dong, Wenwu Wang, Wei Dai:
Analysis SimCO: A new algorithm for analysis dictionary learning. ICASSP 2014: 7193-7197 - [c45]Syed Zubair, Wenwu Wang:
Signal classification based on block-sparse tensor representation. DSP 2014: 361-365 - [c44]Jing Dong, Wenwu Wang:
Analysis dictionary learning based on Nesterov's gradient with application to SAR image despeckling. ISCCSP 2014: 501-504 - [c43]Swati Chandna, Wenwu Wang:
Improving model-based convolutive blind source separation techniques via bootstrap. SSP 2014: 424-427 - [c42]Fanglin Gu, Wei Li, Wenwu Wang:
Fourth-order cumulant based sources number estimation from mixtures of unknown number of sources. WCSP 2014: 1-6 - 2013
- [j17]Syed Zubair, Fei Yan, Wenwu Wang:
Dictionary learning based sparse coefficients for audio classification with max and average pooling. Digit. Signal Process. 23(3): 960-970 (2013) - [j16]Fanglin Gu, Hang Zhang, Wenwu Wang, Desheng Zhu:
Generalized generating function with tucker decomposition and alternating least squares for underdetermined blind identification. EURASIP J. Adv. Signal Process. 2013: 124 (2013) - [j15]Tao Xu, Wenwu Wang, Wei Dai:
Sparse coding with adaptive dictionary learning for underdetermined blind speech separation. Speech Commun. 55(3): 432-450 (2013) - [j14]Muhammad Salman Khan, Syed M. Naqvi, Ata ur-Rehman, Wenwu Wang, Jonathon A. Chambers:
Video-Aided Model-Based Source Separation in Real Reverberant Rooms. IEEE Trans. Speech Audio Process. 21(9): 1900-1912 (2013) - [j13]Qingju Liu, Wenwu Wang, Philip J. B. Jackson, Mark Barnard, Josef Kittler, Jonathon A. Chambers:
Source Separation of Convolutive and Noisy Mixtures Using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking. IEEE Trans. Signal Process. 61(22): 5520-5535 (2013) - [c41]Qingju Liu, Wenwu Wang:
Show-through removal for scanned images using non-linear NMF with adaptive smoothing. ChinaSIP 2013: 650-654 - [c40]Volkan Kilic, Mark Barnard, Wenwu Wang, Josef Kittler:
Adaptive particle filtering approach to audio-visual tracking. EUSIPCO 2013: 1-5 - [c39]Ye Zhang, Haolong Wang, Tenglong Yu, Wenwu Wang:
Subset pursuit for analysis dictionary learning. EUSIPCO 2013: 1-5 - [c38]Xionghu Zhong, Xiaoyi Chen, Wenwu Wang, Atiyeh Alinaghi, A. Benjamin Premkumar:
Acoustic vector sensor based reverberant speech separation with probabilistic time-frequency masking. EUSIPCO 2013: 1-5 - [c37]Xionghu Zhong, Arash Mohammadi, Wenwu Wang, A. Benjamin Premkumar, Amir Asif:
Acoustic source tracking in a reverberant environment using a pairwise synchronous microphone network. FUSION 2013: 953-960 - [c36]Mark Barnard, Wenwu Wang, Josef Kittler, Syed Mohsen Naqvi, Jonathon A. Chambers:
Audio-visual face detection for tracking in a meeting room environment. FUSION 2013: 1222-1227 - [c35]Atiyeh Alinaghi, Wenwu Wang, Philip J. B. Jackson:
Spatial and coherence cues based time-frequency masking for binaural reverberant speech separation. ICASSP 2013: 684-688 - [c34]Volkan Kilic, Mark Barnard, Wenwu Wang, Josef Kittler:
Audio constrained particle filter based visual tracking. ICASSP 2013: 3627-3631 - [c33]Mark Barnard, Wenwu Wang, Josef Kittler:
Audio head pose estimation using the direct to reverberant speech ratio. ICASSP 2013: 8056-8060 - [c32]Xiaoyi Chen, Atiyeh Alinaghi, Xionghu Zhong, Wenwu Wang:
Acoustic vector sensor based speech source separation with mixed Gaussian-Laplacian distributions. DSP 2013: 1-5 - [c31]Xiaochen Zhao, Guangyu Zhou, Wei Dai, Tao Xu, Wenwu Wang:
Joint image separation and dictionary learning. DSP 2013: 1-6 - [c30]Syed Zubair, Wenwu Wang:
Tensor dictionary learning with sparse TUCKER decomposition. DSP 2013: 1-6 - [c29]Shahrzad Shapoori, Wenwu Wang, Saeid Sanei:
A constrained approach for extraction of pre-ictal discharges from scalp EEG. MLSP 2013: 1-5 - [c28]Ye Zhang, Haolong Wang, Wenwu Wang, Saeid Sanei:
K-plane clustering algorithm for analysis dictionary learning. MLSP 2013: 1-4 - 2012
- [j12]Syed Mohsen Naqvi, Wenwu Wang, Muhammad Salman Khan, Mark Barnard, Jonathon A. Chambers:
Multimodal (audio-visual) source separation exploiting multi-speaker tracking, robust beamforming and time-frequency masking. IET Signal Process. 6(5): 466-477 (2012) - [j11]Qingju Liu, Wenwu Wang, Philip J. B. Jackson:
Use of bimodal coherence to resolve the permutation problem in convolutive BSS. Signal Process. 92(8): 1916-1927 (2012) - [j10]Wei Dai, Tao Xu, Wenwu Wang:
Simultaneous Codeword Optimization (SimCO) for Dictionary Update and Learning. IEEE Trans. Signal Process. 60(12): 6340-6353 (2012) - [c27]Tariqullah Jan, Wenwu Wang:
Blind reverberation time estimation based on Laplace distribution. EUSIPCO 2012: 2050-2054 - [c26]Tariqullah Jan, Wenwu Wang:
Joint blind dereverberation and separation of speech mixtures. EUSIPCO 2012: 2343-2347 - [c25]Mark Barnard, Wenwu Wang, Josef Kittler, Syed Mohsen Naqvi, Jonathon A. Chambers:
A dictionary learning approach to tracking. ICASSP 2012: 981-984 - [c24]Wei Dai, Tao Xu, Wenwu Wang:
Dictionary learning and update based on simultaneous codeword optimization (SimCO). ICASSP 2012: 2037-2040 - [c23]Qingju Liu, Wenwu Wang, Philip J. B. Jackson, Mark Barnard:
Reverberant speech separation based on audio-visual dictionary learning and binaural cues. SSP 2012: 664-667 - 2011
- [j9]Tariqullah Jan, Wenwu Wang, DeLiang Wang:
A multistage approach to blind separation of convolutive speech mixtures. Speech Commun. 53(4): 524-539 (2011) - [c22]Wei Dai, Tao Xu, Wenwu Wang:
Simultaneous codeword optimization (SimCO) for dictionary learning. Allerton 2011: 920-927 - [c21]Tariqullah Jan, Wenwu Wang:
Empirical mode decomposition for joint denoising and dereverberation. EUSIPCO 2011: 206-210 - [c20]Syed Mohsen Naqvi, Muhammad Salman Khan, Qingju Liu, Wenwu Wang, Jonathon A. Chambers:
Multimodal blind source separation with a circular microphone array and robust beamforming. EUSIPCO 2011: 1050-1054 - [c19]Qingju Liu, Syed Mohsen Naqvi, Wenwu Wang, Philip J. B. Jackson, Jonathon A. Chambers:
Robust feature selection for scaling ambiguity reduction in audio-visual convolutive BSS. EUSIPCO 2011: 1060-1064 - [c18]Atiyeh Alinaghi, Wenwu Wang, Philip J. B. Jackson:
Integrating binaural cues and blind source separation method for separating reverberant speech mixtures. ICASSP 2011: 209-212 - [c17]Qingju Liu, Wenwu Wang:
Blind source separation and visual voice activity detection for target speech extraction. iCAST 2011: 457-460 - [c16]Tao Xu, Wenwu Wang:
Methods for learning adaptive dictionary in underdetermined speech separation. MLSP 2011: 1-6 - [i1]Wei Dai, Tao Xu, Wenwu Wang:
Simultaneous Codeword Optimization (SimCO) for Dictionary Update and Learning. CoRR abs/1109.5302 (2011) - 2010
- [c15]Wenwu Wang, Hafiz Mustafa:
Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification. CMMR 2010: 84-101 - [c14]Qingju Liu, Wenwu Wang, Philip J. B. Jackson:
Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS. LVA/ICA 2010: 131-139 - [c13]Tao Xu, Wenwu Wang:
A block-based compressed sensing method for underdetermined blind speech separation incorporating binary mask. ICASSP 2010: 2022-2025 - [c12]Qingju Liu, Wenwu Wang, Philip J. B. Jackson:
Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement. INTERSPEECH 2010: 438-441
2000 – 2009
- 2009
- [j8]Wenwu Wang, Andrzej Cichocki, Jonathon A. Chambers:
A multiplicative algorithm for convolutive non-negative matrix factorization based on squared Euclidean distance. IEEE Trans. Signal Process. 57(7): 2858-2864 (2009) - [c11]Tariqullah Jan, Wenwu Wang, DeLiang Wang:
A multistage approach for blind separation of convolutive speech mixtures. ICASSP 2009: 1713-1716 - 2008
- [j7]Andrzej Cichocki, Morten Mørup, Paris Smaragdis, Wenwu Wang, Rafal Zdunek:
Advances in Nonnegative Matrix and Tensor Factorization. Comput. Intell. Neurosci. 2008 (2008) - [j6]Wenwu Wang, Yuhui Luo, Jonathon A. Chambers, Saeid Sanei:
Note Onset Detection via Nonnegative Factorization of Magnitude Spectrum. EURASIP J. Adv. Signal Process. 2008 (2008) - [c10]Wenwu Wang:
Convolutive non-negative sparse coding. IJCNN 2008: 3681-3684 - 2007
- [c9]Yonggang Zhang, Jonathon A. Chambers, Wenwu Wang, Paul Kendrick, Trevor J. Cox:
A New Variable Step-Size LMS Algorithm with Robustness to Nonstationary Noise. ICASSP (3) 2007: 1349-1352 - 2006
- [j5]Maria G. Jafari, Wenwu Wang, Jonathon A. Chambers, Tetsuya Hoya, Andrzej Cichocki:
Sequential blind source separation based exclusively on second-order statistics developed for a class of periodic signals. IEEE Trans. Signal Process. 54(3): 1028-1040 (2006) - [j4]Yuhui Luo, Wenwu Wang, Jonathon A. Chambers, Sangarapillai Lambotharan, Ian K. Proudler:
Exploitation of source nonstationarity in underdetermined blind source separation with advanced clustering techniques. IEEE Trans. Signal Process. 54(6-1): 2198-2212 (2006) - 2005
- [j3]Leor Shoker, Saeid Sanei, Wenwu Wang, Jonathon A. Chambers:
Removal of eye blinking artifact from the electro-encephalogram, incorporating a new constrained blind source separation algorithm. Medical Biol. Eng. Comput. 43(2): 290-295 (2005) - [j2]Lianxi Yuan, Wenwu Wang, Jonathon A. Chambers:
Variable step-size sign natural gradient algorithm for sequential blind source separation. IEEE Signal Process. Lett. 12(8): 589-592 (2005) - [j1]Wenwu Wang, Saeid Sanei, Jonathon A. Chambers:
Penalty function-based joint diagonalization approach for convolutive blind separation of nonstationary sources. IEEE Trans. Signal Process. 53(5): 1654-1669 (2005) - [c8]Wenwu Wang, Darren Cosker, Yulia Hicks, Saeid Sanei, Jonathon A. Chambers:
Video assisted speech source separation. ICASSP (5) 2005: 425-428 - [c7]Lianxi Yuan, Enfang Sang, Wenwu Wang, Jonathon A. Chambers:
An Effective Method to Improve Convergence for Sequential Blind Source Separation. ICNC (1) 2005: 199-208 - 2004
- [c6]Wenwu Wang, Jonathon A. Chambers, Saeid Sanei:
Penalty function based joint diagonalization approach for convolutive constrained BSS of nonstationary signals. EUSIPCO 2004: 1701-1704 - [c5]Saeid Sanei, Loukianos Spyrou, Wenwu Wang, Jonathon A. Chambers:
Localization of P300 Sources in Schizophrenia Patients Using Constrained BSS. ICA 2004: 177-184 - [c4]Wenwu Wang, Jonathon A. Chambers, Saeid Sanei:
A Novel Hybrid Approach to the Permutation Problem of Frequency Domain Blind Source Separation. ICA 2004: 532-539 - [c3]Wenwu Wang, Jonathon A. Chambers, Saeid Sanei:
Penalty Function Approach for Constrained Convolutive Blind Source Separation. ICA 2004: 661-668 - [c2]Saeid Sanei, Wenwu Wang, Jonathon A. Chambers:
A coupled HMM for solving the permutation problem in frequency domain BSS. ICASSP (5) 2004: 565-568 - 2003
- [c1]Wenwu Wang, Maria G. Jafari, Saeid Sanei, Jonathon A. Chambers:
Blind separation of convolutive mixtures of cyclostationary sources using an extended natural gradient method. ISSPA (2) 2003: 93-96
last updated on 2024-12-26 01:53 CET by the dblp team
all metadata released as open data under CC0 1.0 license