default search action
Mengyue Wu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j5]Xuenan Xu, Zeyu Xie, Mengyue Wu, Kai Yu:
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 95-112 (2024) - [j4]Xuenan Xu, Ziyang Ma, Mengyue Wu, Kai Yu:
Towards Weakly Supervised Text-to-Audio Grounding. IEEE Trans. Multim. 26: 11126-11138 (2024) - [c41]Pingyue Zhang, Mengyue Wu:
Multi-Label Supervised Contrastive Learning. AAAI 2024: 16786-16793 - [c40]Zhige Huang, Haoan Jin, Mengyue Wu, Kenny Q. Zhu:
Automatic Reconstruction of Ancient Chinese Pronunciations. EMNLP (Findings) 2024: 5689-5698 - [c39]Theron Wang, Xingyuan Li, Chunhao Zhang, Mengyue Wu, Kenny Q. Zhu:
Phonetic and Lexical Discovery of Canine Vocalization. EMNLP (Findings) 2024: 13972-13983 - [c38]Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Enhancing Audio Generation Diversity with Visual Information. ICASSP 2024: 866-870 - [c37]Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu:
A Detailed Audio-Text Data Simulation Pipeline Using Single-Event Sounds. ICASSP 2024: 1091-1095 - [c36]Pingyue Zhang, Mengyue Wu, Kai Yu:
Semantic-Enhanced Supervised Contrastive Learning. ICASSP 2024: 6030-6034 - [c35]Xuenan Xu, Arshdeep Singh, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning. MLSP 2024: 1-6 - [c34]Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie:
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning. ACM Multimedia 2024: 5025-5034 - [c33]Siyuan Chen, Meilin Wang, Minghao Lv, Zhiling Zhang, Juqianqian Juqianqian, Dejiyangla Dejiyangla, Yujia Peng, Kenny Q. Zhu, Mengyue Wu:
Mapping Long-term Causalities in Psychiatric Symptomatology and Life Events from Social Media. NAACL-HLT 2024: 5472-5487 - [i49]Xuenan Xu, Ziyang Ma, Mengyue Wu, Kai Yu:
Towards Weakly Supervised Text-to-Audio Grounding. CoRR abs/2401.02584 (2024) - [i48]Xingyuan Li, Sinong Wang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
Phonetic and Lexical Discovery of a Canine Language using HuBERT. CoRR abs/2402.15985 (2024) - [i47]Xiujie Song, Mengyue Wu, Kenny Q. Zhu, Chunhao Zhang, Yanyi Chen:
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models. CoRR abs/2402.18409 (2024) - [i46]Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Enhancing Audio Generation Diversity with Visual Information. CoRR abs/2403.01278 (2024) - [i45]Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu:
A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds. CoRR abs/2403.04594 (2024) - [i44]Kunyao Lan, Cong Ming, Binwei Yao, Lu Chen, Mengyue Wu:
Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats. CoRR abs/2404.05012 (2024) - [i43]Haohe Liu, Xuenan Xu, Yi Yuan, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound. CoRR abs/2405.00233 (2024) - [i42]Yufei Wang, Mengyue Wu:
Evaluation of data inconsistency for multi-modal sentiment analysis. CoRR abs/2406.03004 (2024) - [i41]Zeyu Xie, Baihan Li, Xuenan Xu, Zheng Liang, Kai Yu, Mengyue Wu:
FakeSound: Deepfake General Audio Detection. CoRR abs/2406.08052 (2024) - [i40]Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
AudioTime: A Temporally-aligned Audio-text Benchmark Dataset. CoRR abs/2407.02857 (2024) - [i39]Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation. CoRR abs/2407.02869 (2024) - [i38]Baihan Li, Zeyu Xie, Xuenan Xu, Yiwei Guo, Ming Yan, Ji Zhang, Kai Yu, Mengyue Wu:
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation. CoRR abs/2407.13198 (2024) - [i37]Xuenan Xu, Haohe Liu, Mengyue Wu, Wenwu Wang, Mark D. Plumbley:
Efficient Audio Captioning with Encoder-Level Knowledge Distillation. CoRR abs/2407.14329 (2024) - [i36]Xuenan Xu, Pingyue Zhang, Ming Yan, Ji Zhang, Mengyue Wu:
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models. CoRR abs/2407.14355 (2024) - [i35]Kunyao Lan, Bingui Jin, Zichen Zhu, Siyuan Chen, Shu Zhang, Kenny Q. Zhu, Mengyue Wu:
Depression Diagnosis Dialogue Simulation: Self-improving Psychiatrist with Tertiary Memory. CoRR abs/2409.15084 (2024) - [i34]Siyuan Chen, Cong Ming, Zhiling Zhang, Yanyi Chen, Kenny Q. Zhu, Mengyue Wu:
Mixed Chain-of-Psychotherapies for Emotional Support Chatbot. CoRR abs/2409.19533 (2024) - [i33]Xun Jiang, Feng Li, Han Zhao, Jiaying Wang, Jun Shao, Shihao Xu, Shu Zhang, Weiling Chen, Xavier Tang, Yize Chen, Mengyue Wu, Weizhi Ma, Mengdi Wang, Tianqiao Chen:
Long Term Memory: The Foundation of AI Self-Evolution. CoRR abs/2410.15665 (2024) - 2023
- [j3]Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu:
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue. Trans. Assoc. Comput. Linguistics 11: 68-84 (2023) - [c32]Jieyi Huang, Chunhao Zhang, Mengyue Wu, Kenny Q. Zhu:
Transcribing Vocal Communications of Domestic Shiba lnu Dogs. ACL (Findings) 2023: 13819-13832 - [c31]Siyuan Chen, Zhiling Zhang, Mengyue Wu, Kenny Q. Zhu:
Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts. EMNLP 2023: 9071-9084 - [c30]Zhiling Zhang, Mengyue Wu, Kenny Q. Zhu:
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation. EMNLP 2023: 13230-13243 - [c29]Guangwei Li, Xuenan Xu, Lingfeng Dai, Mengyue Wu, Kai Yu:
Diverse and Vivid Sound Generation from Text Descriptions. ICASSP 2023: 1-5 - [c28]Xuenan Xu, Mengyue Wu, Kai Yu:
Investigating Pooling Strategies and Loss Functions for Weakly-Supervised Text-to-Audio Grounding via Contrastive Learning. ICASSP Workshops 2023: 1-5 - [c27]Pingyue Zhang, Mengyue Wu, Kai Yu:
ReCLR: Reference-Enhanced Contrastive Learning of Audio Representation for Depression Detection. INTERSPEECH 2023: 2998-3002 - [c26]Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu:
Enhance Temporal Relations in Audio Captioning with Sound Event Detection. INTERSPEECH 2023: 4179-4183 - [c25]Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. ACM Multimedia 2023: 2756-2764 - [i32]Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. CoRR abs/2303.07902 (2023) - [i31]Guangwei Li, Xuenan Xu, Lingfeng Dai, Mengyue Wu, Kai Yu:
Diverse and Vivid Sound Generation from Text Descriptions. CoRR abs/2305.01980 (2023) - [i30]Zhiling Zhang, Mengyue Wu, Kenny Q. Zhu:
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation. CoRR abs/2305.02820 (2023) - [i29]Siyuan Chen, Mengyue Wu, Kenny Q. Zhu, Kunyao Lan, Zhiling Zhang, Lyuchun Cui:
LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation. CoRR abs/2305.13614 (2023) - [i28]Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu:
Enhance Temporal Relations in Audio Captioning with Sound Event Detection. CoRR abs/2306.01533 (2023) - [i27]Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu:
Improving Audio Caption Fluency with Automatic Error Correction. CoRR abs/2306.10090 (2023) - [i26]Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie:
A Large-scale Dataset for Audio-Language Representation Learning. CoRR abs/2309.11500 (2023) - [i25]Jieyi Huang, Chunhao Zhang, Yufei Wang, Mengyue Wu, Kenny Q. Zhu:
Does My Dog "Speak" Like Me? The Acoustic Correlation between Pet Dogs and Their Human Owners. CoRR abs/2309.13085 (2023) - [i24]Yufei Wang, Chunhao Zhang, Jieyi Huang, Mengyue Wu, Kenny Q. Zhu:
Towards Lexical Analysis of Dog Vocalizations via Online Videos. CoRR abs/2309.13086 (2023) - [i23]Haoan Jin, Siyuan Chen, Mengyue Wu, Kenny Q. Zhu:
PsyEval: A Comprehensive Large Language Model Evaluation Benchmark for Mental Health. CoRR abs/2311.09189 (2023) - 2022
- [c24]Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu:
D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat. EMNLP 2022: 2438-2459 - [c23]Zhiling Zhang, Siyuan Chen, Mengyue Wu, Kenny Q. Zhu:
Symptom Identification for Interpretable Detection of Multiple Mental Disorders on Social Media. EMNLP 2022: 9970-9985 - [c22]Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Category-Adapted Sound Event Enhancement with Weakly Labeled Data. ICASSP 2022: 851-855 - [c21]Xuenan Xu, Mengyue Wu, Kai Yu:
Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. ICASSP 2022: 971-975 - [c20]Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
Can Audio Captions Be Evaluated With Image Caption Metrics? ICASSP 2022: 981-985 - [c19]Guangwei Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Navigating Audio-Visual Event Detection Across Mismatched Modalities. ICASSP 2022: 1975-1979 - [c18]Siyu Lou, Xuenan Xu, Mengyue Wu, Kai Yu:
Audio-Text Retrieval in Context. ICASSP 2022: 4793-4797 - [c17]Wen Wu, Mengyue Wu, Kai Yu:
Climate and Weather: Inspecting Depression Detection via Emotion Recognition. ICASSP 2022: 6262-6266 - [c16]Zhiling Zhang, Siyuan Chen, Mengyue Wu, Kenny Q. Zhu:
Psychiatric Scale Guided Risky Post Screening for Early Detection of Depression. IJCAI 2022: 5220-5226 - [i22]Siyu Lou, Xuenan Xu, Mengyue Wu, Kai Yu:
Audio-text Retrieval in Context. CoRR abs/2203.13645 (2022) - [i21]Wen Wu, Mengyue Wu, Kai Yu:
Climate and Weather: Inspecting Depression Detection via Emotion Recognition. CoRR abs/2204.14099 (2022) - [i20]Xuenan Xu, Mengyue Wu, Kai Yu:
A Comprehensive Survey of Automated Audio Captioning. CoRR abs/2205.05357 (2022) - [i19]Zhiling Zhang, Siyuan Chen, Mengyue Wu, Kenny Q. Zhu:
Psychiatric Scale Guided Risky Post Screening for Early Detection of Depression. CoRR abs/2205.09497 (2022) - [i18]Zhiling Zhang, Siyuan Chen, Mengyue Wu, Kenny Q. Zhu:
Symptom Identification for Interpretable Detection of Multiple Mental Disorders. CoRR abs/2205.11308 (2022) - [i17]Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu:
D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat. CoRR abs/2205.11764 (2022) - [i16]Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Jian-Guang Lou, Kai Yu:
DialogZoo: Large-Scale Dialog-Oriented Task Learning. CoRR abs/2205.12662 (2022) - [i15]Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu:
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue. CoRR abs/2209.04595 (2022) - 2021
- [j2]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Towards Duration Robust Weakly Supervised Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 887-900 (2021) - [j1]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021) - [c15]Die Zhang, Hao Zhang, Huilin Zhou, Xiaoyi Bao, Da Huo, Ruizhao Chen, Xu Cheng, Mengyue Wu, Quanshi Zhang:
Building Interpretable Interaction Trees for Deep NLP Models. AAAI 2021: 14328-14337 - [c14]Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu:
Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL. ACL/IJCNLP (Findings) 2021: 3063-3074 - [c13]Zhiling Zhang, Zelin Zhou, Haifeng Tang, Guangwei Li, Mengyue Wu, Kenny Q. Zhu:
Enriching Ontology with Temporal Commonsense for Low-Resource Audio Tagging. CIKM 2021: 3652-3656 - [c12]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. ICASSP 2021: 606-610 - [c11]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. ICASSP 2021: 905-909 - [c10]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A Lightweight Framework for Online Voice Activity Detection in the Wild. Interspeech 2021: 371-375 - [c9]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Audio Caption in a Car Setting with a Sentence-Level Loss. ISCSLP 2021: 1-5 - [c8]Pingyue Zhang, Mengyue Wu, Heinrich Dinkel, Kai Yu:
DEPA: Self-Supervised Audio Embedding for Depression Detection. ACM Multimedia 2021: 135-143 - [i14]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Towards duration robust weakly supervised sound event detection. CoRR abs/2101.07687 (2021) - [i13]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. CoRR abs/2102.11457 (2021) - [i12]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. CoRR abs/2102.11474 (2021) - [i11]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021) - [i10]Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu:
Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL. CoRR abs/2106.02282 (2021) - [i9]Zhiling Zhang, Zelin Zhou, Haifeng Tang, Guangwei Li, Mengyue Wu, Kenny Q. Zhu:
Enriching Ontology with Temporal Commonsense for Low-Resource Audio Tagging. CoRR abs/2110.01009 (2021) - [i8]Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu:
Can Audio Captions Be Evaluated with Image Caption Metrics? CoRR abs/2110.04684 (2021) - 2020
- [c7]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning. DCASE 2020: 225-229 - [c6]Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. ECCV (20) 2020: 292-308 - [c5]Yefei Chen, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild via Weakly Supervised Sound Event Detection. INTERSPEECH 2020: 3665-3669 - [i7]Heinrich Dinkel, Yefei Chen, Mengyue Wu, Kai Yu:
GPVAD: Towards noise robust voice activity detection via weakly supervised sound event detection. CoRR abs/2003.12222 (2020) - [i6]Die Zhang, Huilin Zhou, Xiaoyi Bao, Da Huo, Ruizhao Chen, Xu Cheng, Hao Zhang, Mengyue Wu, Quanshi Zhang:
Interpreting Hierarchical Linguistic Interactions in DNNs. CoRR abs/2007.04298 (2020) - [i5]Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. CoRR abs/2007.06355 (2020)
2010 – 2019
- 2019
- [c4]Mengyue Wu, Heinrich Dinkel, Kai Yu:
Audio Caption: Listen and Tell. ICASSP 2019: 830-834 - [i4]Mengyue Wu, Heinrich Dinkel, Kai Yu:
Audio Caption: Listen and Tell. CoRR abs/1902.09254 (2019) - [i3]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-based Depression Detection: What Triggers An Alert. CoRR abs/1904.05154 (2019) - [i2]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
What does a Car-ssette tape tell? CoRR abs/1905.13448 (2019) - [i1]Heinrich Dinkel, Pingyue Zhang, Mengyue Wu, Kai Yu:
Depa: Self-supervised audio embedding for depression detection. CoRR abs/1910.13028 (2019) - 2018
- [c3]Mengyue Wu, Huabo Sun, Chun Wang, Binbin Lu:
Detecting and Analysing Spatial-Temporal Aggregation of Flight Turbulence with the QAR Big Data. Geoinformatics 2018: 1-6 - 2015
- [c2]Mengyue Wu, Rikke L. Bundgaard-Nielsen, Brett Baker, Catherine T. Best, Janet Fletcher:
Perception of Cantonese tones by Mandarin speakers. ICPhS 2015
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-23 20:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint