default search action
Siqi Zheng
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Chongyang Wang, Yuan Feng, Lingxiao Zhong, Siyi Zhu, Chi Zhang, Siqi Zheng, Chen Liang, Yuntao Wang, Chengqi He, Chun Yu, Yuanchun Shi:
UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 8(1): 20:1-20:27 (2024) - [j3]Xiaofan Liang, César A. Hidalgo, Pierre-Alexandre Balland, Siqi Zheng, Jianghao Wang:
Intercity connectivity and urban innovation. Comput. Environ. Urban Syst. 109: 102092 (2024) - [c29]Chongyang Wang, Siqi Zheng, Lingxiao Zhong, Chun Yu, Chen Liang, Yuntao Wang, Yuan Gao, Tin Lun Lam, Yuanchun Shi:
PepperPose: Full-Body Pose Estimation with a Companion Robot. CHI 2024: 586:1-586:16 - [c28]Zhihao Du, Shiliang Zhang, Kai Hu, Siqi Zheng:
FunCodec: A Fundamental, Reproducible and Integrable Open-Source Toolkit for Neural Speech Codec. ICASSP 2024: 591-595 - [c27]Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Yukun Ma, Hai Yu, Jiaqing Liu, Chong Zhang:
Loss Masking Is Not Needed In Decoder-Only Transformer For Discrete-Token-Based ASR. ICASSP 2024: 11056-11060 - [c26]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps. ACM Multimedia 2024: 7008-7017 - [c25]Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639 - [i40]Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen:
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity. CoRR abs/2402.08846 (2024) - [i39]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Text-to-Audio Generation with Latent Consistency Models. CoRR abs/2406.00356 (2024) - [i38]Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024) - [i37]Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang:
Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers. CoRR abs/2406.11274 (2024) - [i36]Ruiqi Li, Zhiqing Hong, Yongqi Wang, Lichao Zhang, Rongjie Huang, Siqi Zheng, Zhou Zhao:
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody. CoRR abs/2407.02049 (2024) - [i35]Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng:
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs. CoRR abs/2407.04051 (2024) - [i34]Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, Zhijie Yan:
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens. CoRR abs/2407.05407 (2024) - [i33]Luyao Cheng, Hui Wang, Siqi Zheng, Yafeng Chen, Rongjie Huang, Qinglin Zhang, Qian Chen, Xihao Li:
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization. CoRR abs/2408.12102 (2024) - [i32]Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024) - [i31]Han Yin, Jisheng Bai, Yang Xiao, Hui Wang, Siqi Zheng, Yafeng Chen, Rohan Kumar Das, Chong Deng, Jianfeng Chen:
Exploring Text-Queried Sound Event Detection with Audio Source Separation. CoRR abs/2409.13292 (2024) - [i30]Ruiqi Li, Siqi Zheng, Xize Cheng, Ziang Zhang, Shengpeng Ji, Zhou Zhao:
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization. CoRR abs/2410.12957 (2024) - [i29]Qinglin Zhang, Luyao Cheng, Chong Deng, Qian Chen, Wen Wang, Siqi Zheng, Jiaqing Liu, Hai Yu, Chaohong Tan:
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation. CoRR abs/2410.17799 (2024) - [i28]Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024) - 2023
- [c24]Jinglin Liu, Zhenhui Ye, Qian Chen, Siqi Zheng, Wen Wang, Qinglin Zhang, Zhou Zhao:
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect. ACL (Findings) 2023: 11905-11912 - [c23]Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen:
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization. ACL (Findings) 2023: 14068-14077 - [c22]Siqi Zheng, Ge Lv:
A Two-Layer Human-in-the-Loop Optimization Framework for Customizing Lower-Limb Exoskeleton Assistance. ACC 2023: 3913-3920 - [c21]Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang:
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings. EMNLP 2023: 5868-5875 - [c20]Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen:
Pushing the Limits of Self-Supervised Speaker Verification using Regularized Distillation Framework. ICASSP 2023: 1-5 - [c19]Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Jiajun Qi:
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification. INTERSPEECH 2023: 2228-2232 - [c18]Hui Wang, Siqi Zheng, Yafeng Chen, Luyao Cheng, Qian Chen:
CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking. INTERSPEECH 2023: 5301-5305 - [i27]Hui Wang, Siqi Zheng, Yafeng Chen, Luyao Cheng, Qian Chen:
CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking. CoRR abs/2303.00332 (2023) - [i26]Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang:
Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings. CoRR abs/2305.10786 (2023) - [i25]Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Jiajun Qi:
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification. CoRR abs/2305.12838 (2023) - [i24]Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen:
Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization. CoRR abs/2305.12927 (2023) - [i23]Siqi Zheng, Luyao Cheng, Yafeng Chen, Hui Wang, Qian Chen:
3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement. CoRR abs/2306.15354 (2023) - [i22]Qian Chen, Wen Wang, Qinglin Zhang, Chong Deng, Yukun Ma, Siqi Zheng:
Improving BERT with Hybrid Pooling Network and Drop Mask. CoRR abs/2307.07258 (2023) - [i21]Yafeng Chen, Siqi Zheng, Qian Chen:
Self-Distillation Network with Ensemble Prototypes: Learning Robust Speaker Representations without Supervision. CoRR abs/2308.02774 (2023) - [i20]Chongyang Wang, Yuan Feng, Lingxiao Zhong, Siyi Zhu, Chi Zhang, Siqi Zheng, Chen Liang, Yuntao Wang, Chengqi He, Chun Yu, Yuanchun Shi:
UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language. CoRR abs/2308.10526 (2023) - [i19]Zhihao Du, Shiliang Zhang, Kai Hu, Siqi Zheng:
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec. CoRR abs/2309.07405 (2023) - [i18]Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen, Shiliang Zhang:
Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation. CoRR abs/2309.10456 (2023) - [i17]Jiaming Wang, Zhihao Du, Qian Chen, Yunfei Chu, Zhifu Gao, Zerui Li, Kai Hu, Xiaohuan Zhou, Jin Xu, Ziyang Ma, Wen Wang, Siqi Zheng, Chang Zhou, Zhijie Yan, Shiliang Zhang:
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT. CoRR abs/2310.04673 (2023) - [i16]Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Yukun Ma, Hai Yu, Jiaqing Liu, Chong Zhang:
Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR. CoRR abs/2311.04534 (2023) - 2022
- [j2]Jin Yan, Yuanyuan Chen, Jiazhu Zheng, Lin Guo, Siqi Zheng, Rongchun Zhang:
Multi-Source Time Series Remote Sensing Feature Selection and Urban Forest Extraction Based on Improved Artificial Bee Colony. Remote. Sens. 14(19): 4859 (2022) - [c17]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhi-Jie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. EMNLP 2022: 7458-7469 - [c16]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. ICASSP 2022: 6167-6171 - [c15]Fuchuan Tong, Siqi Zheng, Min Zhang, Yafeng Chen, Hongbin Suo, Qingyang Hong, Lin Li:
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data. ICASSP 2022: 6622-6626 - [c14]Siqi Zheng, Hongbin Suo:
Reformulating Speaker Diarization As Community Detection With Emphasis On Topological Structure. ICASSP 2022: 8097-8101 - [c13]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c12]Chao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling:
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences. ICLR 2022 - [c11]Siqi Zheng, Jie Zhou, Kui Meng, Gongshen Liu:
Label-Dividing Gated Graph Neural Network for Hierarchical Text Classification. IJCNN 2022: 1-8 - [c10]Siqi Zheng, Hongbin Suo, Qian Chen:
PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification. INTERSPEECH 2022: 1431-1435 - [c9]Fuchuan Tong, Siqi Zheng, Haodong Zhou, Xingjia Xie, Qingyang Hong, Lin Li:
Deep Representation Decomposition for Rate-Invariant Speaker Verification. Odyssey 2022: 228-232 - [i15]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i14]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios. CoRR abs/2203.09767 (2022) - [i13]Fuchuan Tong, Siqi Zheng, Min Zhang, Yafeng Chen, Hongbin Suo, Qingyang Hong, Lin Li:
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data. CoRR abs/2204.11501 (2022) - [i12]Siqi Zheng, Hongbin Suo:
Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure. CoRR abs/2204.12112 (2022) - [i11]Siqi Zheng, Hongbin Suo, Qian Chen:
PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification. CoRR abs/2205.07450 (2022) - [i10]Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen:
Pushing the limits of self-supervised speaker verification using regularized distillation framework. CoRR abs/2211.04168 (2022) - [i9]Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan:
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis. CoRR abs/2211.10243 (2022) - [i8]Jianhong Tu, Zeyu Cui, Xiaohuan Zhou, Siqi Zheng, Kai Hu, Ju Fan, Chang Zhou:
Contextual Expressive Text-to-Speech. CoRR abs/2211.14548 (2022) - 2021
- [c8]Ya-Qi Yu, Siqi Zheng, Hongbin Suo, Yun Lei, Wu-Jun Li:
Cam: Context-Aware Masking for Robust Speaker Verification. ICASSP 2021: 6703-6707 - [c7]Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-Time Speaker Diarization System Based on Spatial Spectrum. ICASSP 2021: 7208-7212 - [c6]Shiliang Zhang, Siqi Zheng, Weilong Huang, Ming Lei, Hongbin Suo, Jinwei Feng, Zhijie Yan:
Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings. Interspeech 2021: 3550-3554 - [i7]Da Zhang, Qingyi Wang, Shaojie Song, Simiao Chen, Mingwei Li, Lu Shen, Siqi Zheng, Bofeng Cai, Shenhao Wang:
Estimating air quality co-benefits of energy transition using machine learning. CoRR abs/2105.14318 (2021) - [i6]Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan:
A Real-time Speaker Diarization System Based on Spatial Spectrum. CoRR abs/2107.09321 (2021) - [i5]Yuchen Chai, Juan Palacios, Jianghao Wang, Yichun Fan, Siqi Zheng:
Measuring daily-life fear perception change: a computational study in the context of COVID-19. CoRR abs/2107.12606 (2021) - [i4]Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan:
BeamTransformer: Microphone Array-based Overlapping Speech Detection. CoRR abs/2109.04049 (2021) - [i3]Chao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling:
PoNet: Pooling Network for Efficient Token Mixing in Long Sequences. CoRR abs/2110.02442 (2021) - [i2]Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. CoRR abs/2110.07393 (2021) - [i1]Zhihao Du, Shiliang Zhang, Siqi Zheng, Weilong Huang, Ming Lei:
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information. CoRR abs/2111.13694 (2021) - 2020
- [c5]Siqi Zheng, Yun Lei, Hongbin Suo:
Phonetically-Aware Coupled Network For Short Duration Text-Independent Speaker Verification. INTERSPEECH 2020: 926-930
2010 – 2019
- 2019
- [j1]Jie Wang, Yuan Liu, Yanjun Liu, Siqi Zheng, Xin Wang, Jingyi Zhao, Fan Yang, Gong Zhang, Chu Wang, Peng R. Chen:
Time-resolved protein activation by proximal decaging in living systems. Nat. 569(7757): 509-513 (2019) - [c4]Guihang Guo, Ying Li, Siqi Zheng:
Factors Influencing University Students' Intention to Redeem Digital Takeaway Coupons - Analysis Based on A Survey in China. ICIT 2019: 419-425 - [c3]Siqi Zheng, Gang Liu, Hongbin Suo, Yun Lei:
Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number. INTERSPEECH 2019: 4065-4069 - [c2]Siqi Zheng, Gang Liu, Hongbin Suo, Yun Lei:
Autoencoder-Based Semi-Supervised Curriculum Learning for Out-of-Domain Speaker Verification. INTERSPEECH 2019: 4360-4364 - 2018
- [c1]Siqi Zheng, Jianzong Wang, Jing Xiao, Wei-Ning Hsu, James R. Glass:
A Noise-Robust Self-Adaptive Multitarget Speaker Detection System. ICPR 2018: 1068-1072
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-01 01:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint