default search action
Zhou Zhao
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j77]Zhiyi Yang, Zhou Zhao, Yuliang Gu, Yongchao Xu:
Query-guided generalizable medical image segmentation. Pattern Recognit. Lett. 184: 52-58 (2024) - [j76]Zhou Zhao, Wenhao He, Zhenyu Lu:
Tactile-Based Grasping Stability Prediction Based on Human Grasp Demonstration for Robot Manipulation. IEEE Robotics Autom. Lett. 9(3): 2646-2653 (2024) - [j75]Hong Nie, Zhou Zhao, Lu Chen, Zhenyu Lu, Zhuomao Li, Jing Yang:
Smaller and Faster Robotic Grasp Detection Model via Knowledge Distillation and Unequal Feature Encoding. IEEE Robotics Autom. Lett. 9(8): 7206-7213 (2024) - [j74]Zhou Zhao, Dongyuan Zheng, Lu Chen:
Detecting Transitions from Stability to Instability in Robotic Grasping Based on Tactile Perception. Sensors 24(15): 5080 (2024) - [j73]Zhenyu Lu, Zhou Zhao, Tianqi Yue, Xu Zhu, Ning Wang:
A Bioinspired Multifunctional Tendon-Driven Tactile Sensor and Application in Obstacle Avoidance Using Reinforcement Learning. IEEE Trans. Cogn. Dev. Syst. 16(2): 407-415 (2024) - [j72]Linjun Li, Tao Jin, Wang Lin, Hao Jiang, Wenwen Pan, Jian Wang, Shuwen Xiao, Yan Xia, Weihao Jiang, Zhou Zhao:
Multi-Granularity Relational Attention Network for Audio-Visual Question Answering. IEEE Trans. Circuits Syst. Video Technol. 34(8): 7080-7094 (2024) - [j71]Shengyu Zhang, Ziqi Jiang, Jiangchao Yao, Fuli Feng, Kun Kuang, Zhou Zhao, Shuo Li, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems. IEEE Trans. Knowl. Data Eng. 36(2): 459-474 (2024) - [j70]Wenwen Pan, Zhou Zhao, Wencan Huang, Zhu Zhang, Liyong Fu, Zhigeng Pan, Jun Yu, Fei Wu:
Video Moment Retrieval With Noisy Labels. IEEE Trans. Neural Networks Learn. Syst. 35(5): 6779-6791 (2024) - [j69]Shengyu Zhang, Tan Jiang, Kun Kuang, Fuli Feng, Jin Yu, Jianxin Ma, Zhou Zhao, Jianke Zhu, Hongxia Yang, Tat-Seng Chua, Fei Wu:
SLED: Structure Learning based Denoising for Recommendation. ACM Trans. Inf. Syst. 42(2): 43:1-43:31 (2024) - [c268]Yufeng Huang, Jiji Tang, Zhuo Chen, Rongsheng Zhang, Xinfeng Zhang, Weijie Chen, Zeng Zhao, Zhou Zhao, Tangjie Lv, Zhipeng Hu, Wen Zhang:
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations. AAAI 2024: 2417-2425 - [c267]Yu Zhang, Rongjie Huang, Ruiqi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. AAAI 2024: 19597-19605 - [c266]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804 - [c265]Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. ACL (Student Research Workshop) 2024: 42-49 - [c264]Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. ACL (1) 2024: 1726-1736 - [c263]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. ACL (1) 2024: 1979-1998 - [c262]Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. ACL (Findings) 2024: 4230-4242 - [c261]Tao Jin, Wang Lin, Ye Wang, Linjun Li, Xize Cheng, Zhou Zhao:
Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment. ACL (1) 2024: 5247-5265 - [c260]Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment. ACL (1) 2024: 6248-6261 - [c259]Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. ACL (1) 2024: 9751-9766 - [c258]Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. ACL (Findings) 2024: 9819-9831 - [c257]Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. ACL (Findings) 2024: 9973-9986 - [c256]Songju Lei, Xize Cheng, Mengjiao Lyu, Jianqiao Hu, Jintao Tan, Runlin Liu, Lingyu Xiong, Tao Jin, Xiandong Li, Zhou Zhao:
Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation. ACL (1) 2024: 10082-10099 - [c255]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942 - [c254]Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. ACL (1) 2024: 13588-13600 - [c253]Huadai Liu, Wenqiang Xu, Xuan Lin, Jingjing Huo, Hong Chen, Zhou Zhao:
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments. LREC/COLING 2024: 1306-1317 - [c252]Jimin Xu, Tianbao Wang, Tao Jin, Shengyu Zhang, Dongjie Fu, Zhe Wang, Jiangjing Lyu, Chengfei Lv, Chaoyue Niu, Zhou Yu, Zhou Zhao, Fei Wu:
MPOD123: One Image to 3D Content Generation Using Mask-Enhanced Progressive Outline-to-Detail Optimization. CVPR 2024: 10682-10692 - [c251]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. EMNLP 2024: 1960-1975 - [c250]Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. ICASSP 2024: 9976-9980 - [c249]Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models. ICASSP 2024: 10301-10305 - [c248]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. ICLR 2024 - [c247]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024 - [c246]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024 - [c245]Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024 - [c244]Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. ICML 2024 - [c243]Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Zhou Zhao, Xixin Wu, Helen M. Meng:
UniAudio: Towards Universal Audio Generation with Large Language Models. ICML 2024 - [c242]Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. KDD 2024: 3245-3254 - [c241]Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong:
Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey. KDD 2024: 6566-6576 - [c240]Haoyu Zhao, Wenhui Dong, Rui Yu, Zhou Zhao, Bo Du, Yongchao Xu:
MoreStyle: Relax Low-Frequency Constraint of Fourier-Based Image Reconstruction in Generalizable Medical Image Segmentation. MICCAI (8) 2024: 434-444 - [c239]Zhikai Wei, Wenhui Dong, Peilin Zhou, Yuliang Gu, Zhou Zhao, Yongchao Xu:
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation. MICCAI (8) 2024: 533-543 - [c238]Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu:
Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays. MICCAI (1) 2024: 567-577 - [c237]Zerui Zhang, Zhichao Sun, Zelong Liu, Zhou Zhao, Rui Yu, Bo Du, Yongchao Xu:
Spatial-Aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image. MICCAI (5) 2024: 638-648 - [c236]Haoyu Zhao, Yuliang Gu, Zhou Zhao, Bo Du, Yongchao Xu, Rui Yu:
WIA-LD2ND: Wavelet-Based Image Alignment for Self-supervised Low-Dose CT Denoising. MICCAI (7) 2024: 764-774 - [c235]Mengze Li, Kairong Han, Jiahe Xu, Yueying Li, Tao Wu, Zhou Zhao, Jiaxu Miao, Shengyu Zhang, Jingyuan Chen:
Cross-modal Observation Hypothesis Inference. ACM Multimedia 2024: 466-475 - [c234]Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu:
Semantic Alignment for Multimodal Large Language Models. ACM Multimedia 2024: 3489-3498 - [c233]Dongjie Fu, Xize Cheng, Xiaoda Yang, Hanting Wang, Zhou Zhao, Tao Jin:
Boosting Speech Recognition Robustness to Modality-Distortion with Contrast-Augmented Prompts. ACM Multimedia 2024: 3838-3847 - [c232]Tao Jin, Weicai Yan, Ye Wang, Sihang Cai, Qifan Shuai, Zhou Zhao:
Calibrating Prompt from History for Continual Vision-Language Retrieval and Grounding. ACM Multimedia 2024: 4302-4311 - [c231]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps. ACM Multimedia 2024: 7008-7017 - [c230]Xiaoda Yang, Xize Cheng, Dongjie Fu, Minghui Fang, Jialung Zuo, Shengpeng Ji, Zhou Zhao, Tao Jin:
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning. ACM Multimedia 2024: 8149-8158 - [c229]Weicai Yan, Ye Wang, Wang Lin, Zirun Guo, Zhou Zhao, Tao Jin:
Low-rank Prompt Interaction for Continual Vision-Language Retrieval. ACM Multimedia 2024: 8257-8266 - [c228]Zheqi Lv, Shaoxuan He, Tianyu Zhan, Shengyu Zhang, Wenqiao Zhang, Jingyuan Chen, Zhou Zhao, Fei Wu:
Semantic Codebook Learning for Dynamic Recommendation Models. ACM Multimedia 2024: 9611-9620 - [c227]Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639 - [c226]Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. NAACL-HLT 2024: 4780-4794 - [c225]Dong Yao, Jieming Zhu, Jiahao Xun, Shengyu Zhang, Zhou Zhao, Liqun Deng, Wenqiao Zhang, Zhenhua Dong, Xin Jiang:
MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer. WWW (Companion Volume) 2024: 967-970 - [i185]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024) - [i184]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. CoRR abs/2402.07729 (2024) - [i183]Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. CoRR abs/2402.09378 (2024) - [i182]Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialong Zuo, Shulei Wang, Zhou Zhao:
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models. CoRR abs/2402.12208 (2024) - [i181]Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Jieming Zhu, Zhenhua Dong, Zhou Zhao:
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment. CoRR abs/2403.05168 (2024) - [i180]Haoyu Zhao, Yuliang Gu, Zhou Zhao, Bo Du, Yongchao Xu, Rui Yu:
WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising. CoRR abs/2403.11672 (2024) - [i179]Haoyu Zhao, Wenhui Dong, Rui Yu, Zhou Zhao, Bo Du, Yongchao Xu:
MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation. CoRR abs/2403.11689 (2024) - [i178]Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. CoRR abs/2403.11780 (2024) - [i177]Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong:
Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey. CoRR abs/2404.00621 (2024) - [i176]Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment. CoRR abs/2404.09313 (2024) - [i175]Kunxi Li, Tianyu Zhan, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fei Wu:
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities. CoRR abs/2404.13322 (2024) - [i174]Bo Lin, Yingjing Xu, Xuanwen Bao, Zhou Zhao, Zuyong Zhang, Zhouyang Wang, Jie Zhang, Shuiguang Deng, Jianwei Yin:
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models. CoRR abs/2404.14755 (2024) - [i173]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024) - [i172]Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. CoRR abs/2405.06914 (2024) - [i171]Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. CoRR abs/2405.09940 (2024) - [i170]Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu:
Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays. CoRR abs/2405.11976 (2024) - [i169]Zerui Zhang, Zhichao Sun, Zelong Liu, Bo Du, Rui Yu, Zhou Zhao, Yongchao Xu:
Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image. CoRR abs/2405.12872 (2024) - [i168]Shengyu Zhang, Ziqi Jiang, Jiangchao Yao, Fuli Feng, Kun Kuang, Zhou Zhao, Shuo Li, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems. CoRR abs/2405.20626 (2024) - [i167]Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao:
Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching. CoRR abs/2406.00320 (2024) - [i166]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Text-to-Audio Generation with Latent Consistency Models. CoRR abs/2406.00356 (2024) - [i165]Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024) - [i164]Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. CoRR abs/2406.02429 (2024) - [i163]Ye Wang, Jiahao Xun, Mingjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. CoRR abs/2406.14017 (2024) - [i162]Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao:
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling. CoRR abs/2406.17507 (2024) - [i161]Ruiqi Li, Zhiqing Hong, Yongqi Wang, Lichao Zhang, Rongjie Huang, Siqi Zheng, Zhou Zhao:
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody. CoRR abs/2407.02049 (2024) - [i160]Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. CoRR abs/2407.05374 (2024) - [i159]Zehan Wang, Ziang Zhang, Hang Zhang, Luping Liu, Rongjie Huang, Xize Cheng, Hengshuang Zhao, Zhou Zhao:
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces. CoRR abs/2407.11895 (2024) - [i158]Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Jiayang Xu, Zhou Zhao:
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control. CoRR abs/2407.13220 (2024) - [i157]Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai:
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis. CoRR abs/2407.14006 (2024) - [i156]Zheqi Lv, Shaoxuan He, Tianyu Zhan, Shengyu Zhang, Wenqiao Zhang, Jingyuan Chen, Zhou Zhao, Fei Wu:
Semantic Codebook Learning for Dynamic Recommendation Models. CoRR abs/2408.00123 (2024) - [i155]Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Jiang, Zhenhui Ye, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency. CoRR abs/2408.04708 (2024) - [i154]Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu:
Semantic Alignment for Multimodal Large Language Models. CoRR abs/2408.12867 (2024) - [i153]Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024) - [i152]Qijiong Liu, Jieming Zhu, Lu Fan, Zhou Zhao, Xiao-Ming Wu:
STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM. CoRR abs/2409.07276 (2024) - [i151]Zhikai Wei, Wenhui Dong, Peilin Zhou, Yuliang Gu, Zhou Zhao, Yongchao Xu:
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation. CoRR abs/2409.12522 (2024) - [i150]Yu Zhang, Changhao Pan, Wenxiang Guo, Ruiqi Li, Zhiyuan Zhu, Jialei Wang, Wenhao Xu, Jingyu Lu, Zhiqing Hong, Chuxin Wang, LiChao Zhang, Jinzheng He, Ziyue Jiang, Yuxin Chen, Chen Yang, Jiecheng Zhou, Xinyu Cheng, Zhou Zhao:
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks. CoRR abs/2409.13832 (2024) - [i149]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. CoRR abs/2409.15977 (2024) - [i148]Wenrui Liu, Zhifang Guo, Jin Xu, Yuanjun Lv, Yunfei Chu, Zhou Zhao, Junyang Lin:
Analyzing and Mitigating Inconsistency in Discrete Audio Tokens for Neural Codec Language Models. CoRR abs/2409.19283 (2024) - [i147]Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024) - [i146]Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Heng Lu, Wei Xue, Zhou Zhao:
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation. CoRR abs/2410.12266 (2024) - [i145]Ruiqi Li, Siqi Zheng, Xize Cheng, Ziang Zhang, Shengpeng Ji, Zhou Zhao:
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization. CoRR abs/2410.12957 (2024) - [i144]Wenyi Xiao, Zechuan Wang, Leilei Gan, Shuai Zhao, Wanggui He, Luu Anh Tuan, Long Chen, Hao Jiang, Zhou Zhao, Fei Wu:
A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization. CoRR abs/2410.15595 (2024) - [i143]Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024) - [i142]Zirun Guo, Tao Jin, Jingyuan Chen, Zhou Zhao:
Classifier-guided Gradient Modulation for Enhanced Multimodal Learning. CoRR abs/2411.01409 (2024) - [i141]Fuming You, Minghui Fang, Li Tang, Rongjie Huang, Yongqi Wang, Zhou Zhao:
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence. CoRR abs/2411.01805 (2024) - 2023
- [b1]Zhou Zhao:
Heart Segmentation and Evaluation of Fibrosis. (Segmentation cardiaque et évaluation de la fibrose). Sorbonne University, Paris, France, 2023 - [j68]Zhou Zhao, Qingkai Guo, Yu Sun, Ningli An, Pengzhe Hui, Laihao Yang, Xuefeng Chen:
Bioinspired Hierarchical Structure for an Ultrawide-Range Multifunctional Flexible Sensor Using Porous Expandable Polyethylene/Loofah-Like Polyurethane Sponge Material. Adv. Intell. Syst. 5(1) (2023) - [j67]Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martín-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Élodie Puybareau, Ilkay Öksüz, Stéphanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura Maria Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang:
MyoPS: A benchmark of myocardial pathology segmentation combining three-sequence cardiac magnetic resonance images. Medical Image Anal. 87: 102808 (2023) - [j66]Shengyu Zhang, Fuli Feng, Kun Kuang, Wenqiao Zhang, Zhou Zhao, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Personalized Latent Structure Learning for Recommendation. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 10285-10299 (2023) - [j65]Zhenyu Lu, Lu Chen, Hengtai Dai, Haoran Li, Zhou Zhao, Bofang Zheng, Nathan F. Lepora, Chenguang Yang:
Visual-Tactile Robot Grasping Based on Human Skill Learning From Demonstrations Using a Wearable Parallel Hand Exoskeleton. IEEE Robotics Autom. Lett. 8(9): 5384-5391 (2023) - [j64]Yuzhen Guo, Zengxing Zhang, Bin Yao, Jin Chai, Shiqiang Zhang, Jianwei Liu, Zhou Zhao, Chenyang Xue:
Fabrication and Performance of a Ta2O5 Thin Film pH Sensor Manufactured Using MEMS Processes. Sensors 23(13): 6061 (2023) - [c224]Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian:
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories. AAAI 2023: 3552-3560 - [c223]Shengyu Zhang, Xusheng Feng, Wenyan Fan, Wenjing Fang, Fuli Feng, Wei Ji, Shuo Li, Li Wang, Shanshan Zhao, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Video-Audio Domain Generalization via Confounder Disentanglement. AAAI 2023: 15322-15330 - [c222]Zehan Wang, Yang Zhao, Haifeng Huang, Yan Xia, Zhou Zhao:
Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations. ACL (Findings) 2023: 144-160 - [c221]Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. ACL (Findings) 2023: 236-248 - [c220]Mengze Li, Tianbao Wang, Jiahe Xu, Kairong Han, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Shiliang Pu, Fei Wu:
Multi-modal Action Chain Abductive Reasoning. ACL (1) 2023: 4617-4628 - [c219]Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. ACL (1) 2023: 6592-6607 - [c218]Rongjie Huang, Yi Ren, Ziyue Jiang, Chenye Cui, Jinglin Liu, Zhou Zhao:
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis. ACL (Findings) 2023: 6994-7009 - [c217]Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. ACL (Findings) 2023: 7074-7088 - [c216]Rongjie Huang, Chunlei Zhang, Yi Ren, Zhou Zhao, Dong Yu:
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech. ACL (Findings) 2023: 8018-8034 - [c215]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1) 2023: 8590-8604 - [c214]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331 - [c213]Ye Wang, Tao Jin, Wang Lin, Xize Cheng, Linjun Li, Zhou Zhao:
Semantic-conditioned Dual Adaptation for Cross-domain Query-based Visual Segmentation. ACL (Findings) 2023: 9797-9815 - [c212]Ye Wang, Wang Lin, Shengyu Zhang, Tao Jin, Linjun Li, Xize Cheng, Zhou Zhao:
Weakly-Supervised Spoken Video Grounding via Semantic Interaction Learning. ACL (1) 2023: 10914-10932 - [c211]Linjun Li, Tao Jin, Xize Cheng, Ye Wang, Wang Lin, Rongjie Huang, Zhou Zhao:
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation. ACL (Findings) 2023: 10993-11007 - [c210]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671 - [c209]Jinglin Liu, Zhenhui Ye, Qian Chen, Siqi Zheng, Wen Wang, Qinglin Zhang, Zhou Zhao:
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect. ACL (Findings) 2023: 11905-11912 - [c208]Wang Lin, Tao Jin, Wenwen Pan, Linjun Li, Xize Cheng, Ye Wang, Zhou Zhao:
TAVT: Towards Transferable Audio-Visual Text Generation. ACL (1) 2023: 14983-14999 - [c207]Yujie Lu, Yingxuan Huang, Shengyu Zhang, Wei Han, Hui Chen, Wenyan Fan, Jiangliang Lai, Zhou Zhao, Fei Wu:
Multi-trends Enhanced Dynamic Micro-video Recommendation. CICAI (1) 2023: 430-441 - [c206]Pengcheng Zhang, Wenrui Liu, Ning Wang, Ran Shen, Gang Sun, Xinghua Jiang, Zheqian Chen, Fei Wu, Zhou Zhao:
Sequential Style Consistency Learning for Domain-Generalizable Text Recognition. CICAI (1) 2023: 493-504 - [c205]Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao:
Gloss Attention for Gloss-free Sign Language Translation. CVPR 2023: 2551-2562 - [c204]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-Commerce. CVPR 2023: 19315-19324 - [c203]Mengze Li, Han Wang, Wenqiao Zhang, Jiaxu Miao, Zhou Zhao, Shengyu Zhang, Wei Ji, Fei Wu:
WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding. CVPR 2023: 23090-23099 - [c202]Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu:
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos. CVPR 2023: 23191-23200 - [c201]Mengze Li, Tianqi Zhao, Jionghao Bai, Baoyi He, Jiaxu Miao, Wei Ji, Zheqi Lv, Zhou Zhao, Shengyu Zhang, Wenqiao Zhang, Fei Wu:
ART: rule bAsed futuRe-inference deducTion. EMNLP 2023: 9512-9522 - [c200]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding. EMNLP 2023: 10612-10625 - [c199]Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. EMNLP 2023: 15957-15969 - [c198]Chenye Cui, Zhou Zhao, Yi Ren, Jinglin Liu, Rongjie Huang, Feiyang Chen, Zhefeng Wang, Baoxing Huai, Fei Wu:
VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement. ICASSP 2023: 1-5 - [c197]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). ICASSP 2023: 1-2 - [c196]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. ICASSP 2023: 1-5 - [c195]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding. ICCV 2023: 2662-2671 - [c194]Jiong Wang, Huiming Zhang, Haiwen Hong, Xuan Jin, Yuan He, Hui Xue, Zhou Zhao:
Open-Vocabulary Object Detection With an Open Corpus. ICCV 2023: 6736-6746 - [c193]Wang Lin, Tao Jin, Ye Wang, Wenwen Pan, Linjun Li, Xize Cheng, Zhou Zhao:
Exploring Group Video Captioning with Efficient Relational Approximation. ICCV 2023: 15235-15244 - [c192]Xize Cheng, Tao Jin, Rongjie Huang, Linjun Li, Wang Lin, Zehan Wang, Ye Wang, Huadai Liu, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. ICCV 2023: 15689-15699 - [c191]Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. ICLR 2023 - [c190]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. ICLR 2023 - [c189]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932 - [c188]Zhenyu Lu, Tianqi Yue, Zhou Zhao, Weiyong Si, Ning Wang, Chenguang Yang:
MechTac: A Multifunctional Tendon-Linked Optical Tactile Sensor for In/Out-the-Field-of-View Perception with Deep Learning. IECON 2023: 1-6 - [c187]Yazheng Yang, Zhou Zhao, Qi Liu:
MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer. KDD 2023: 3022-3032 - [c186]Shengyu Zhang, Yunze Tong, Kun Kuang, Fuli Feng, Jiezhong Qiu, Jin Yu, Zhou Zhao, Hongxia Yang, Zhongfei Zhang, Fei Wu:
Stable Prediction on Graphs with Agnostic Distribution Shifts. CDPD 2023: 49-74 - [c185]Mengze Li, Haoyu Zhang, Juncheng Li, Zhou Zhao, Wenqiao Zhang, Shengyu Zhang, Shiliang Pu, Yueting Zhuang, Fei Wu:
Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning. ACM Multimedia 2023: 3807-3816 - [c184]Tao Jin, Xize Cheng, Linjun Li, Wang Lin, Ye Wang, Zhou Zhao:
Rethinking Missing Modality Learning from a Decoding Perspective. ACM Multimedia 2023: 4431-4439 - [c183]Haonan Shi, Wenwen Pan, Zhou Zhao, Mingmin Zhang, Fei Wu:
Unsupervised Domain Adaptation for Referring Semantic Segmentation. ACM Multimedia 2023: 5807-5818 - [c182]Zhiqing Hong, Chenye Cui, Rongjie Huang, Lichao Zhang, Jinglin Liu, Jinzheng He, Zhou Zhao:
UniSinger: Unified End-to-End Singing Voice Synthesis With Cross-Modality Information Matching. ACM Multimedia 2023: 7569-7579 - [c181]Haoyi Duan, Yan Xia, Mingze Zhou, Li Tang, Jieming Zhu, Zhou Zhao:
Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks. NeurIPS 2023 - [c180]Liya Hu, Zhiang Dong, Jingyuan Chen, Guifeng Wang, Zhihua Wang, Zhou Zhao, Fei Wu:
PTADisc: A Cross-Course Dataset Supporting Personalized Learning in Cold-Start Scenarios. NeurIPS 2023 - [c179]Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Aoxiong Yin, Li Tang, Linjun Li, Yongqi Wang, Ziang Zhang, Zhou Zhao:
Connecting Multi-modal Contrastive Representations. NeurIPS 2023 - [c178]Yan Xia, Hai Huang, Jieming Zhu, Zhou Zhao:
Achieving Cross Modal Generalization with Multimodal Unified Representation. NeurIPS 2023 - [c177]Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu:
DisCover: Disentangled Music Representation Learning for Cover Song Identification. SIGIR 2023: 453-463 - [c176]Liangcai Su, Fan Yan, Jieming Zhu, Xi Xiao, Haoyi Duan, Zhou Zhao, Zhenhua Dong, Ruiming Tang:
Beyond Two-Tower Matching: Learning Sparse Retrievable Cross-Interactions for Recommendation. SIGIR 2023: 548-557 - [i140]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023) - [i139]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. CoRR abs/2301.13430 (2023) - [i138]Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian:
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories. CoRR abs/2302.02373 (2023) - [i137]Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023) - [i136]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023) - [i135]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023) - [i134]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-commerce. CoRR abs/2304.03669 (2023) - [i133]Jiong Wang, Zhou Zhao, Fei Wu:
Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary. CoRR abs/2304.06249 (2023) - [i132]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023) - [i131]Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023) - [i130]Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu:
Denoising Multi-modal Sequential Recommenders with Contrastive Learning. CoRR abs/2305.01915 (2023) - [i129]Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu:
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos. CoRR abs/2305.02519 (2023) - [i128]Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. CoRR abs/2305.04476 (2023) - [i127]Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. CoRR abs/2305.10686 (2023) - [i126]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023) - [i125]Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. CoRR abs/2305.12552 (2023) - [i124]Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. CoRR abs/2305.12708 (2023) - [i123]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023) - [i122]Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao:
Connecting Multi-modal Contrastive Representations. CoRR abs/2305.14381 (2023) - [i121]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. CoRR abs/2305.15403 (2023) - [i120]Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023) - [i119]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Unified Voice Synthesis With Discrete Representation. CoRR abs/2305.19269 (2023) - [i118]Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao:
Detector Guidance for Multi-Object Text-to-Image Generation. CoRR abs/2306.02236 (2023) - [i117]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis. CoRR abs/2306.03504 (2023) - [i116]Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023) - [i115]Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. CoRR abs/2306.06410 (2023) - [i114]Yazheng Yang, Zhou Zhao, Qi Liu:
MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer. CoRR abs/2306.07994 (2023) - [i113]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR abs/2307.07218 (2023) - [i112]Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao:
Gloss Attention for Gloss-free Sign Language Translation. CoRR abs/2307.07361 (2023) - [i111]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding. CoRR abs/2307.09267 (2023) - [i110]Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu:
DisCover: Disentangled Music Representation Learning for Cover Song Identification. CoRR abs/2307.09775 (2023) - [i109]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding. CoRR abs/2307.13363 (2023) - [i108]Zehan Wang, Haifeng Huang, Yang Zhao, Ziang Zhang, Zhou Zhao:
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes. CoRR abs/2308.08769 (2023) - [i107]Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models. CoRR abs/2308.14430 (2023) - [i106]Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. CoRR abs/2309.07566 (2023) - [i105]Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng:
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. CoRR abs/2310.00704 (2023) - [i104]Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao:
Extending Multi-modal Contrastive Representations. CoRR abs/2310.08884 (2023) - [i103]Zijian Zhang, Luping Liu, Zhijie Lin, Yichen Zhu, Zhou Zhao:
Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models. CoRR abs/2310.09912 (2023) - [i102]Haoyi Duan, Yan Xia, Mingze Zhou, Li Tang, Jieming Zhu, Zhou Zhao:
Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks. CoRR abs/2311.05152 (2023) - [i101]Haoyuan Li, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism. CoRR abs/2311.13946 (2023) - [i100]Liangcai Su, Fan Yan, Jieming Zhu, Xi Xiao, Haoyi Duan, Zhou Zhao, Zhenhua Dong, Ruiming Tang:
Beyond Two-Tower Matching: Learning Sparse Retrievable Cross-Interactions for Recommendation. CoRR abs/2311.18213 (2023) - [i99]Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Liqun Deng, Wenqiao Zhang, Zhenhua Dong, Ruiming Tang, Xin Jiang:
Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast. CoRR abs/2312.06197 (2023) - [i98]Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao:
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers. CoRR abs/2312.08168 (2023) - [i97]Yu Zhang, Rongjie Huang, Ruiqi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. CoRR abs/2312.10741 (2023) - [i96]Haifeng Huang, Yang Zhao, Zehan Wang, Yan Xia, Zhou Zhao:
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding. CoRR abs/2312.13633 (2023) - [i95]Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. CoRR abs/2312.14488 (2023) - [i94]Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. CoRR abs/2312.15197 (2023) - 2022
- [j63]Tao Jin, Zhou Zhao, Peng Wang, Jun Yu, Fei Wu:
Interaction augmented transformer with decoupled decoding for video captioning. Neurocomputing 492: 496-507 (2022) - [j62]Pengcheng Zhang, Zhou Zhao, Nannan Wang, Jun Yu, Fei Wu:
Local-Global Graph Pooling via Mutual Information Maximization for Video-Paragraph Retrieval. IEEE Trans. Circuits Syst. Video Technol. 32(10): 7133-7146 (2022) - [j61]Jingkuan Song, Jingqiu Zhang, Lianli Gao, Zhou Zhao, Heng Tao Shen:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. IEEE Trans. Multim. 24: 791-804 (2022) - [j60]Zhaoyu Guo, Zhou Zhao, Weike Jin, Dazhou Wang, Ruitao Liu, Jun Yu:
TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce. IEEE Trans. Multim. 24: 2606-2616 (2022) - [j59]Wenhua Wang, Yuqun Zhang, Yulei Sui, Yao Wan, Zhou Zhao, Jian Wu, Philip S. Yu, Guandong Xu:
Reinforcement-Learning-Guided Source Code Summarization Using Hierarchical Attention. IEEE Trans. Software Eng. 48(2): 102-119 (2022) - [c175]Jinzheng He, Zhou Zhao, Yi Ren, Jinglin Liu, Baoxing Huai, Nicholas Jing Yuan:
Flow-Based Unconstrained Lip to Speech Generation. AAAI 2022: 843-851 - [c174]Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Jing Yuan, Zhou Zhao:
Parallel and High-Fidelity Text-to-Lip Generation. AAAI 2022: 1738-1746 - [c173]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao:
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism. AAAI 2022: 11020-11028 - [c172]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
Prior Knowledge and Memory Enriched Transformer for Sign Language Translation. ACL (Findings) 2022: 3766-3775 - [c171]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. ACL (1) 2022: 7970-7983 - [c170]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. ACL (1) 2022: 8197-8213 - [c169]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu:
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding. ACL (1) 2022: 8707-8717 - [c168]Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian:
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross- Modal Denoising Networks. CVPR 2022: 1310-1321 - [c167]Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
MLSLT: Towards Multilingual Sign Language Translation. CVPR 2022: 5099-5109 - [c166]Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song:
Fine-Grained Predicates Learning for Scene Graph Generation. CVPR 2022: 19445-19453 - [c165]Yan Xia, Zhou Zhao:
Cross-modal Background Suppression for Audio-Visual Event Localization. CVPR 2022: 19957-19966 - [c164]Lichao Zhang, Yi Ren, Liqun Deng, Zhou Zhao:
HiFiDenoise: High-Fidelity Denoising Text to Speech with Adversarial Networks. ICASSP 2022: 7232-7236 - [c163]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581 - [c162]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. ICLR 2022 - [c161]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163 - [c160]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. IJCAI 2022: 4468-4474 - [c159]Lichao Zhang, Zhou Zhao, Yi Ren, Liqun Deng:
EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling. IJCAI 2022: 4503-4509 - [c158]Zhou Zhao, Zhenyu Lu:
Multi-purpose Tactile Perception Based on Deep Learning in a New Tendon-driven Optical Tactile Sensor. IROS 2022: 2099-2104 - [c157]Yuxiao Lin, Zhihao Du, Shiliang Zhang, Fan Yu, Zhou Zhao, Fei Wu:
Separate-to-Recognize: Joint Multi-target Speech Separation and Speech Recognition for Speaker-attributed ASR. ISCSLP 2022: 150-154 - [c156]Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. ACM Multimedia 2022: 2525-2535 - [c155]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia 2022: 2595-2605 - [c154]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu:
HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding. ACM Multimedia 2022: 3801-3810 - [c153]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
MC-SLT: Towards Low-Resource Signer-Adaptive Sign Language Translation. ACM Multimedia 2022: 4939-4947 - [c152]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. ACM Multimedia 2022: 5191-5200 - [c151]Ziqi Jiang, Shengyu Zhang, Siyuan Yao, Wenqiao Zhang, Sihan Zhang, Juncheng Li, Zhou Zhao, Fei Wu:
Weakly-supervised Disentanglement Network for Video Fingerspelling Detection. ACM Multimedia 2022: 5446-5455 - [c150]Wencan Huang, Zhou Zhao, Jinzheng He, Mingmin Zhang:
DualSign: Semi-Supervised Sign Language Production with Balanced Multi-Modal Multi-Task Dual Transformation. ACM Multimedia 2022: 5486-5495 - [c149]Yongqi Wang, Zhou Zhao:
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis. ACM Multimedia 2022: 5678-5687 - [c148]Jiong Wang, Zhou Zhao, Fei Wu:
Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary. ACM Multimedia 2022: 6125-6135 - [c147]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech. NeurIPS 2022 - [c146]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. NeurIPS 2022 - [c145]Lichao Zhang, Ruiqi Li, Shoutong Wang, Liqun Deng, Jinglin Liu, Yi Ren, Jinzheng He, Rongjie Huang, Jieming Zhu, Xiao Chen, Zhou Zhao:
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus. NeurIPS 2022 - [c144]Zijian Zhang, Zhou Zhao, Zhijie Lin:
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models. NeurIPS 2022 - [c143]Yang Zhao, Chen Zhang, Haifeng Huang, Haoyuan Li, Zhou Zhao:
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization. NeurIPS 2022 - [c142]Ziqi Tan, Shengyu Zhang, Nuanxin Hong, Kun Kuang, Yifan Yu, Jin Yu, Zhou Zhao, Hongxia Yang, Shiyuan Pan, Jingren Zhou, Fei Wu:
Uncovering Causal Effects of Online Short Videos on Consumer Behaviors. WSDM 2022: 997-1006 - [c141]Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation. WWW 2022: 2216-2226 - [c140]Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui Zhang, Xiuqiang He:
Contrastive Learning with Positive-Negative Frame Mask for Music Representation. WWW 2022: 2906-2915 - [i93]Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martín-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Élodie Puybareau, Ilkay Öksüz, Stéphanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura Maria Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang:
MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance Images. CoRR abs/2201.03186 (2022) - [i92]Shoutong Wang, Jinglin Liu, Yi Ren, Zhen Wang, Changliang Xu, Zhou Zhao:
MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder. CoRR abs/2201.03864 (2022) - [i91]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. CoRR abs/2202.07816 (2022) - [i90]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. CoRR abs/2202.09778 (2022) - [i89]Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He:
VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation. CoRR abs/2202.10301 (2022) - [i88]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. CoRR abs/2202.13066 (2022) - [i87]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. CoRR abs/2202.13277 (2022) - [i86]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu:
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding. CoRR abs/2203.08013 (2022) - [i85]Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui Zhang, Xiuqiang He:
Contrastive Learning with Positive-Negative Frame Mask for Music Representation. CoRR abs/2203.09129 (2022) - [i84]Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song:
Fine-Grained Predicates Learning for Scene Graph Generation. CoRR abs/2204.02597 (2022) - [i83]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. CoRR abs/2204.09934 (2022) - [i82]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. CoRR abs/2204.11792 (2022) - [i81]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis. CoRR abs/2205.07211 (2022) - [i80]Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. CoRR abs/2205.12523 (2022) - [i79]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. CoRR abs/2206.02147 (2022) - [i78]Yang Zhao, Xuan Lin, Wenqiang Xu, Maozong Zheng, Zhengyong Liu, Zhou Zhao:
AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism. CoRR abs/2206.04888 (2022) - [i77]Yongqi Wang, Zhou Zhao:
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis. CoRR abs/2207.03800 (2022) - [i76]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech. CoRR abs/2207.06389 (2022) - [i75]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu:
HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding. CoRR abs/2208.05818 (2022) - [i74]Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation. CoRR abs/2208.08011 (2022) - [i73]Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-Seng Chua, Fei Wu:
CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation. CoRR abs/2208.08024 (2022) - [i72]Yang Zhao, Wenqiang Xu, Xuan Lin, Jingjing Huo, Hong Chen, Zhou Zhao:
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments. CoRR abs/2208.09612 (2022) - [i71]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. CoRR abs/2209.00277 (2022) - [i70]Jiong Wang, Zhou Zhao, Weike Jin:
Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering. CoRR abs/2209.03609 (2022) - [i69]Chenye Cui, Yi Ren, Jinglin Liu, Rongjie Huang, Zhou Zhao:
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement. CoRR abs/2211.10666 (2022) - [i68]Luping Liu, Yi Ren, Xize Cheng, Zhou Zhao:
Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection. CoRR abs/2211.11255 (2022) - [i67]Zijian Zhang, Zhou Zhao, Zhijie Lin:
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models. CoRR abs/2212.12990 (2022) - 2021
- [j58]Lianli Gao, Daiyuan Chen, Zhou Zhao, Jie Shao, Heng Tao Shen:
Lightweight dynamic conditional GAN with pyramid attention for text-to-image synthesis. Pattern Recognit. 110: 107384 (2021) - [j57]Zhaoyu Guo, Zhou Zhao, Weike Jin, Zhicheng Wei, Min Yang, Nannan Wang, Nicholas Jing Yuan:
Multi-Turn Video Question Generation via Reinforced Multi-Choice Attention Network. IEEE Trans. Circuits Syst. Video Technol. 31(5): 1697-1710 (2021) - [j56]Aming Wu, Yahong Han, Zhou Zhao, Yi Yang:
Hierarchical Memory Decoder for Visual Narrating. IEEE Trans. Circuits Syst. Video Technol. 31(6): 2438-2449 (2021) - [j55]Mao Gu, Zhou Zhao, Weike Jin, Richang Hong, Fei Wu:
Graph-Based Multi-Interaction Network for Video Question Answering. IEEE Trans. Image Process. 30: 2758-2770 (2021) - [j54]Weike Jin, Zhou Zhao, Xiaochun Cao, Jieming Zhu, Xiuqiang He, Yueting Zhuang:
Adaptive Spatio-Temporal Graph Enhanced Vision-Language Representation for Video QA. IEEE Trans. Image Process. 30: 5477-5489 (2021) - [j53]Zijian Zhang, Zhou Zhao, Zhu Zhang, Zhijie Lin, Qi Wang, Richang Hong:
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks. IEEE Trans. Multim. 23: 3306-3317 (2021) - [j52]Min Yang, Chengming Li, Ying Shen, Qingyao Wu, Zhou Zhao, Xiaojun Chen:
Hierarchical Human-Like Deep Neural Networks for Abstractive Text Summarization. IEEE Trans. Neural Networks Learn. Syst. 32(6): 2744-2757 (2021) - [j51]Min Yang, Qiang Qu, Ying Shen, Zhou Zhao, Xiaojun Chen, Chengming Li:
An Effective Hybrid Learning Model for Real-Time Event Summarization. IEEE Trans. Neural Networks Learn. Syst. 32(10): 4419-4431 (2021) - [c139]Dong Yao, Shengyu Zhang, Zhou Zhao, Wenyan Fan, Jieming Zhu, Xiuqiang He, Fei Wu:
Modeling High-order Interactions across Multi-interests for Micro-video Reommendation (Student Abstract). AAAI 2021: 15945-15946 - [c138]Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu:
DeVLBert: Out-of-Distribution Visio-Linguistic Pretraining With Causality. CVPR Workshops 2021: 1744-1747 - [c137]Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Kun Kuang, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu:
Grounded, Controllable and Debiased Image Completion With Lexical Semantics. CVPR Workshops 2021: 1748-1751 - [c136]Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin:
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval. CVPR 2021: 2215-2224 - [c135]Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding. CVPR 2021: 4197-4206 - [c134]Min Zhang, Yang Guo, Na Lei, Zhou Zhao, Jianfeng Wu, Xiaoyin Xu, Yalin Wang, Xianfeng Gu:
Cortical Surface Shape Analysis Based on Alexandrov Polyhedra. ICCV 2021: 14224-14232 - [c133]Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech. ICLR 2021 - [c132]Zhu Zhang, Chang Zhou, Jianxin Ma, Zhijie Lin, Jingren Zhou, Hongxia Yang, Zhou Zhao:
Learning to Rehearse in Long Sequence Memorization. ICML 2021: 12663-12673 - [c131]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. IJCAI 2021: 3829-3835 - [c130]Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao:
WSRGlow: A Glow-Based Waveform Generative Model for Audio Super-Resolution. Interspeech 2021: 1649-1653 - [c129]Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. Interspeech 2021: 2766-2770 - [c128]Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. ACM Multimedia 2021: 1359-1367 - [c127]Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He:
VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation. ACM Multimedia 2021: 1497-1506 - [c126]Wencan Huang, Wenwen Pan, Zhou Zhao, Qi Tian:
Towards Fast and High-Quality Sign Language Production. ACM Multimedia 2021: 3172-3181 - [c125]Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu:
Why Do We Click: Visual Impression-aware News Recommendation. ACM Multimedia 2021: 3881-3890 - [c124]Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. ACM Multimedia 2021: 3945-3954 - [c123]Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulSLT: End-to-End Simultaneous Sign Language Translation. ACM Multimedia 2021: 4118-4127 - [c122]Tao Jin, Zhou Zhao:
Contrastive Disentangled Meta-Learning for Signer-Independent Sign Language Translation. ACM Multimedia 2021: 5065-5073 - [c121]Tao Jin, Zhou Zhao:
Generalizable Multi-linear Attention Network. NeurIPS 2021: 9049-9060 - [c120]Yi Ren, Jinglin Liu, Zhou Zhao:
PortaSpeech: Portable and High-Quality Generative Text-to-Speech. NeurIPS 2021: 13963-13974 - [c119]Shengyu Zhang, Donghui Wang, Zhou Zhao, Siliang Tang, Kun Kuang, Di Xie, Fei Wu:
MGD-GAN: Text-to-Pedestrian Generation Through Multi-grained Discrimination. PRCV (2) 2021: 662-673 - [c118]Shengyu Zhang, Dong Yao, Zhou Zhao, Tat-Seng Chua, Fei Wu:
CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation. SIGIR 2021: 367-377 - [c117]Weike Jin, Zhou Zhao, Pengcheng Zhang, Jieming Zhu, Xiuqiang He, Yueting Zhuang:
Hierarchical Cross-Modal Graph Consistency Learning for Video-Text Retrieval. SIGIR 2021: 1114-1124 - [c116]Yujie Lu, Shengyu Zhang, Yingxuan Huang, Luyao Wang, Xinyao Yu, Zhou Zhao, Fei Wu:
Future-Aware Diverse Trends Framework for Recommendation. WWW 2021: 2992-3001 - [i66]Dong Yao, Shengyu Zhang, Zhou Zhao, Wenyan Fan, Jieming Zhu, Xiuqiang He, Fei Wu:
Modeling High-order Interactions across Multi-interests for Micro-video Reommendation. CoRR abs/2104.00305 (2021) - [i65]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao:
DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis. CoRR abs/2105.02446 (2021) - [i64]Zhu Zhang, Chang Zhou, Jianxin Ma, Zhijie Lin, Jingren Zhou, Hongxia Yang, Zhou Zhao:
Learning to Rehearse in Long Sequence Memorization. CoRR abs/2106.01096 (2021) - [i63]Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao:
WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution. CoRR abs/2106.08507 (2021) - [i62]Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. CoRR abs/2106.09317 (2021) - [i61]Jinglin Liu, Zhiying Zhu, Yi Ren, Zhou Zhao:
High-Speed and High-Quality Text-to-Lip Generation. CoRR abs/2107.06831 (2021) - [i60]Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. CoRR abs/2108.13630 (2021) - [i59]Shengyu Zhang, Dong Yao, Zhou Zhao, Tat-Seng Chua, Fei Wu:
CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation. CoRR abs/2109.05261 (2021) - [i58]Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu:
Why Do We Click: Visual Impression-aware News Recommendation. CoRR abs/2109.12651 (2021) - [i57]Yi Ren, Jinglin Liu, Zhou Zhao:
PortaSpeech: Portable and High-Quality Generative Text-to-Speech. CoRR abs/2109.15166 (2021) - [i56]Shengyu Zhang, Kun Kuang, Jiezhong Qiu, Jin Yu, Zhou Zhao, Hongxia Yang, Zhongfei Zhang, Fei Wu:
Stable Prediction on Graphs with Agnostic Distribution Shift. CoRR abs/2110.03865 (2021) - [i55]Yujie Lu, Yingxuan Huang, Shengyu Zhang, Wei Han, Hui Chen, Zhou Zhao, Fei Wu:
Multi-trends Enhanced Dynamic Micro-video Recommendation. CoRR abs/2110.03902 (2021) - [i54]Fuming You, Jingjing Li, Zhou Zhao:
Test-time Batch Statistics Calibration for Covariate Shift. CoRR abs/2110.04065 (2021) - [i53]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. CoRR abs/2110.07216 (2021) - [i52]Feiyang Chen, Rongjie Huang, Chenye Cui, Yi Ren, Jinglin Liu, Zhou Zhao, Nicholas Jing Yuan, Baoxing Huai:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. CoRR abs/2110.07468 (2021) - [i51]Jianyun Zou, Min Yang, Lichao Zhang, Yechen Xu, Qifan Pan, Fengqing Jiang, Ran Qin, Shushu Wang, Yifan He, Songfang Huang, Zhou Zhao:
A Chinese Multi-type Complex Questions Answering Dataset over Wikidata. CoRR abs/2111.06086 (2021) - [i50]Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulSLT: End-to-End Simultaneous Sign Language Translation. CoRR abs/2112.04228 (2021) - [i49]Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. CoRR abs/2112.10358 (2021) - 2020
- [j50]Jiyuan Zheng, Zhou Zhao, Zehan Song, Min Yang, Jun Xiao, Xiaohui Yan:
Abstractive meeting summarization by hierarchical adaptive segmental network learning with multiple revising steps. Neurocomputing 378: 179-188 (2020) - [j49]Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai:
Bi-Decoder Augmented Network for Neural Machine Translation. Neurocomputing 387: 188-194 (2020) - [j48]Shaoning Xiao, Yimeng Li, Yunan Ye, Long Chen, Shiliang Pu, Zhou Zhao, Jian Shao, Jun Xiao:
Hierarchical Temporal Fusion of Multi-grained Attention Features for Video Question Answering. Neural Process. Lett. 52(2): 993-1003 (2020) - [j47]Lianli Gao, Tao Li, Jingkuan Song, Zhou Zhao, Heng Tao Shen:
Play and rewind: Context-aware video temporal action proposals. Pattern Recognit. 107: 107477 (2020) - [j46]Mao Gu, Zhou Zhao, Weike Jin, Deng Cai, Fei Wu:
Video Dialog via Multi-Grained Convolutional Self-Attention Context Multi-Modal Networks. IEEE Trans. Circuits Syst. Video Technol. 30(12): 4453-4466 (2020) - [j45]Wei Zhao, Benyou Wang, Min Yang, Jianbo Ye, Zhou Zhao, Xiaojun Chen, Ying Shen:
Leveraging Long and Short-Term Information in Content-Aware Movie Recommendation via Adversarial Training. IEEE Trans. Cybern. 50(11): 4680-4693 (2020) - [j44]Min Yang, Junhao Liu, Lei Chen, Zhou Zhao, Xiaojun Chen, Ying Shen:
An Advanced Deep Generative Framework for Temporal Link Prediction in Dynamic Networks. IEEE Trans. Cybern. 50(12): 4946-4957 (2020) - [j43]Zhijie Lin, Zhou Zhao, Zhu Zhang, Zijian Zhang, Deng Cai:
Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction. IEEE Trans. Image Process. 29: 3750-3762 (2020) - [j42]Zhou Zhao, Shuwen Xiao, Zehan Song, Chujie Lu, Jun Xiao, Yueting Zhuang:
Open-Ended Video Question Answering via Multi-Modal Conditional Adversarial Networks. IEEE Trans. Image Process. 29: 3859-3870 (2020) - [j41]Shuwen Xiao, Zhou Zhao, Zijian Zhang, Ziyu Guan, Deng Cai:
Query-Biased Self-Attentive Network for Query-Focused Video Summarization. IEEE Trans. Image Process. 29: 5889-5899 (2020) - [j40]Min Yang, Junhao Liu, Ying Shen, Zhou Zhao, Xiaojun Chen, Qingyao Wu, Chengming Li:
An Ensemble of Generation- and Retrieval-Based Image Captioning With Dual Generator Generative Adversarial Network. IEEE Trans. Image Process. 29: 9627-9640 (2020) - [j39]Jiarong Xu, Yifan Luo, Jianrong Tao, Changjie Fan, Zhou Zhao, Jiangang Lu:
NGUARD+: An Attention-based Game Bot Detection Framework via Player Behavior Sequences. ACM Trans. Knowl. Discov. Data 14(6): 65:1-65:24 (2020) - [j38]Yueting Zhuang, Dejing Xu, Xin Yan, Wenzhuo Cheng, Zhou Zhao, Shiliang Pu, Jun Xiao:
Multichannel Attention Refinement for Video Question Answering. ACM Trans. Multim. Comput. Commun. Appl. 16(1s): 24:1-24:23 (2020) - [c115]Min Yang, Chengming Li, Fei Sun, Zhou Zhao, Ying Shen, Chenglin Wu:
Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization. AAAI 2020: 9410-9417 - [c114]Zhijie Lin, Zhou Zhao, Zhu Zhang, Qi Wang, Huasheng Liu:
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network. AAAI 2020: 11539-11546 - [c113]Junhao Liu, Kai Wang, Chunpu Xu, Zhou Zhao, Ruifeng Xu, Ying Shen, Min Yang:
Interactive Dual Generative Adversarial Networks for Image Captioning. AAAI 2020: 11588-11595 - [c112]Qiang Wang, Pin Jiang, Zhiyi Guo, Yahong Han, Zhou Zhao:
Multi-Speaker Video Dialog with Frame-Level Temporal Localization. AAAI 2020: 12200-12207 - [c111]Shuwen Xiao, Zhou Zhao, Zijian Zhang, Xiaohui Yan, Min Yang:
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization. AAAI 2020: 12426-12433 - [c110]Yi Ren, Jinglin Liu, Xu Tan, Zhou Zhao, Sheng Zhao, Tie-Yan Liu:
A Study of Non-autoregressive Model for Sequence Generation. ACL 2020: 149-159 - [c109]Yi Ren, Jinglin Liu, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
SimulSpeech: End-to-End Simultaneous Speech to Text Translation. ACL 2020: 3787-3796 - [c108]Zhu Zhang, Zhou Zhao, Yang Zhao, Qi Wang, Huasheng Liu, Lianli Gao:
Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences. CVPR 2020: 10665-10674 - [c107]Zhou Zhao, Élodie Puybareau, Nicolas Boutry, Thierry Géraud:
FOANet: A Focus of Attention Network with Application to Myocardium Segmentation. ICPR 2020: 1120-1127 - [c106]Zhou Zhao, Élodie Puybareau, Nicolas Boutry, Thierry Géraud:
Do not Treat Boundaries and Regions Differently: An Example on Heart Left Atrial Segmentation. ICPR 2020: 7447-7453 - [c105]Zhu Zhang, Zhou Zhao, Zhijie Lin, Baoxing Huai, Jing Yuan:
Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding. IJCAI 2020: 1069-1075 - [c104]Jinglin Liu, Yi Ren, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation. IJCAI 2020: 3861-3867 - [c103]Yi Ren, Xu Tan, Tao Qin, Jian Luan, Zhou Zhao, Tie-Yan Liu:
DeepSinger: Singing Voice Synthesis with Data Mined From the Web. KDD 2020: 1979-1989 - [c102]Shengyu Zhang, Ziqi Tan, Zhou Zhao, Jin Yu, Kun Kuang, Tan Jiang, Jingren Zhou, Hongxia Yang, Fei Wu:
Comprehensive Information Integration Modeling Framework for Video Titling. KDD 2020: 2744-2754 - [c101]Zhou Zhao, Nicolas Boutry, Élodie Puybareau:
Stacked and Parallel U-Nets with Multi-output for Myocardial Pathology Segmentation. MyoPS@MICCAI 2020: 138-145 - [c100]Yi Ren, Jinzheng He, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
PopMAG: Pop Music Accompaniment Generation. ACM Multimedia 2020: 1198-1206 - [c99]Shengyu Zhang, Ziqi Tan, Jin Yu, Zhou Zhao, Kun Kuang, Jie Liu, Jingren Zhou, Hongxia Yang, Fei Wu:
Poet: Product-oriented Video Captioner for E-commerce. ACM Multimedia 2020: 1292-1301 - [c98]Zijian Zhang, Zhou Zhao, Zhu Zhang, Baoxing Huai, Jing Yuan:
Text-Guided Image Inpainting. ACM Multimedia 2020: 4079-4087 - [c97]Zhu Zhang, Zhijie Lin, Zhou Zhao, Jieming Zhu, Xiuqiang He:
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos. ACM Multimedia 2020: 4098-4106 - [c96]Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Jing Yuan:
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire. ACM Multimedia 2020: 4328-4336 - [c95]Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu:
DeVLBert: Learning Deconfounded Visio-Linguistic Representations. ACM Multimedia 2020: 4373-4382 - [c94]Zhu Zhang, Zhou Zhao, Zhijie Lin, Jieming Zhu, Xiuqiang He:
Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding. NeurIPS 2020 - [c93]Yingying Zhu, Biao Li, Jiong Wang, Zhou Zhao:
Regional Relation Modeling for Visual Place Recognition. SIGIR 2020: 821-830 - [c92]Yang Sun, Fajie Yuan, Min Yang, Guoao Wei, Zhou Zhao, Duo Liu:
A Generic Network Compression Framework for Sequential Recommender Systems. SIGIR 2020: 1299-1308 - [i48]Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai:
Bi-Decoder Augmented Network for Neural Machine Translation. CoRR abs/2001.04586 (2020) - [i47]Zhu Zhang, Zhou Zhao, Yang Zhao, Qi Wang, Huasheng Liu, Lianli Gao:
Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences. CoRR abs/2001.06891 (2020) - [i46]Shuwen Xiao, Zhou Zhao, Zijian Zhang, Xiaohui Yan, Min Yang:
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization. CoRR abs/2002.03740 (2020) - [i45]Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu:
Grounded and Controllable Image Completion by Incorporating Lexical Semantics. CoRR abs/2003.00303 (2020) - [i44]Yi Ren, Jinglin Liu, Xu Tan, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
A Study of Non-autoregressive Model for Sequence Generation. CoRR abs/2004.10454 (2020) - [i43]Yang Sun, Fajie Yuan, Min Yang, Guoao Wei, Zhou Zhao, Duo Liu:
A Generic Network Compression Framework for Sequential Recommender Systems. CoRR abs/2004.13139 (2020) - [i42]Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech. CoRR abs/2006.04558 (2020) - [i41]Shengyu Zhang, Ziqi Tan, Jin Yu, Zhou Zhao, Kun Kuang, Tan Jiang, Jingren Zhou, Hongxia Yang, Fei Wu:
Comprehensive Information Integration Modeling Framework for Video Titling. CoRR abs/2006.13608 (2020) - [i40]Yi Ren, Xu Tan, Tao Qin, Jian Luan, Zhou Zhao, Tie-Yan Liu:
DeepSinger: Singing Voice Synthesis with Data Mined From the Web. CoRR abs/2007.04590 (2020) - [i39]Jinglin Liu, Yi Ren, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation. CoRR abs/2007.08772 (2020) - [i38]Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Jing Yuan:
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire. CoRR abs/2008.02516 (2020) - [i37]Shengyu Zhang, Ziqi Tan, Jin Yu, Zhou Zhao, Kun Kuang, Jie Liu, Jingren Zhou, Hongxia Yang, Fei Wu:
Poet: Product-oriented Video Captioner for E-commerce. CoRR abs/2008.06880 (2020) - [i36]Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu:
DeVLBert: Learning Deconfounded Visio-Linguistic Representations. CoRR abs/2008.06884 (2020) - [i35]Zhu Zhang, Zhou Zhao, Zhijie Lin, Baoxing Huai, Nicholas Jing Yuan:
Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding. CoRR abs/2008.06941 (2020) - [i34]Yi Ren, Jinzheng He, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
PopMAG: Pop Music Accompaniment Generation. CoRR abs/2008.07703 (2020) - [i33]Zhu Zhang, Zhijie Lin, Zhou Zhao, Jieming Zhu, Xiuqiang He:
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos. CoRR abs/2008.08257 (2020) - [i32]Shengyu Zhang, Donghui Wang, Zhou Zhao, Siliang Tang, Di Xie, Fei Wu:
MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination. CoRR abs/2010.00947 (2020) - [i31]Yujie Lu, Shengyu Zhang, Yingxuan Huang, Luyao Wang, Xinyao Yu, Zhou Zhao, Fei Wu:
Future-Aware Diverse Trends Framework for Recommendation. CoRR abs/2011.00422 (2020)
2010 – 2019
- 2019
- [j37]Zhou Zhao, Ashok Srivastava, Lu Peng, Qing Chen:
Long Short-Term Memory Network Design for Analog Computing. ACM J. Emerg. Technol. Comput. Syst. 15(1): 13:1-13:27 (2019) - [j36]Min Yang, Qingnan Jiang, Ying Shen, Qingyao Wu, Zhou Zhao, Wei Zhou:
Hierarchical human-like strategy for aspect-level sentiment classification with sentiment linguistic knowledge and reinforcement learning. Neural Networks 117: 240-248 (2019) - [j35]Min Yang, Wei Zhao, Lei Chen, Qiang Qu, Zhou Zhao, Ying Shen:
Investigating the transferring capability of capsule networks for text classification. Neural Networks 118: 247-261 (2019) - [j34]Zhou Zhao, Zhu Zhang, Xinghua Jiang, Deng Cai:
Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks. IEEE Trans. Image Process. 28(8): 3860-3872 (2019) - [j33]Zhou Zhao, Zhu Zhang, Shuwen Xiao, Zhenxin Xiao, Xiaohui Yan, Jun Yu, Deng Cai, Fei Wu:
Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks. IEEE Trans. Image Process. 28(12): 5939-5952 (2019) - [j32]Min Yang, Wei Zhao, Wei Xu, Yabing Feng, Zhou Zhao, Xiaojun Chen, Kai Lei:
Multitask Learning for Cross-Domain Image Captioning. IEEE Trans. Multim. 21(4): 1047-1061 (2019) - [j31]Weike Jin, Zhou Zhao, Yimeng Li, Jie Li, Jun Xiao, Yueting Zhuang:
Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network. ACM Trans. Multim. Comput. Commun. Appl. 15(2s): 52:1-52:22 (2019) - [c91]Long Chen, Ziyu Guan, Wei Zhao, Wanqing Zhao, Xiaopeng Wang, Zhou Zhao, Huan Sun:
Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks. AAAI 2019: 45-52 - [c90]Min Yang, Qiang Qu, Wenting Tu, Ying Shen, Zhou Zhao, Xiaojun Chen:
Exploring Human-Like Reading Strategy for Abstractive Text Summarization. AAAI 2019: 7362-7369 - [c89]Zhou Yu, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang, Dacheng Tao:
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering. AAAI 2019: 9127-9134 - [c88]Zhijie Lin, Kaiyang Lin, Shiling Chen, Linlin Li, Zhou Zhao:
Location-Based End-to-End Speech Recognition with Multiple Language Models. AAAI 2019: 9975-9976 - [c87]Wei Huang, Enhong Chen, Qi Liu, Yuying Chen, Zai Huang, Yang Liu, Zhou Zhao, Dan Zhang, Shijin Wang:
Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach. CIKM 2019: 1051-1060 - [c86]Junyu Luo, Ying Shen, Xiang Ao, Zhou Zhao, Min Yang:
Cross-modal Image-Text Retrieval with Multitask Learning. CIKM 2019: 2309-2312 - [c85]Dejing Xu, Jun Xiao, Zhou Zhao, Jian Shao, Di Xie, Yueting Zhuang:
Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction. CVPR 2019: 10334-10343 - [c84]Weike Jin, Zhou Zhao, Mao Gu, Jun Xiao, Furu Wei, Yueting Zhuang:
Video Dialog via Progressive Inference and Cross-Transformer. EMNLP/IJCNLP (1) 2019: 2109-2118 - [c83]Xu Tan, Yi Ren, Di He, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Multilingual Neural Machine Translation with Knowledge Distillation. ICLR (Poster) 2019 - [c82]Yi Ren, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
Almost Unsupervised Text to Speech and Automatic Speech Recognition. ICML 2019: 5410-5419 - [c81]Lianli Gao, Xiaosu Zhu, Jingkuan Song, Zhou Zhao, Heng Tao Shen:
Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval. IJCAI 2019: 723-729 - [c80]Yutong Wang, Jiyuan Zheng, Qijiong Liu, Zhou Zhao, Jun Xiao, Yueting Zhuang:
Weak Supervision Enhanced Generative Network for Question Generation. IJCAI 2019: 3806-3812 - [c79]Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He:
Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks. IJCAI 2019: 4383-4389 - [c78]Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Deng Cai:
Localizing Unseen Activities in Video via Image Query. IJCAI 2019: 4390-4396 - [c77]Yao Wan, Jingdong Shu, Yulei Sui, Guandong Xu, Zhou Zhao, Jian Wu, Philip S. Yu:
Multi-modal Attention Network Learning for Semantic Source Code Retrieval. ASE 2019: 13-25 - [c76]Zhou Zhao, Nicolas Boutry, Élodie Puybareau, Thierry Géraud:
A Two-Stage Temporal-Like Fully Convolutional Network Framework for Left Ventricle Segmentation and Quantification on MR Images. STACOM@MICCAI 2019: 405-413 - [c75]Weike Jin, Zhou Zhao, Mao Gu, Jun Yu, Jun Xiao, Yueting Zhuang:
Multi-interaction Network with Object Relation for Video Question Answering. ACM Multimedia 2019: 1193-1201 - [c74]Yinwei Wei, Zhiyong Cheng, Xuzheng Yu, Zhou Zhao, Lei Zhu, Liqiang Nie:
Personalized Hashtag Recommendation for Micro-videos. ACM Multimedia 2019: 1446-1454 - [c73]Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech: Fast, Robust and Controllable Text to Speech. NeurIPS 2019: 3165-3174 - [c72]Weike Jin, Zhou Zhao, Mao Gu, Jun Yu, Jun Xiao, Yueting Zhuang:
Video Dialog via Multi-Grained Convolutional Self-Attention Context Networks. SIGIR 2019: 465-474 - [c71]Zhu Zhang, Zhijie Lin, Zhou Zhao, Zhenxin Xiao:
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos. SIGIR 2019: 655-664 - [c70]Zhou Zhao, Haojie Pan, Changjie Fan, Yan Liu, Linlin Li, Min Yang:
Abstractive Meeting Summarization via Hierarchical Adaptive Segmental Network Learning. WWW 2019: 3455-3461 - [i30]Xu Tan, Yi Ren, Di He, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Multilingual Neural Machine Translation with Knowledge Distillation. CoRR abs/1902.10461 (2019) - [i29]Yi Ren, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
Almost Unsupervised Text to Speech and Automatic Speech Recognition. CoRR abs/1905.06791 (2019) - [i28]Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech: Fast, Robust and Controllable Text to Speech. CoRR abs/1905.09263 (2019) - [i27]Zhou Yu, Dejing Xu, Jun Yu, Ting Yu, Zhou Zhao, Yueting Zhuang, Dacheng Tao:
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering. CoRR abs/1906.02467 (2019) - [i26]Zhu Zhang, Zhijie Lin, Zhou Zhao, Zhenxin Xiao:
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos. CoRR abs/1906.02497 (2019) - [i25]Lianli Gao, Xiaosu Zhu, Jingkuan Song, Zhou Zhao, Heng Tao Shen:
Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval. CoRR abs/1906.06698 (2019) - [i24]Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Xiaofei He:
Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks. CoRR abs/1906.12158 (2019) - [i23]Zhu Zhang, Zhou Zhao, Zhijie Lin, Jingkuan Song, Deng Cai:
Localizing Unseen Activities in Video via Image Query. CoRR abs/1906.12165 (2019) - [i22]Yutong Wang, Jiyuan Zheng, Qijiong Liu, Zhou Zhao, Jun Xiao, Yueting Zhuang:
Weak Supervision Enhanced Generative Network for Question Generation. CoRR abs/1907.00607 (2019) - [i21]Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He:
Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference. CoRR abs/1907.09692 (2019) - [i20]Boyuan Pan, Yazheng Yang, Hao Li, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He:
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models. CoRR abs/1908.01816 (2019) - [i19]Yinwei Wei, Zhiyong Cheng, Xuzheng Yu, Zhou Zhao, Lei Zhu, Liqiang Nie:
Personalized Hashtag Recommendation for Micro-videos. CoRR abs/1908.09987 (2019) - [i18]Hongyang Xue, Wenqing Chu, Zhou Zhao, Deng Cai:
A Better Way to Attend: Attention with Trees for Video Question Answering. CoRR abs/1909.02218 (2019) - [i17]Yao Wan, Jingdong Shu, Yulei Sui, Guandong Xu, Zhou Zhao, Jian Wu, Philip S. Yu:
Multi-Modal Attention Network Learning for Semantic Source Code Retrieval. CoRR abs/1909.13516 (2019) - [i16]Zhijie Lin, Zhou Zhao, Zhu Zhang, Qi Wang, Huasheng Liu:
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network. CoRR abs/1911.08199 (2019) - 2018
- [j30]Anush Bekal, Bharathi Mathyarasa, Manish Goswami, Zhou Zhao, Ashok Srivatsava:
Six-bit, reusable comparator stage-based asynchronous binary-search SAR ADC using smart switching network. IET Circuits Devices Syst. 12(1): 124-131 (2018) - [j29]Zhou Zhao, Ashok Srivastava, Lu Peng, Saraju P. Mohanty:
Calibration method to reduce the error in logarithmic conversion with its circuit implementation. IET Circuits Devices Syst. 12(4): 301-308 (2018) - [j28]Zheqian Chen, Chi Zhang, Zhou Zhao, Chengwei Yao, Deng Cai:
Question retrieval for community-based question answering via heterogeneous social influential network. Neurocomputing 285: 117-124 (2018) - [j27]Wenqing Chu, Hongyang Xue, Zhou Zhao, Deng Cai, Chengwei Yao:
The forgettable-watcher model for video question answering. Neurocomputing 314: 386-393 (2018) - [j26]Yao Wan, Guandong Xu, Liang Chen, Zhou Zhao, Jian Wu:
Exploiting cross-source knowledge for warming up community question answering services. Neurocomputing 320: 25-34 (2018) - [j25]Zhou Zhao, Ashok Srivastava, Lu Peng, Saraju P. Mohanty:
A Multiple Input Floating Gate Based Arithmetic Logic Unit with a Feedback Loop for Digital Calibration. J. Low Power Electron. 14(4): 535-547 (2018) - [j24]Xinyu Duan, Siliang Tang, Shengyu Zhang, Yin Zhang, Zhou Zhao, Jianru Xue, Yueting Zhuang, Fei Wu:
Temporality-enhanced knowledgememory network for factoid question answering. Frontiers Inf. Technol. Electron. Eng. 19(1): 104-115 (2018) - [j23]Min Yang, Wenting Tu, Qiang Qu, Zhou Zhao, Xiaojun Chen, Jia Zhu:
Personalized response generation by Dual-learning based domain adaptation. Neural Networks 103: 72-82 (2018) - [j22]Lu Chen, Panfeng Huang, Zhou Zhao:
Refining object proposals using structured edge and superpixel contrast in robotic grasping. Robotics Auton. Syst. 100: 194-205 (2018) - [j21]Shaoming Chen, Lu Peng, Samuel Irving, Zhou Zhao, Weihua Zhang, Ashok Srivastava:
qSwitch: Dynamical Off-Chip Bandwidth Allocation Between Local and Remote Accesses. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 37(1): 75-87 (2018) - [j20]Hongyang Xue, Wenqing Chu, Zhou Zhao, Deng Cai:
A Better Way to Attend: Attention With Trees for Video Question Answering. IEEE Trans. Image Process. 27(11): 5563-5574 (2018) - [j19]Xiaojun Chen, Yixiang Fang, Min Yang, Feiping Nie, Zhou Zhao, Joshua Zhexue Huang:
PurTreeClust: A Clustering Algorithm for Customer Segmentation from Massive Customer Transaction Data. IEEE Trans. Knowl. Data Eng. 30(3): 559-572 (2018) - [j18]Zhou Zhao, Qifan Yang, Hanqing Lu, Tim Weninger, Deng Cai, Xiaofei He, Yueting Zhuang:
Social-Aware Movie Recommendation via Multimodal Network Learning. IEEE Trans. Multim. 20(2): 430-440 (2018) - [j17]Yao Wan, Liang Chen, Guandong Xu, Zhou Zhao, Jie Tang, Jian Wu:
SCSMiner: mining social coding sites for software developer recommendation with relevance propagation. World Wide Web 21(6): 1523-1543 (2018) - [c69]Zemin Liu, Vincent W. Zheng, Zhou Zhao, Fanwei Zhu, Kevin Chen-Chuan Chang, Minghui Wu, Jing Ying:
Distance-Aware DAG Embedding for Proximity Search on Heterogeneous Graphs. AAAI 2018: 2355-2362 - [c68]Xinyu Duan, Shengyu Zhang, Zhou Zhao, Fei Wu, Yueting Zhuang:
Multi-Label Community-Based Question Classification via Personalized Sequence Memory Network Learning. AAAI 2018: 8071-8072 - [c67]Yibo Jiang, Zhou Zhao:
StackReader: An RNN-Free Reading Comprehension Model. AAAI 2018: 8091-8092 - [c66]Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He:
Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference. ACL (1) 2018: 989-999 - [c65]Yao Wan, Wenqiang Yan, Jianwei Gao, Zhou Zhao, Jian Wu, Philip S. Yu:
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training. IEEE BigData 2018: 841-850 - [c64]Min Yang, Qiang Qu, Jia Zhu, Ying Shen, Zhou Zhao:
Cross-domain Aspect/Sentiment-aware Abstractive Review Summarization. CIKM 2018: 1531-1534 - [c63]Min Yang, Wei Zhao, Jianbo Ye, Zeyang Lei, Zhou Zhao, Soufei Zhang:
Investigating Capsule Networks with Dynamic Routing for Text Classification. EMNLP 2018: 3110-3119 - [c62]Shaoning Xiao, Yimeng Li, Yunan Ye, Zhou Zhao, Jun Xiao, Fei Wu, Jiang Zhu, Yueting Zhuang:
Video question answering via multi-granularity temporal attention network learning. ICIMCS 2018: 46:1-46:5 - [c61]Zhou Yu, Jun Yu, Chenchao Xiang, Zhou Zhao, Qi Tian, Dacheng Tao:
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding. IJCAI 2018: 1114-1120 - [c60]Wei Zhao, Benyou Wang, Jianbo Ye, Min Yang, Zhou Zhao, Ruotian Luo, Yu Qiao:
A Multi-task Learning Approach for Image Captioning. IJCAI 2018: 1205-1211 - [c59]Zhou Zhao, Lingtao Meng, Jun Xiao, Min Yang, Fei Wu, Deng Cai, Xiaofei He, Yueting Zhuang:
Attentional Image Retweet Modeling via Multi-Faceted Ranking Network Learning. IJCAI 2018: 3184-3190 - [c58]Zhou Zhao, Zhu Zhang, Shuwen Xiao, Zhou Yu, Jun Yu, Deng Cai, Fei Wu, Yueting Zhuang:
Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks. IJCAI 2018: 3683-3689 - [c57]Zhou Zhao, Xinghua Jiang, Deng Cai, Jun Xiao, Xiaofei He, Shiliang Pu:
Multi-Turn Video Question Answering via Multi-Stream Hierarchical Attention Context Network. IJCAI 2018: 3690-3696 - [c56]Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu:
Improving automatic source code summarization via deep reinforcement learning. ASE 2018: 397-407 - [c55]Jianrong Tao, Jiarong Xu, Linxia Gong, Yifu Li, Changjie Fan, Zhou Zhao:
NGUARD: A Game Bot Detection Framework for NetEase MMORPGs. KDD 2018: 811-820 - [c54]Zemin Liu, Vincent W. Zheng, Zhou Zhao, Zhao Li, Hongxia Yang, Minghui Wu, Jing Ying:
Interactive Paths Embedding for Semantic Proximity Search on Heterogeneous Graphs. KDD 2018: 1860-1869 - [c53]Élodie Puybareau, Zhou Zhao, Younes Khoudli, Edwin Carlinet, Yongchao Xu, Jérôme Lacotte, Thierry Géraud:
Left Atrial Segmentation in a Few Seconds Using Fully Convolutional Network and Transfer Learning. STACOM@MICCAI 2018: 339-347 - [c52]Boyuan Pan, Yazheng Yang, Hao Li, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He:
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models. NeurIPS 2018: 6095-6105 - [c51]Min Yang, Qiang Qu, Kai Lei, Jia Zhu, Zhou Zhao, Xiaojun Chen, Joshua Zhexue Huang:
Investigating Deep Reinforcement Learning Techniques in Personalized Dialogue Generation. SDM 2018: 630-638 - [c50]Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, Xiaofei He:
Dialogue Act Recognition via CRF-Attentive Structured Network. SIGIR 2018: 225-234 - [c49]Zemin Liu, Vincent W. Zheng, Zhou Zhao, Hongxia Yang, Kevin Chen-Chuan Chang, Minghui Wu, Jing Ying:
Subgraph-augmented Path Embedding for Semantic User Search on Heterogeneous Social Network. WWW 2018: 1613-1622 - [i15]Wei Zhao, Jianbo Ye, Min Yang, Zeyang Lei, Soufei Zhang, Zhou Zhao:
Investigating Capsule Networks with Dynamic Routing for Text Classification. CoRR abs/1804.00538 (2018) - [i14]Zhou Yu, Jun Yu, Chenchao Xiang, Zhou Zhao, Qi Tian, Dacheng Tao:
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding. CoRR abs/1805.03508 (2018) - [i13]Zhou Zhao, Hanbing Zhan, Lingtao Meng, Jun Xiao, Jun Yu, Min Yang, Fei Wu, Deng Cai:
Textually Guided Ranking Network for Attentional Image Retweet Modeling. CoRR abs/1810.10226 (2018) - [i12]Haojie Pan, Junpei Zhou, Zhou Zhao, Yan Liu, Deng Cai, Min Yang:
Dial2Desc: End-to-end Dialogue Description Generation. CoRR abs/1811.00185 (2018) - [i11]Yao Wan, Wenqiang Yan, Jianwei Gao, Zhou Zhao, Jian Wu, Philip S. Yu:
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training. CoRR abs/1811.05021 (2018) - [i10]Yao Wan, Zhou Zhao, Min Yang, Guandong Xu, Haochao Ying, Jian Wu, Philip S. Yu:
Improving Automatic Source Code Summarization via Deep Reinforcement Learning. CoRR abs/1811.07234 (2018) - 2017
- [j16]Zhou Zhao, Ashok Srivastava, Lu Peng, Shaoming Chen, Saraju P. Mohanty:
A novel switchable pin method for regulating power in chip-multiprocessor. Integr. 58: 329-338 (2017) - [j15]Zhou Zhao, Panfeng Huang, Zhenyu Lu, Zhengxiong Liu:
Augmented reality for enhancing tele-robotic system with force feedback. Robotics Auton. Syst. 96: 93-101 (2017) - [j14]Hongyang Xue, Zhou Zhao, Deng Cai:
Unifying the Video and Question Attentions for Open-Ended Video Question Answering. IEEE Trans. Image Process. 26(12): 5656-5666 (2017) - [j13]Fei Wu, Xinyu Duan, Jun Xiao, Zhou Zhao, Siliang Tang, Yin Zhang, Yueting Zhuang:
Temporal Interaction and Causal Influence in Community-Based Question Answering. IEEE Trans. Knowl. Data Eng. 29(10): 2304-2317 (2017) - [c48]Zemin Liu, Vincent W. Zheng, Zhou Zhao, Fanwei Zhu, Kevin Chen-Chuan Chang, Minghui Wu, Jing Ying:
Semantic Proximity Search on Heterogeneous Graph by Proximity Embedding. AAAI 2017: 154-160 - [c47]Zhou Zhao, Hanqing Lu, Vincent W. Zheng, Deng Cai, Xiaofei He, Yueting Zhuang:
Community-Based Question Answering via Asymmetric Multi-Faceted Ranking Network Learning. AAAI 2017: 3532-3539 - [c46]Wei Zhao, Wei Xu, Min Yang, Jianbo Ye, Zhou Zhao, Yabing Feng, Yu Qiao:
Dual Learning for Cross-domain Image Captioning. CIKM 2017: 29-38 - [c45]Yutong Wang, Yixin Xu, Min Yang, Zhou Zhao, Jun Xiao, Yueting Zhuang:
Integrating Side Information for Boosting Machine Comprehension. CIKM 2017: 2355-2358 - [c44]Zhou Zhao, Yicong Zhou:
An Image Contrast Enhancement Algorithm Using PLIP-Based Histogram Modification. CYBCONF 2017: 1-4 - [c43]Min Yang, Jincheng Mei, Heng Ji, Wei Zhao, Zhou Zhao, Xiaojun Chen:
Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters. EMNLP 2017: 527-533 - [c42]Zhou Zhao, Qifan Yang, Deng Cai, Xiaofei He, Yueting Zhuang:
Video Question Answering via Hierarchical Spatio-Temporal Attention Networks. IJCAI 2017: 3518-3524 - [c41]Zhou Zhao, Ben Gao, Vincent W. Zheng, Deng Cai, Xiaofei He, Yueting Zhuang:
Link Prediction via Ranking Metric Dual-Level Attention Network Learning. IJCAI 2017: 3525-3531 - [c40]Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang:
Microblog Sentiment Classification via Recurrent Random Walk Network Learning. IJCAI 2017: 3532-3538 - [c39]Zhou Zhao, Xinlu Chen, Ashok Srivastava, Lu Peng, Saraju P. Mohanty:
Compact Modeling of Graphene Barristor for Digital Integrated Circuit Design. ISVLSI 2017: 562-567 - [c38]Zhou Zhao, Jinghao Lin, Xinghua Jiang, Deng Cai, Xiaofei He, Yueting Zhuang:
Video Question Answering via Hierarchical Dual-Level Attention Network Learning. ACM Multimedia 2017: 1050-1058 - [c37]Xinyang Jiang, Siliang Tang, Yang Yang, Zhou Zhao, Yin Zhang, Fei Wu, Yueting Zhuang:
Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy. ACM Multimedia 2017: 1069-1077 - [c36]Dejing Xu, Zhou Zhao, Jun Xiao, Fei Wu, Hanwang Zhang, Xiangnan He, Yueting Zhuang:
Video Question Answering via Gradually Refined Attention over Appearance and Motion. ACM Multimedia 2017: 1645-1653 - [c35]Lu Chen, Panfeng Huang, Zhou Zhao:
Saliency based proposal refinement in robotic vision. RCAR 2017: 85-90 - [c34]Zhou Zhao, Panfeng Huang, Lu Chen:
Visual tracking and grasping of moving objects and its application to an industrial robot. RCAR 2017: 555-560 - [c33]Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang:
Video Question Answering via Attribute-Augmented Attention Network Learning. SIGIR 2017: 829-832 - [c32]Zhou Zhao, Qifan Yang, Hanqing Lu, Min Yang, Jun Xiao, Fei Wu, Yueting Zhuang:
Learning Max-Margin GeoSocial Multimedia Network Representations for Point-of-Interest Suggestion. SIGIR 2017: 833-836 - [c31]Min Yang, Zhou Zhao, Wei Zhao, Xiaojun Chen, Jia Zhu, Lianqiang Zhou, Zigang Cao:
Personalized Response Generation via Domain adaptation. SIGIR 2017: 1021-1024 - [c30]Zheqian Chen, Ben Gao, Huimin Zhang, Zhou Zhao, Haifeng Liu, Deng Cai:
User Personalized Satisfaction Prediction via Multiple Instance Deep Learning. WWW 2017: 907-915 - [i9]Hongyang Xue, Zhou Zhao, Deng Cai:
The Forgettable-Watcher Model for Video Question Answering. CoRR abs/1705.01253 (2017) - [i8]Yunan Ye, Zhou Zhao, Yimeng Li, Long Chen, Jun Xiao, Yueting Zhuang:
Video Question Answering via Attribute-Augmented Attention Network Learning. CoRR abs/1707.06355 (2017) - [i7]Boyuan Pan, Hao Li, Zhou Zhao, Bin Cao, Deng Cai, Xiaofei He:
MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension. CoRR abs/1707.09098 (2017) - [i6]Zheqian Chen, Rongqin Yang, Bin Cao, Zhou Zhao, Deng Cai, Xiaofei He:
Smarnet: Teaching Machines to Read and Comprehend Like Human. CoRR abs/1710.02772 (2017) - [i5]Boyuan Pan, Hao Li, Zhou Zhao, Deng Cai, Xiaofei He:
Keyword-based Query Comprehending via Multiple Optimized-Demand Augmentation. CoRR abs/1711.00179 (2017) - [i4]Zheqian Chen, Rongqin Yang, Zhou Zhao, Deng Cai, Xiaofei He:
Dialogue Act Recognition via CRF-Attentive Structured Network. CoRR abs/1711.05568 (2017) - [i3]Wei Zhao, Benyou Wang, Jianbo Ye, Yongqiang Gao, Min Yang, Zhou Zhao, Xiaojun Chen:
Leveraging Long and Short-term Information in Content-aware Movie Recommendation. CoRR abs/1712.09059 (2017) - 2016
- [j12]Hanqing Lu, Chaochao Chen, Ming Kong, Hanyi Zhang, Zhou Zhao:
Social recommendation via multi-view user preference learning. Neurocomputing 216: 61-71 (2016) - [j11]Xinyu Wang, Zhou Zhao, Wilfred Ng:
USTF: A Unified System of Team Formation. IEEE Trans. Big Data 2(1): 70-84 (2016) - [j10]Zhou Zhao, Xiaofei He, Deng Cai, Lijun Zhang, Wilfred Ng, Yueting Zhuang:
Graph Regularized Feature Selection with Data Reconstruction. IEEE Trans. Knowl. Data Eng. 28(3): 689-700 (2016) - [j9]Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang:
User Preference Learning for Online Social Recommendation. IEEE Trans. Knowl. Data Eng. 28(9): 2522-2534 (2016) - [c29]Hanyin Fang, Fei Wu, Zhou Zhao, Xinyu Duan, Yueting Zhuang, Martin Ester:
Community-Based Question Answering via Heterogeneous Social Network Learning. AAAI 2016: 122-128 - [c28]Weikeng Chen, Zhou Zhao, Xinyu Wang, Wilfred Ng:
Crowdsourced Query Processing on Microblogs. DASFAA (1) 2016: 18-32 - [c27]Zhou Zhao, Yicong Zhou:
PLIP based unsharp masking for medical image enhancement. ICASSP 2016: 1238-1242 - [c26]Zhou Zhao, Qifan Yang, Deng Cai, Xiaofei He, Yueting Zhuang:
Expert Finding for Community-Based Question Answering via Ranking Metric Network Learning. IJCAI 2016: 3000-3006 - [c25]Md. Fahad, Zhou Zhao, Ashok Srivastava, Lu Peng:
Modeling of Graphene Nanoribbon Tunnel Field Effect Transistor in Verilog-A for Digital Circuit Design. iNIS 2016: 1-5 - [c24]Zhou Zhao, Ashok Srivastava, Lu Peng, Saraju P. Mohanty:
A Low-Cost Mixed Clock Generator for High Speed Adiabatic Logic. ISVLSI 2016: 587-590 - [c23]Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang:
Partial Multi-Modal Sparse Coding via Adaptive Similarity Structure Regularization. ACM Multimedia 2016: 152-156 - [c22]Zhou Zhao, Yicong Zhou:
Comparative study of logarithmic image processing models for medical image enhancement. SMC 2016: 1046-1050 - [i2]Zheqian Chen, Ben Gao, Huimin Zhang, Zhou Zhao, Deng Cai:
User Personalized Satisfaction Prediction via Multiple Instance Deep Learning. CoRR abs/1611.08096 (2016) - [i1]Zheqian Chen, Chi Zhang, Zhou Zhao, Deng Cai:
Question Retrieval for Community-based Question Answering via Heterogeneous Network Integration Learning. CoRR abs/1611.08135 (2016) - 2015
- [j8]Da Yan, Zhou Zhao, Wilfred Ng:
Efficient processing of optimal meeting point queries in Euclidean space and road networks. Knowl. Inf. Syst. 42(2): 319-351 (2015) - [j7]Da Yan, James Cheng, Zhou Zhao, Wilfred Ng:
Efficient location-based search of trajectories with location importance. Knowl. Inf. Syst. 45(1): 215-245 (2015) - [j6]Shaoming Chen, Lu Peng, Yue Hu, Zhou Zhao, Ashok Srivastava, Ying Zhang, Jin-Woo Choi, Bin Li, Edward Song:
Powering Up Dark Silicon: Mitigating the Limitation of Power Delivery via Dynamic Pin Switching. IEEE Trans. Emerg. Top. Comput. 3(4): 489-501 (2015) - [j5]Da Yan, Zhou Zhao, Wilfred Ng, Steven Liu:
Probabilistic Convex Hull Queries over Uncertain Data. IEEE Trans. Knowl. Data Eng. 27(3): 852-865 (2015) - [j4]Zhou Zhao, Lijun Zhang, Xiaofei He, Wilfred Ng:
Expert Finding for Question Answering via Graph Regularized Matrix Completion. IEEE Trans. Knowl. Data Eng. 27(4): 993-1004 (2015) - [c21]Zhou Zhao, Furu Wei, Ming Zhou, Wilfred Ng:
Cold-Start Expert Finding in Community Question Answering via Graph Regularization. DASFAA (1) 2015: 21-38 - [c20]Xinyu Wang, Zhou Zhao, Wilfred Ng:
A Comparative Study of Team Formation in Social Networks. DASFAA (1) 2015: 389-404 - [c19]Zhou Zhao, Furu Wei, Ming Zhou, Weikeng Chen, Wilfred Ng:
Crowd-Selection Query Processing in Crowdsourcing Databases: A Task-Driven Approach. EDBT 2015: 397-408 - [c18]Juan Li, Xiao Liu, Zhou Zhao, Jin Liu:
Energy Consumption Prediction Based on Time-Series Models for CPU-Intensive Activities in the Cloud. ICA3PP (4) 2015: 756-769 - [c17]Zhou Zhao, Ruihua Song, Xing Xie, Xiaofei He, Yueting Zhuang:
Mobile Query Recommendation via Tensor Function Learning. IJCAI 2015: 4084-4090 - [c16]Zhou Zhao, Ashok Srivastava, Lu Peng, Shaoming Chen, Saraju P. Mohanty:
Circuit Implementation of Switchable Pins in Chip Multiprocessor. iNIS 2015: 89-94 - [c15]Zhou Zhao, Ashok Srivastava, Shaoming Chen, Saraju P. Mohanty:
An Algorithm Used in a Power Monitor to Mitigate Dark Silicon on VLSI Chip. ISVLSI 2015: 191-194 - [c14]Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting Zhuang:
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment. ACM Multimedia 2015: 69-78 - 2014
- [j3]Zhou Zhao, Da Yan, Wilfred Ng:
Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases. IEEE Trans. Knowl. Data Eng. 26(5): 1171-1184 (2014) - [c13]Zhou Zhao, James Cheng, Furu Wei, Ming Zhou, Wilfred Ng, Yingjun Wu:
SocialTransfer: Transferring Social Knowledge for Cold-Start Cowdsourcing. CIKM 2014: 779-788 - [c12]Zhou Zhao, James Cheng, Wilfred Ng:
Truth Discovery in Data Streams: A Single-Pass Probabilistic Approach. CIKM 2014: 1589-1598 - [c11]Zhou Zhao, Futian Wang, Xiaoliang Fan, Xiao Liu:
Temporal Verification for Business Cloud Workflows: Open Research Issues. SKG 2014: 33-40 - 2013
- [c10]Zhou Zhao, Wilfred Ng, Zhijun Zhang:
CrowdSeed: query processing on microblogs. EDBT 2013: 729-732 - [c9]Zhou Zhao, Da Yan, Wilfred Ng, Shi Gao:
A transfer learning based framework of crowd-selection on twitter. KDD 2013: 1514-1517 - 2012
- [c8]Da Yan, Zhou Zhao, Wilfred Ng:
Leveraging read rates of passive RFID tags for real-time indoor location tracking. CIKM 2012: 375-384 - [c7]Zhou Zhao, Wilfred Ng:
A model-based approach for RFID data stream cleansing. CIKM 2012: 862-871 - [c6]Da Yan, Zhou Zhao, Wilfred Ng:
Monochromatic and bichromatic reverse nearest neighbor queries on land surfaces. CIKM 2012: 942-951 - [c5]Zhou Zhao, Da Yan, Wilfred Ng:
Mining probabilistically frequent sequential patterns in uncertain databases. EDBT 2012: 74-85 - [c4]Zhou Zhao, Da Yan, Wilfred Ng:
A probabilistic convex hull query tool. EDBT 2012: 570-573 - 2011
- [j2]Da Yan, Zhou Zhao, Wilfred Ng:
Efficient Algorithms for Finding Optimal Meeting Point on Road Networks. Proc. VLDB Endow. 4(11): 968-979 (2011) - [c3]Qiang Wang, Huoming Zhang, Juan Jiang, Wenjun Gao, Zhou Zhao:
Study on the application of improved simulated annealing algorithm for several types of optimization problems. ICNC 2011: 1574-1577 - 2010
- [j1]Shirui Wang, Zhou Zhao, Chao You:
0.18 μm CMOS integrated circuit design for impedance-based structural health monitoring. IET Circuits Devices Syst. 4(3): 227-238 (2010)
2000 – 2009
- 2009
- [c2]Zhou Zhao, En-ke Hou, Zhihua Zhang, Nian-dong Deng:
Three Dimensional Geological Modeling from Component-based Topological Data Model. ICCMS 2009: 43-46 - [c1]Zhihua Zhang, En-ke Hou, Zhou Zhao, Nian-dong Deng, Wen Pang:
An Improved Symmetrical Modeling Method on 3D Tunnel Modeling. ICCMS 2009: 251-256
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-17 21:53 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint