Zhiyuan Zhao 0001

Person information

affiliation: Microsoft Research Asia, Beijing, China

2020 – today

2024
[c8]
persistent URL: https://dblp.org/rec/conf/cvpr/WengFWDWYZQBYLZ22
Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong:
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models. CVPR Workshops 2024: 7395-7405
[c7]
persistent URL: https://dblp.org/rec/conf/cvpr/WangBWFYYZDZWQY24
Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo:
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation. CVPR 2024: 8414-8424
2023
[c6]
persistent URL: https://dblp.org/rec/conf/icassp/ZhaoWTYZL23
Zhiyuan Zhao, Lijun Wu, Chuanxin Tang, Dacheng Yin, Yucheng Zhao, Chong Luo:
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss. ICASSP 2023: 1-5
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/YinZTXL23
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo:
TridentSE: Guiding Speech Enhancement with 32 Global Tokens. INTERSPEECH 2023: 3839-3843
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-05922
Zhiyuan Zhao, Lijun Wu, Chuanxin Tang, Dacheng Yin, Yucheng Zhao, Chong Luo:
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss. CoRR abs/2304.05922 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-18829
Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo:
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation. CoRR abs/2311.18829 (2023)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-18834
Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong:
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models. CoRR abs/2311.18834 (2023)
2022
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/YinTLWZZXZL22
Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo:
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion. INTERSPEECH 2022: 1571-1575
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/ZhaoTYL22
Zhiyuan Zhao, Chuanxin Tang, Chengdong Yao, Chong Luo:
An Anchor-Free Detector for Continuous Speech Keyword Spotting. INTERSPEECH 2022: 3228-3232
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-13865
Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo:
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion. CoRR abs/2206.13865 (2022)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2208-04622
Zhiyuan Zhao, Chuanxin Tang, Chengdong Yao, Chong Luo:
An Anchor-Free Detector for Continuous Speech Keyword Spotting. CoRR abs/2208.04622 (2022)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-12995
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo:
TridentSE: Guiding Speech Enhancement with 32 Global Tokens. CoRR abs/2210.12995 (2022)
2021
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/TangLZYZZ21
Chuanxin Tang, Chong Luo, Zhiyuan Zhao, Dacheng Yin, Yucheng Zhao, Wenjun Zeng:
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration. Interspeech 2021: 3600-3604
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-01930
Yucheng Zhao, Dacheng Yin, Chong Luo, Zhiyuan Zhao, Chuanxin Tang, Wenjun Zeng, Zheng-Jun Zha:
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework. CoRR abs/2102.01930 (2021)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-05426
Chuanxin Tang, Chong Luo, Zhiyuan Zhao, Dacheng Yin, Yucheng Zhao, Wenjun Zeng:
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration. CoRR abs/2109.05426 (2021)
2020
[c1]
persistent URL: https://dblp.org/rec/conf/ijcai/TangLZXZ20
Chuanxin Tang, Chong Luo, Zhiyuan Zhao, Wenxuan Xie, Wenjun Zeng:
Joint Time-Frequency and Time Domain Learning for Speech Enhancement. IJCAI 2020: 3816-3822