Hieu-Thi Luong

2020 – today

2024
[i12]
Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng:
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection. CoRR abs/2406.17376 (2024)
2023
[c10]
Hieu-Thi Luong, Junichi Yamagishi:
Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme. INTERSPEECH 2023: 4379-4383
2021
[c9]
Hieu-Thi Luong, Junichi Yamagishi:
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance. SSW 2021: 136-141
[i11]
Hieu-Thi Luong, Junichi Yamagishi:
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance. CoRR abs/2106.13479 (2021)
[i10]
Hieu-Thi Luong, Junichi Yamagishi:
LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example. CoRR abs/2110.04946 (2021)
2020
[b1]
Hieu-Thi Luong:
Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion. Graduate University for Advanced Studies, Japan, 2020
[j2]
Hieu-Thi Luong, Junichi Yamagishi:
NAUTILUS: A Versatile Voice Cloning System. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2967-2981 (2020)
[c8]
Hieu-Thi Luong, Junichi Yamagishi:
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion. Blizzard Challenge / Voice Conversion Challenge 2020
[i9]
Hieu-Thi Luong, Junichi Yamagishi:
NAUTILUS: a Versatile Voice Cloning System. CoRR abs/2005.11004 (2020)
[i8]
Hieu-Thi Luong, Junichi Yamagishi:
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion. CoRR abs/2010.03717 (2020)

2010 – 2019

2019
[c7]
Hieu-Thi Luong, Junichi Yamagishi:
Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech. ASRU 2019: 200-207
[c6]
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora. INTERSPEECH 2019: 1303-1307
[i7]
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora. CoRR abs/1904.00771 (2019)
[i6]
Hieu-Thi Luong, Junichi Yamagishi:
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation. CoRR abs/1906.07414 (2019)
[i5]
Hieu-Thi Luong, Junichi Yamagishi:
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech. CoRR abs/1909.06532 (2019)
2018
[j1]
Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. IEEE Access 6: 60478-60488 (2018)
[c5]
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects. INTERSPEECH 2018: 37-41
[c4]
Hieu-Thi Luong, Junichi Yamagishi:
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation. INTERSPEECH 2018: 2494-2498
[c3]
Hieu-Thi Luong, Junichi Yamagishi:
Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-Based Speech Synthesis Systems. SLT 2018: 610-617
[i4]
Hieu-Thi Luong, Junichi Yamagishi:
Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems. CoRR abs/1807.11632 (2018)
[i3]
Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. CoRR abs/1807.11679 (2018)
[i2]
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects. CoRR abs/1808.00665 (2018)
[i1]
Hieu-Thi Luong, Junichi Yamagishi:
Multimodal speech synthesis architecture for unsupervised speaker adaptation. CoRR abs/1808.06288 (2018)
2017
[c2]
Hieu-Thi Luong, Shinji Takaki, Gustav Eje Henter, Junichi Yamagishi:
Adapting and controlling DNN-based speech synthesis using input codes. ICASSP 2017: 4905-4909
2016
[c1]
Hieu-Thi Luong, Hai-Quan Vu:
A non-expert Kaldi recipe for Vietnamese Speech Recognition System. WLSI/OIAF4HLT@COLING 2016: 51-55
