default search action
Yuhta Takida
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j3]Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji:
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes. Trans. Mach. Learn. Res. 2024 (2024) - [c16]Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji:
On the Language Encoder of Contrastive Cross-modal Models. ACL (Findings) 2024: 4923-4940 - [c15]Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji:
BIGVSAN: Enhancing Gan-Based Neural Vocoders with Slicing Adversarial Network. ICASSP 2024: 10121-10125 - [c14]Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon:
Manifold Preserving Guided Diffusion. ICLR 2024 - [c13]Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon:
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion. ICLR 2024 - [c12]Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji:
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer. ICLR 2024 - [i25]Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji:
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes. CoRR abs/2401.00365 (2024) - [i24]Toshimitsu Uesaka, Taiji Suzuki, Yuhta Takida, Chieh-Hsin Lai, Naoki Murata, Yuki Mitsufuji:
Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information. CoRR abs/2404.19228 (2024) - [i23]Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon:
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher. CoRR abs/2405.14822 (2024) - [i22]Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji:
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation. CoRR abs/2405.18503 (2024) - [i21]Kengo Uchida, Takashi Shibuya, Yuhta Takida, Naoki Murata, Shusuke Takahashi, Yuki Mitsufuji:
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training. CoRR abs/2406.01867 (2024) - [i20]Yin-Jyun Luo, Kin Wai Cheuk, Woosung Choi, Toshimitsu Uesaka, Keisuke Toyama, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Wei-Hsiang Liao, Simon Dixon, Yuki Mitsufuji:
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation. CoRR abs/2408.10807 (2024) - [i19]Yunkee Chae, Woosung Choi, Yuhta Takida, Junghyun Koo, Yukara Ikemiya, Zhi Zhong, Kin Wai Cheuk, Marco A. Martínez Ramírez, Kyogu Lee, Wei-Hsiang Liao, Yuki Mitsufuji:
VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression. CoRR abs/2410.06016 (2024) - [i18]Yong-Hyun Park, Chieh-Hsin Lai, Satoshi Hayakawa, Yuhta Takida, Yuki Mitsufuji:
Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models. CoRR abs/2410.07761 (2024) - [i17]Satoshi Hayakawa, Yuhta Takida, Masaaki Imaizumi, Hiromi Wakaki, Yuki Mitsufuji:
Distillation of Discrete Diffusion through Dimensional Correlations. CoRR abs/2410.08709 (2024) - [i16]Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, Yuki Mitsufuji:
G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving. CoRR abs/2410.14710 (2024) - [i15]Bac Nguyen, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Stefano Ermon, Yuki Mitsufuji:
Mitigating Embedding Collapse in Diffusion Models for Categorical Data. CoRR abs/2410.14758 (2024) - [i14]Wei-Hsiang Liao, Yuhta Takida, Yukara Ikemiya, Zhi Zhong, Chieh-Hsin Lai, Giorgio Fabbro, Kazuki Shimada, Keisuke Toyama, Kin Wai Cheuk, Marco A. Martínez Ramírez, Shusuke Takahashi, Stefan Uhlich, Taketo Akama, Woosung Choi, Yuichiro Koyama, Yuki Mitsufuji:
Music Foundation Model as Generic Booster for Music Downstream Tasks. CoRR abs/2411.01135 (2024) - 2023
- [c11]Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji:
Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models. ICASSP 2023: 1-5 - [c10]Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon:
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation. ICML 2023: 18365-18398 - [c9]Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon:
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration. ICML 2023: 25501-25522 - [c8]Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji:
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement. INTERSPEECH 2023: 3824-3828 - [c7]Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji:
Automatic Piano Transcription With Hierarchical Frequency-Time Transformer. ISMIR 2023: 215-222 - [i13]Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon:
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration. CoRR abs/2301.12686 (2023) - [i12]Yuhta Takida, Masaaki Imaizumi, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji:
Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport. CoRR abs/2301.12811 (2023) - [i11]Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji, Stefano Ermon:
On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization. CoRR abs/2306.00367 (2023) - [i10]Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji:
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer. CoRR abs/2307.04305 (2023) - [i9]Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji:
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network. CoRR abs/2309.02836 (2023) - [i8]Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon:
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion. CoRR abs/2310.02279 (2023) - [i7]Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji:
On the Language Encoder of Contrastive Cross-modal Models. CoRR abs/2310.13267 (2023) - [i6]Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon:
Manifold Preserving Guided Diffusion. CoRR abs/2311.16424 (2023) - 2022
- [j2]Yuhta Takida, Wei-Hsiang Liao, Chieh-Hsin Lai, Toshimitsu Uesaka, Shusuke Takahashi, Yuki Mitsufuji:
Preventing oversmoothing in VAE via generalized variance parameterization. Neurocomputing 509: 137-156 (2022) - [c6]Yuhta Takida, Takashi Shibuya, Wei-Hsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji:
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization. ICML 2022: 20987-21012 - [i5]Yuhta Takida, Takashi Shibuya, Wei-Hsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji:
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization. CoRR abs/2205.07547 (2022) - [i4]Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon:
Regularizing Score-based Models with Score Fokker-Planck Equations. CoRR abs/2210.04296 (2022) - [i3]Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji:
A Versatile Diffusion-based Generative Refiner for Speech Enhancement. CoRR abs/2210.17287 (2022) - [i2]Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji:
Unsupervised vocal dereverberation with diffusion-based generative models. CoRR abs/2211.04124 (2022) - 2021
- [c5]Naoki Murata, Yuhta Takida, Tetsu Magariyachi:
Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint. WASPAA 2021: 296-300 - [i1]Yuhta Takida, Wei-Hsiang Liao, Toshimitsu Uesaka, Shusuke Takahashi, Yuki Mitsufuji:
Preventing Posterior Collapse Induced by Oversmoothing in Gaussian VAE. CoRR abs/2102.08663 (2021) - 2020
- [j1]Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari:
Reciprocity gap functional in spherical harmonic domain for gridless sound field decomposition. Signal Process. 169: 107383 (2020) - [c4]Yu Maeno, Yuhta Takida, Naoki Murata, Yuki Mitsufuji:
Array-Geometry-Aware Spatial Active Noise Control Based on Direction-of-Arrival Weighting. ICASSP 2020: 8414-8418
2010 – 2019
- 2019
- [c3]Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari:
Robust Gridless Sound Field Decomposition Based on Structured Reciprocity Gap Functional in Spherical Harmonic Domain. ICASSP 2019: 581-585 - 2018
- [c2]Yuhta Takida, Shoichi Koyama, Hiroshi Saruwatari:
Exterior and Interior Sound Field Separation Using Convex Optimization: Comparison of Signal Models. EUSIPCO 2018: 2549-2553 - [c1]Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari:
Gridless Sound Field Decomposition Based on Reciprocity Gap Functional in Spherical Harmonic Domain. SAM 2018: 627-631
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-12 21:55 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint