Alert button
Picture for Yuhta Takida

Yuhta Takida

Alert button

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

Add code
Bookmark button
Alert button
Dec 31, 2023
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji

Viaarxiv icon

Manifold Preserving Guided Diffusion

Add code
Bookmark button
Alert button
Nov 28, 2023
Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon

Viaarxiv icon

On the Language Encoder of Contrastive Cross-modal Models

Add code
Bookmark button
Alert button
Oct 20, 2023
Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

Viaarxiv icon

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion

Add code
Bookmark button
Alert button
Oct 01, 2023
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon

Figure 1 for Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
Figure 2 for Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
Figure 3 for Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
Figure 4 for Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
Viaarxiv icon

BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network

Add code
Bookmark button
Alert button
Sep 06, 2023
Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji

Figure 1 for BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Figure 2 for BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Figure 3 for BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Figure 4 for BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Viaarxiv icon

Automatic Piano Transcription with Hierarchical Frequency-Time Transformer

Add code
Bookmark button
Alert button
Jul 10, 2023
Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji

Figure 1 for Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
Figure 2 for Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
Figure 3 for Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
Figure 4 for Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
Viaarxiv icon

On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization

Add code
Bookmark button
Alert button
Jun 01, 2023
Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji, Stefano Ermon

Figure 1 for On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization
Figure 2 for On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization
Viaarxiv icon

Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport

Add code
Bookmark button
Alert button
Jan 30, 2023
Yuhta Takida, Masaaki Imaizumi, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji

Figure 1 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 2 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 3 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Figure 4 for Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
Viaarxiv icon

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration

Add code
Bookmark button
Alert button
Jan 30, 2023
Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon

Figure 1 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 2 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 3 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Figure 4 for GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
Viaarxiv icon

Unsupervised vocal dereverberation with diffusion-based generative models

Add code
Bookmark button
Alert button
Nov 08, 2022
Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji

Figure 1 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 2 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 3 for Unsupervised vocal dereverberation with diffusion-based generative models
Figure 4 for Unsupervised vocal dereverberation with diffusion-based generative models
Viaarxiv icon