Yuki Mitsufuji

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation

Mar 28, 2024
Yutong He, Alexander Robey, Naoki Murata, Yiding Jiang, Joshua Williams, George J. Pappas, Hamed Hassani, Yuki Mitsufuji, Ruslan Salakhutdinov, J. Zico Kolter

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage

Mar 15, 2024
Hao Hao Tan, Kin Wai Cheuk, Taemin Cho, Wei-Hsiang Liao, Yuki Mitsufuji

DiffuCOMET: Contextual Commonsense Knowledge Diffusion

Feb 26, 2024
Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, Antoine Bosselut

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models

Feb 09, 2024
Yixiao Zhang, Yukara Ikemiya, Gus Xia, Naoki Murata, Marco Martínez, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

Dec 31, 2023
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji

Manifold Preserving Guided Diffusion

Nov 28, 2023
Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon

On the Language Encoder of Contrastive Cross-modal Models

Oct 20, 2023
Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

Towards reporting bias in visual-language datasets: bimodal augmentation by decoupling object-attribute association

Oct 02, 2023
Qiyu Wu, Mengjie Zhao, Yutong He, Lang Huang, Junya Ono, Hiromi Wakaki, Yuki Mitsufuji

Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion

Oct 01, 2023
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon

Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription

Sep 27, 2023
Frank Cwitkowitz, Kin Wai Cheuk, Woosung Choi, Marco A. Martínez-Ramírez, Keisuke Toyama, Wei-Hsiang Liao, Yuki Mitsufuji
