Itai Gat

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Jun 16, 2024
Via arXiv

D-Flow: Differentiating through Flows for Controlled Generation

Feb 21, 2024
Via arXiv

SpiRit-LM: Interleaved Spoken and Written Language Model

Feb 08, 2024
Via arXiv

Masked Audio Generation using a Single Non-Autoregressive Transformer

Jan 09, 2024
Via arXiv

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Sep 28, 2023
Via arXiv

Code Llama: Open Foundation Models for Code

Aug 25, 2023
Via arXiv

EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

Aug 10, 2023
Via arXiv

Simple and Controllable Music Generation

Jun 08, 2023
Via arXiv

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

May 22, 2023
Via arXiv

Textually Pretrained Speech Language Models

May 22, 2023
Via arXiv