Picture for Robin San Roman

Robin San Roman

MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling

Add code
Jan 07, 2025
Figure 1 for MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling
Figure 2 for MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling
Figure 3 for MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling
Figure 4 for MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling
Viaarxiv icon

Large Concept Models: Language Modeling in a Sentence Representation Space

Add code
Dec 11, 2024
Figure 1 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 2 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 3 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 4 for Large Concept Models: Language Modeling in a Sentence Representation Space
Viaarxiv icon

Latent Watermarking of Audio Generative Models

Add code
Sep 04, 2024
Figure 1 for Latent Watermarking of Audio Generative Models
Figure 2 for Latent Watermarking of Audio Generative Models
Figure 3 for Latent Watermarking of Audio Generative Models
Figure 4 for Latent Watermarking of Audio Generative Models
Viaarxiv icon

Proactive Detection of Voice Cloning with Localized Watermarking

Add code
Jan 30, 2024
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion

Add code
Aug 02, 2023
Figure 1 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 2 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 3 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Figure 4 for From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
Viaarxiv icon

Denoising Diffusion Gamma Models

Add code
Oct 10, 2021
Figure 1 for Denoising Diffusion Gamma Models
Figure 2 for Denoising Diffusion Gamma Models
Figure 3 for Denoising Diffusion Gamma Models
Viaarxiv icon

Non Gaussian Denoising Diffusion Models

Add code
Jun 14, 2021
Figure 1 for Non Gaussian Denoising Diffusion Models
Figure 2 for Non Gaussian Denoising Diffusion Models
Figure 3 for Non Gaussian Denoising Diffusion Models
Figure 4 for Non Gaussian Denoising Diffusion Models
Viaarxiv icon