Picture for Luis C. Cobo

Luis C. Cobo

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Large-scale multilingual audio visual dubbing

Nov 06, 2020
Figure 1 for Large-scale multilingual audio visual dubbing
Figure 2 for Large-scale multilingual audio visual dubbing
Figure 3 for Large-scale multilingual audio visual dubbing
Figure 4 for Large-scale multilingual audio visual dubbing
Viaarxiv icon

High Fidelity Speech Synthesis with Adversarial Networks

Add code
Sep 26, 2019
Figure 1 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 2 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 3 for High Fidelity Speech Synthesis with Adversarial Networks
Figure 4 for High Fidelity Speech Synthesis with Adversarial Networks
Viaarxiv icon

Sample Efficient Adaptive Text-to-Speech

Add code
Sep 27, 2018
Figure 1 for Sample Efficient Adaptive Text-to-Speech
Figure 2 for Sample Efficient Adaptive Text-to-Speech
Figure 3 for Sample Efficient Adaptive Text-to-Speech
Figure 4 for Sample Efficient Adaptive Text-to-Speech
Viaarxiv icon

Parallel WaveNet: Fast High-Fidelity Speech Synthesis

Add code
Nov 28, 2017
Figure 1 for Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Figure 2 for Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Figure 3 for Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Figure 4 for Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Viaarxiv icon