Picture for Mary Williamson

Mary Williamson

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Audiobox: Unified Audio Generation with Natural Language Prompts

Add code
Dec 25, 2023
Viaarxiv icon

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions

Add code
Dec 14, 2023
Viaarxiv icon

Seamless: Multilingual Expressive and Streaming Speech Translation

Add code
Dec 08, 2023
Figure 1 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 2 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 3 for Seamless: Multilingual Expressive and Streaming Speech Translation
Figure 4 for Seamless: Multilingual Expressive and Streaming Speech Translation
Viaarxiv icon

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Add code
Jun 23, 2023
Viaarxiv icon

droidlet: modular, heterogenous, multi-modal agents

Add code
Jan 25, 2021
Figure 1 for droidlet: modular, heterogenous, multi-modal agents
Figure 2 for droidlet: modular, heterogenous, multi-modal agents
Figure 3 for droidlet: modular, heterogenous, multi-modal agents
Figure 4 for droidlet: modular, heterogenous, multi-modal agents
Viaarxiv icon

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Add code
Jan 02, 2021
Figure 1 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 2 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 3 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 4 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Viaarxiv icon

I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

Add code
Dec 28, 2020
Figure 1 for I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
Figure 2 for I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
Figure 3 for I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
Figure 4 for I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
Viaarxiv icon

Facebook AI's WMT20 News Translation Task Submission

Add code
Nov 16, 2020
Figure 1 for Facebook AI's WMT20 News Translation Task Submission
Figure 2 for Facebook AI's WMT20 News Translation Task Submission
Figure 3 for Facebook AI's WMT20 News Translation Task Submission
Figure 4 for Facebook AI's WMT20 News Translation Task Submission
Viaarxiv icon

Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions

Add code
Jul 13, 2020
Viaarxiv icon