Bowen Shi

Movie Gen: A Cast of Media Foundation Models

Oct 17, 2024
Via arXiv

NC-NCD: Novel Class Discovery for Node Classification

Jul 25, 2024
Via arXiv

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Jul 04, 2024
Via arXiv

Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning

Jun 10, 2024
Via arXiv

XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception

Mar 21, 2024
Via arXiv

Towards Privacy-Aware Sign Language Translation at Scale

Feb 14, 2024
Via arXiv

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Jan 18, 2024
Via arXiv

Audiobox: Unified Audio Generation with Natural Language Prompts

Dec 25, 2023
Via arXiv

AiluRus: A Scalable ViT Framework for Dense Prediction

Nov 02, 2023
Via arXiv

Generative Pre-training for Speech with Flow Matching

Oct 25, 2023
Via arXiv