Picture for Liefeng Bo

Liefeng Bo

University of Washington

Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition

Add code
Mar 10, 2025
Viaarxiv icon

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Add code
Mar 07, 2025
Figure 1 for R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning
Figure 2 for R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning
Figure 3 for R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning
Figure 4 for R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning
Viaarxiv icon

LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Add code
Feb 25, 2025
Viaarxiv icon

Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance

Add code
Feb 10, 2025
Figure 1 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 2 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 3 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 4 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Viaarxiv icon

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Add code
Jan 25, 2025
Figure 1 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 2 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 3 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 4 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Viaarxiv icon

EMO2: End-Effector Guided Audio-Driven Avatar Video Generation

Add code
Jan 18, 2025
Figure 1 for EMO2: End-Effector Guided Audio-Driven Avatar Video Generation
Figure 2 for EMO2: End-Effector Guided Audio-Driven Avatar Video Generation
Figure 3 for EMO2: End-Effector Guided Audio-Driven Avatar Video Generation
Figure 4 for EMO2: End-Effector Guided Audio-Driven Avatar Video Generation
Viaarxiv icon

Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions

Add code
Jan 17, 2025
Figure 1 for Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions
Figure 2 for Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions
Figure 3 for Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions
Figure 4 for Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions
Viaarxiv icon

DiffuEraser: A Diffusion Model for Video Inpainting

Add code
Jan 17, 2025
Viaarxiv icon

AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation

Add code
Jan 16, 2025
Viaarxiv icon

Make-A-Character 2: Animatable 3D Character Generation From a Single Image

Add code
Jan 15, 2025
Figure 1 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 2 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 3 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 4 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Viaarxiv icon