Picture for Liefeng Bo

Liefeng Bo

University of Washington

EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation

Add code
Apr 15, 2025
Viaarxiv icon

OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

Add code
Apr 03, 2025
Viaarxiv icon

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Add code
Mar 27, 2025
Viaarxiv icon

GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior

Add code
Mar 14, 2025
Viaarxiv icon

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Add code
Mar 13, 2025
Viaarxiv icon

Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition

Add code
Mar 10, 2025
Viaarxiv icon

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Add code
Mar 07, 2025
Viaarxiv icon

LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Add code
Feb 25, 2025
Viaarxiv icon

Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance

Add code
Feb 10, 2025
Figure 1 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 2 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 3 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Figure 4 for Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Viaarxiv icon

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Add code
Jan 25, 2025
Figure 1 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 2 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 3 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Figure 4 for HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
Viaarxiv icon