Picture for Bin Lin

Bin Lin

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Add code
May 28, 2025
Viaarxiv icon

ImgEdit: A Unified Image Editing Dataset and Benchmark

Add code
May 26, 2025
Viaarxiv icon

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video

Add code
Mar 12, 2025
Viaarxiv icon

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Add code
Mar 10, 2025
Viaarxiv icon

Next Patch Prediction for Autoregressive Visual Generation

Add code
Dec 19, 2024
Viaarxiv icon

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses

Add code
Nov 30, 2024
Viaarxiv icon

Open-Sora Plan: Open-Source Large Video Generation Model

Add code
Nov 28, 2024
Figure 1 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 2 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 3 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 4 for Open-Sora Plan: Open-Source Large Video Generation Model
Viaarxiv icon

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Add code
Nov 26, 2024
Viaarxiv icon

Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization

Add code
Oct 18, 2024
Viaarxiv icon

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Add code
Sep 18, 2024
Viaarxiv icon