Liyang Chen

Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model

Apr 21, 2026
Via arXiv

Seedance 2.0: Advancing Video Generation for World Complexity

Apr 15, 2026
Via arXiv

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation

Feb 12, 2026
Via arXiv

AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning

Feb 11, 2026
Via arXiv

OptiSQL: Executable SQL Generation from Optical Tokens

Jan 21, 2026
Via arXiv

From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing

Dec 31, 2025
Via arXiv

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

Oct 09, 2025
Via arXiv

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Sep 10, 2025
Via arXiv

BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite

Jun 08, 2025
Via arXiv

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Aug 26, 2024
Via arXiv