Picture for Shuohuan Wang

Shuohuan Wang

ConSA: Controllable Sparsity in Hybrid Attention via Learnable Allocation

Add code
Jun 16, 2026
Viaarxiv icon

Memento: Reconstruct to Remember for Consistent Long Video Generation

Add code
Jun 12, 2026
Viaarxiv icon

Native Audio-Visual Alignment for Generation

Add code
May 28, 2026
Viaarxiv icon

CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models

Add code
Apr 06, 2026
Viaarxiv icon

Sparse Growing Transformer: Training-Time Sparse Depth Allocation via Progressive Attention Looping

Add code
Mar 25, 2026
Viaarxiv icon

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

Add code
Mar 05, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction

Add code
Jan 09, 2026
Viaarxiv icon

Blink: Dynamic Visual Token Resolution for Enhanced Multimodal Understanding

Add code
Dec 11, 2025
Viaarxiv icon

Advantageous Parameter Expansion Training Makes Better Large Language Models

Add code
May 30, 2025
Figure 1 for Advantageous Parameter Expansion Training Makes Better Large Language Models
Figure 2 for Advantageous Parameter Expansion Training Makes Better Large Language Models
Figure 3 for Advantageous Parameter Expansion Training Makes Better Large Language Models
Figure 4 for Advantageous Parameter Expansion Training Makes Better Large Language Models
Viaarxiv icon