Songhua Liu

National University of Singapore, Shanghai Jiao Tong University

Masked Generative Transformer Is What You Need for Image Editing

May 11, 2026
Via arXiv

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

May 06, 2026
Via arXiv

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Mar 29, 2026
Via arXiv

ViBe: Ultra-High-Resolution Video Synthesis Born from Pure Images

Mar 24, 2026
Via arXiv

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Mar 16, 2026
Via arXiv

MultiAnimate: Pose-Guided Image Animation Made Extensible

Feb 25, 2026
Via arXiv

SpotEdit: Selective Region Editing in Diffusion Transformers

Dec 26, 2025
Via arXiv

EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Dec 12, 2025
Via arXiv

FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation

Nov 18, 2025
Via arXiv

Control and Realism: Best of Both Worlds in Layout-to-Image without Training

Jun 18, 2025
Via arXiv