Xin Jin

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Jun 17, 2026
Via arXiv

AutoPDE: Reliable Agentic PDE Solving via Explicitly Represented Solver Strategies

Jun 09, 2026
Via arXiv

CP4D: Compositional Physics-aware 4D Scene Generation

Jun 08, 2026
Via arXiv

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Jun 08, 2026
Via arXiv

Physics-Informed Video Generation via Mixture-of-Experts Latent Alignment

Jun 03, 2026
Via arXiv

EarlyTom: Early Token Compression Completes Fast Video Understanding

May 28, 2026
Via arXiv

BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training

May 25, 2026
Via arXiv

RankE: End-to-End Post-Training for Discrete Text-to-Image Generation with Decoder Co-Evolution

May 20, 2026
Via arXiv

GTA: Advancing Image-to-3D World Generation via Geometry Then Appearance Video Diffusion

May 13, 2026
Via arXiv

LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore)

May 06, 2026
Via arXiv