Xinchao Wang

Image Editing As Programs with Diffusion Models

Jun 04, 2025
Via arXiv

Minute-Long Videos with Dual Parallelisms

May 29, 2025
Via arXiv

PixelThink: Towards Efficient Chain-of-Pixel Reasoning

May 29, 2025
Via arXiv

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

May 24, 2025
Via arXiv

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

May 23, 2025
Via arXiv

Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

May 22, 2025
Via arXiv

dKV-Cache: The Cache for Diffusion Language Models

May 21, 2025
Via arXiv

Thinkless: LLM Learns When to Think

May 19, 2025
Via arXiv

Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning

May 17, 2025
Via arXiv

PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning

Apr 22, 2025
Via arXiv