Picture for Xiaosong Zhang

Xiaosong Zhang

UESTC, Chengdu, China

GaMi: Geometry-Agnostic Material Identification via Cross-Modal Subtractive Disentanglement

Add code
May 29, 2026
Viaarxiv icon

Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use

Add code
May 14, 2026
Viaarxiv icon

Improving Multi-turn Dialogue Consistency with Self-Recall Thinking

Add code
May 14, 2026
Viaarxiv icon

FAIL: Flow Matching Adversarial Imitation Learning for Image Generation

Add code
Feb 12, 2026
Viaarxiv icon

GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?

Add code
Feb 05, 2026
Viaarxiv icon

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Add code
Jul 29, 2025
Viaarxiv icon

ReDDiT: Rehashing Noise for Discrete Visual Generation

Add code
May 26, 2025
Viaarxiv icon

Emu3: Next-Token Prediction is All You Need

Add code
Sep 27, 2024
Figure 1 for Emu3: Next-Token Prediction is All You Need
Figure 2 for Emu3: Next-Token Prediction is All You Need
Figure 3 for Emu3: Next-Token Prediction is All You Need
Figure 4 for Emu3: Next-Token Prediction is All You Need
Viaarxiv icon

Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS

Add code
Aug 16, 2024
Viaarxiv icon

Do As I Do: Pose Guided Human Motion Copy

Add code
Jun 24, 2024
Figure 1 for Do As I Do: Pose Guided Human Motion Copy
Figure 2 for Do As I Do: Pose Guided Human Motion Copy
Figure 3 for Do As I Do: Pose Guided Human Motion Copy
Figure 4 for Do As I Do: Pose Guided Human Motion Copy
Viaarxiv icon