Hang Xu

Latent Visual States for Efficient Multimodal Reasoning

Jun 23, 2026
Via arXiv

AnchorEdit: Maintaining Temporal Consistency in Multi-turn Image Editing via Causal Memory

Jun 10, 2026
Via arXiv

Goal2Pixel: Grounding Goals to Pixels for Vision-Language Navigation

Jun 01, 2026
Via arXiv

SIRIUS-SQL: Anchoring Multi-Candidate Text-to-SQL in Execution Feedback

May 31, 2026
Via arXiv

FedSmoothLoRA: Toward Smoother and Faster Convergence in Federated Low-Rank Adaptation

May 28, 2026
Via arXiv

Accelerating ground state search of spatial photonic Ising machines with genetic-simulated annealing hybrid algorithm

May 22, 2026
Via arXiv

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

May 12, 2026
Via arXiv

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

May 05, 2026
Via arXiv

Shared Autonomy Assisted by Impedance-Driven Anisotropic Guidance Field

May 04, 2026
Via arXiv

When AI reviews science: Can we trust the referee?

Apr 26, 2026
Via arXiv