Picture for Gao Huang

Gao Huang

MemoryVLA++: Temporal Modeling via Memory and Imagination in Vision-Language-Action Models

Add code
Jun 08, 2026
Viaarxiv icon

Towards World Models in Biomedical Research

Add code
Jun 04, 2026
Viaarxiv icon

Potential-Guided Flow Matching for Vision-Language-Action Policy Improvement

Add code
Jun 03, 2026
Viaarxiv icon

Action with Visual Primitives

Add code
May 21, 2026
Viaarxiv icon

From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

Add code
May 21, 2026
Viaarxiv icon

InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation

Add code
May 14, 2026
Viaarxiv icon

Steering Visual Generation in Unified Multimodal Models with Understanding Supervision

Add code
May 07, 2026
Viaarxiv icon

Linear-Time Global Visual Modeling without Explicit Attention

Add code
May 06, 2026
Viaarxiv icon

Linearizing Vision Transformer with Test-Time Training

Add code
May 04, 2026
Viaarxiv icon

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

Add code
Apr 28, 2026
Viaarxiv icon