Picture for Xuanjing Huang

Xuanjing Huang

LIBERO-Occ: Evaluating and Improving Vision-Language-Action Models under Scene-Induced Occlusion via Viewpoint Imagination

Add code
Jun 09, 2026
Viaarxiv icon

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

Add code
Jun 07, 2026
Viaarxiv icon

Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection

Add code
Jun 03, 2026
Viaarxiv icon

EntangleCodec: A Unified Discrete Audio Tokenizer via Semantic-Acoustic Entanglement

Add code
Jun 01, 2026
Viaarxiv icon

AdaptR1: Reinforcement Learning Based Adaptive Interleaved Thinking in Multi-hop Question Answering

Add code
May 29, 2026
Viaarxiv icon

Prompt-Level Reward Specifications for Open-Ended Post-Training

Add code
May 28, 2026
Viaarxiv icon

Rethinking Agentic RAG: Toward LLM-Driven Logical Retrieval Beyond Embeddings

Add code
May 26, 2026
Viaarxiv icon

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

Add code
May 19, 2026
Viaarxiv icon

Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control

Add code
May 14, 2026
Viaarxiv icon

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Add code
May 12, 2026
Viaarxiv icon