Picture for Kai Tang

Kai Tang

GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling

Add code
Jun 03, 2026
Viaarxiv icon

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Add code
Jun 02, 2026
Viaarxiv icon

Phase-Conditioned Imitation Learning with Autonomous Failure Recovery for Robust Deformable Object Manipulation

Add code
May 28, 2026
Viaarxiv icon

expo: Exploration-prioritized policy optimization via adaptive kl regulation and gaussian curriculum sampling

Add code
May 11, 2026
Viaarxiv icon

Reinforced Curriculum Pre-Alignment for Domain-Adaptive VLMs

Add code
Feb 11, 2026
Viaarxiv icon

TC-IDM: Grounding Video Generation for Executable Zero-shot Robot Motion

Add code
Jan 26, 2026
Viaarxiv icon

Learning from Prompt itself: the Hierarchical Attribution Prompt Optimization

Add code
Jan 06, 2026
Viaarxiv icon

TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning

Add code
Dec 23, 2025
Figure 1 for TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning
Figure 2 for TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning
Figure 3 for TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning
Figure 4 for TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning
Viaarxiv icon

RTFF: Random-to-Target Fabric Flattening Policy using Dual-Arm Manipulator

Add code
Oct 01, 2025
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon