Zhexin Hu

ZipRL: Adaptive Multi-Turn Context Compression with Hindsight Response Replay

May 27, 2026

AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment

May 18, 2026

ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs

Dec 18, 2025