Picture for Tiehua Mei

Tiehua Mei

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Add code
May 19, 2026
Viaarxiv icon

Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning

Add code
Mar 10, 2026
Viaarxiv icon

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems

Add code
Jun 04, 2025
Viaarxiv icon