Huiming Fan

LoopRPT: Reinforcement Pre-Training for Looped Language Models

Mar 20, 2026
Via arXiv

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Feb 15, 2026
Via arXiv

Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

May 25, 2025
Via arXiv