Picture for Jiahao Qiu

Jiahao Qiu

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Add code
Jun 23, 2025
Viaarxiv icon

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Add code
Jun 17, 2025
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Viaarxiv icon

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

Add code
May 21, 2025
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Viaarxiv icon

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Add code
Apr 13, 2025
Viaarxiv icon

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Add code
Mar 31, 2025
Viaarxiv icon

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Add code
Mar 27, 2025
Viaarxiv icon

Temporal Consistency for LLM Reasoning Process Error Identification

Add code
Mar 18, 2025
Viaarxiv icon