Picture for Jinhao Jiang

Jinhao Jiang

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

Add code
Sep 05, 2025
Viaarxiv icon

ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework

Add code
May 23, 2025
Viaarxiv icon

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

Add code
May 22, 2025
Viaarxiv icon

CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability

Add code
May 15, 2025
Figure 1 for CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Figure 2 for CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Figure 3 for CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Figure 4 for CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Viaarxiv icon

Embodied Intelligence: The Key to Unblocking Generalized Artificial Intelligence

Add code
May 11, 2025
Viaarxiv icon

RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library

Add code
Apr 29, 2025
Viaarxiv icon

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Add code
Mar 07, 2025
Figure 1 for R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Figure 2 for R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Figure 3 for R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Figure 4 for R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Viaarxiv icon

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Add code
Mar 06, 2025
Figure 1 for An Empirical Study on Eliciting and Improving R1-like Reasoning Models
Figure 2 for An Empirical Study on Eliciting and Improving R1-like Reasoning Models
Figure 3 for An Empirical Study on Eliciting and Improving R1-like Reasoning Models
Figure 4 for An Empirical Study on Eliciting and Improving R1-like Reasoning Models
Viaarxiv icon

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation

Add code
Feb 11, 2025
Viaarxiv icon