Picture for Jianfeng Gao

Jianfeng Gao

EJ

Adapting Web Agents with Synthetic Supervision

Add code
Nov 08, 2025
Viaarxiv icon

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

Add code
Oct 10, 2025
Figure 1 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 2 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 3 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 4 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Viaarxiv icon

FlowRL: Matching Reward Distributions for LLM Reasoning

Add code
Sep 18, 2025
Viaarxiv icon

SAS: Simulated Attention Score

Add code
Jul 10, 2025
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Figure 1 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 2 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 3 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 4 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Viaarxiv icon

Training Language Models to Generate Quality Code with Program Analysis Feedback

Add code
May 28, 2025
Figure 1 for Training Language Models to Generate Quality Code with Program Analysis Feedback
Figure 2 for Training Language Models to Generate Quality Code with Program Analysis Feedback
Figure 3 for Training Language Models to Generate Quality Code with Program Analysis Feedback
Figure 4 for Training Language Models to Generate Quality Code with Program Analysis Feedback
Viaarxiv icon

EfficientLLM: Efficiency in Large Language Models

Add code
May 20, 2025
Viaarxiv icon

Text Generation Beyond Discrete Token Sampling

Add code
May 20, 2025
Figure 1 for Text Generation Beyond Discrete Token Sampling
Figure 2 for Text Generation Beyond Discrete Token Sampling
Figure 3 for Text Generation Beyond Discrete Token Sampling
Figure 4 for Text Generation Beyond Discrete Token Sampling
Viaarxiv icon

SITE: towards Spatial Intelligence Thorough Evaluation

Add code
May 08, 2025
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Figure 1 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 2 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 3 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 4 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Viaarxiv icon