Picture for Jianfeng Gao

Jianfeng Gao

EJ

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

Training Large Reasoning Models Efficiently via Progressive Thought Encoding

Add code
Feb 18, 2026
Viaarxiv icon

SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks

Add code
Feb 06, 2026
Viaarxiv icon

Test-time Recursive Thinking: Self-Improvement without External Feedback

Add code
Feb 03, 2026
Viaarxiv icon

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

Add code
Jan 30, 2026
Viaarxiv icon

VideoWeave: A Data-Centric Approach for Efficient Video Understanding

Add code
Jan 09, 2026
Viaarxiv icon

Adapting Web Agents with Synthetic Supervision

Add code
Nov 08, 2025
Viaarxiv icon

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

Add code
Oct 10, 2025
Figure 1 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 2 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 3 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Figure 4 for Dyna-Mind: Learning to Simulate from Experience for Better AI Agents
Viaarxiv icon

FlowRL: Matching Reward Distributions for LLM Reasoning

Add code
Sep 18, 2025
Viaarxiv icon

SAS: Simulated Attention Score

Add code
Jul 10, 2025
Viaarxiv icon