Picture for Junxian He

Junxian He

SWE-RM: Execution-free Feedback For Software Engineering Agents

Add code
Dec 26, 2025
Figure 1 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 2 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 3 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 4 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Viaarxiv icon

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Add code
Oct 29, 2025
Viaarxiv icon

Model-Task Alignment Drives Distinct RL Outcomes

Add code
Aug 28, 2025
Viaarxiv icon

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Add code
Jun 16, 2025
Figure 1 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 2 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 3 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Figure 4 for MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Viaarxiv icon

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Add code
May 28, 2025
Viaarxiv icon

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Add code
May 26, 2025
Viaarxiv icon

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Add code
May 21, 2025
Figure 1 for Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Figure 2 for Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Figure 3 for Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Figure 4 for Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Viaarxiv icon

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Add code
May 08, 2025
Viaarxiv icon

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Add code
Apr 15, 2025
Viaarxiv icon

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Add code
Apr 11, 2025
Figure 1 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 2 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 3 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Figure 4 for Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Viaarxiv icon