Picture for Jianlong Chen

Jianlong Chen

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Add code
Jun 16, 2026
Viaarxiv icon

RealMath-Eval: Why SOTA Judges Struggle with Real Human Reasoning

Add code
Jun 08, 2026
Viaarxiv icon

MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning

Add code
Mar 10, 2026
Viaarxiv icon

Revisiting Sharpness-Aware Minimization: A More Faithful and Effective Implementation

Add code
Mar 09, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

Add code
Jan 08, 2026
Viaarxiv icon

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Add code
Dec 30, 2025
Viaarxiv icon

Hierarchical Attention Generates Better Proofs

Add code
Apr 27, 2025
Figure 1 for Hierarchical Attention Generates Better Proofs
Figure 2 for Hierarchical Attention Generates Better Proofs
Figure 3 for Hierarchical Attention Generates Better Proofs
Figure 4 for Hierarchical Attention Generates Better Proofs
Viaarxiv icon

Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models

Add code
Jul 07, 2024
Figure 1 for Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models
Figure 2 for Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models
Viaarxiv icon

MD tree: a model-diagnostic tree grown on loss landscape

Add code
Jun 24, 2024
Figure 1 for MD tree: a model-diagnostic tree grown on loss landscape
Figure 2 for MD tree: a model-diagnostic tree grown on loss landscape
Figure 3 for MD tree: a model-diagnostic tree grown on loss landscape
Figure 4 for MD tree: a model-diagnostic tree grown on loss landscape
Viaarxiv icon