Jiangjie Chen

Curse of Knowledge: When Complex Evaluation Context Benefits yet Biases LLM Judges

Sep 03, 2025, via arXiv

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Aug 26, 2025, via arXiv

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Jul 03, 2025, via arXiv

Can LLMs Learn to Map the World from Local Descriptions?

May 27, 2025, via arXiv

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

May 26, 2025, via arXiv

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

May 21, 2025, via arXiv

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Mar 18, 2025, via arXiv

PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention

Mar 05, 2025, via arXiv

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Feb 13, 2025, via arXiv

Revealing the Barriers of Language Agents in Planning

Oct 16, 2024, via arXiv