Picture for Fei Cheng

Fei Cheng

BenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agents

Add code
May 28, 2026
Viaarxiv icon

Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility

Add code
May 28, 2026
Viaarxiv icon

Revisiting Anthropomorphic Reflection Markers in Large Language Model Reasoning

Add code
May 27, 2026
Viaarxiv icon

Reasoning Depth and Environment Complexity: A Controlled Study of RLVR Data Allocation across Logical Reasoning Tasks

Add code
May 26, 2026
Viaarxiv icon

When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective

Add code
Mar 17, 2026
Viaarxiv icon

Persona Jailbreaking in Large Language Models

Add code
Jan 23, 2026
Viaarxiv icon

Better Generalizing to Unseen Concepts: An Evaluation Framework and An LLM-Based Auto-Labeled Pipeline for Biomedical Concept Recognition

Add code
Jan 23, 2026
Viaarxiv icon

EmplifAI: a Fine-grained Dataset for Japanese Empathetic Medical Dialogues in 28 Emotion Labels

Add code
Jan 15, 2026
Viaarxiv icon

Evaluation Framework for AI Creativity: A Case Study Based on Story Generation

Add code
Jan 07, 2026
Viaarxiv icon

Memorization, Emergence, and Explaining Reversal Failures: A Controlled Study of Relational Semantics in LLMs

Add code
Jan 06, 2026
Viaarxiv icon