Picture for Juanzi Li

Juanzi Li

HG-Bench: A Benchmark for Multi-Page Handwritten Answer-Region Grounding in Automated Homework Assessment

Add code
Jun 24, 2026
Viaarxiv icon

An LMM for Precisely Grounding Elements in Documents

Add code
Jun 23, 2026
Viaarxiv icon

EnvRL: Learn from Environment Dynamics in Agentic Reinforcement Learning

Add code
Jun 16, 2026
Viaarxiv icon

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

Add code
Jun 11, 2026
Viaarxiv icon

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Add code
Jun 03, 2026
Viaarxiv icon

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Add code
May 29, 2026
Viaarxiv icon

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

Add code
May 26, 2026
Viaarxiv icon

StoryAlign: Evaluating and Training Reward Models for Story Generation

Add code
May 06, 2026
Viaarxiv icon

MAIC-UI: Making Interactive Courseware with Generative UI

Add code
Apr 28, 2026
Viaarxiv icon

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Add code
Mar 12, 2026
Viaarxiv icon