Picture for Tianyi Zhou

Tianyi Zhou

How Language Models Process Negation

Add code
May 04, 2026
Viaarxiv icon

Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation

Add code
Apr 26, 2026
Viaarxiv icon

Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents

Add code
Apr 24, 2026
Viaarxiv icon

Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

Add code
Apr 22, 2026
Viaarxiv icon

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Add code
Apr 22, 2026
Viaarxiv icon

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Add code
Apr 20, 2026
Viaarxiv icon

Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX

Add code
Apr 16, 2026
Viaarxiv icon

One Model for All: Multi-Objective Controllable Language Models

Add code
Apr 06, 2026
Viaarxiv icon

When AI Navigates the Fog of War

Add code
Mar 17, 2026
Viaarxiv icon

WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems

Add code
Mar 15, 2026
Viaarxiv icon