Picture for Cho-Jui Hsieh

Cho-Jui Hsieh

UCLA

Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation

Add code
Apr 26, 2026
Viaarxiv icon

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Add code
Apr 20, 2026
Viaarxiv icon

Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training

Add code
Apr 14, 2026
Viaarxiv icon

FRESCO: Benchmarking and Optimizing Re-rankers for Evolving Semantic Conflict in Retrieval-Augmented Generation

Add code
Apr 14, 2026
Viaarxiv icon

AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access

Add code
Mar 23, 2026
Viaarxiv icon

Text is All You Need for Vision-Language Model Jailbreaking

Add code
Jan 31, 2026
Viaarxiv icon

LoL: Longer than Longer, Scaling Video Generation to Hour

Add code
Jan 23, 2026
Viaarxiv icon

FlexAct: Why Learn when you can Pick?

Add code
Jan 10, 2026
Viaarxiv icon

Towards Building efficient Routed systems for Retrieval

Add code
Jan 10, 2026
Viaarxiv icon

Understanding Reward Hacking in Text-to-Image Reinforcement Learning

Add code
Jan 06, 2026
Viaarxiv icon