Xuandong Zhao

Making Bias Non-Predictive: Training Robust LLM Judges via Reinforcement Learning

Feb 02, 2026
Via arXiv

Clipping-Free Policy Optimization for Large Language Models

Jan 30, 2026
Via arXiv

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Jan 17, 2026
Via arXiv

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Jan 02, 2026
Via arXiv

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

Jul 10, 2025
Via arXiv

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Jun 17, 2025
Via arXiv

OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models

May 28, 2025
Via arXiv

Learning to Reason without External Rewards

May 26, 2025
Via arXiv

Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services

May 24, 2025
Via arXiv

In-Context Watermarks for Large Language Models

May 22, 2025
Via arXiv