Picture for Danqi Chen

Danqi Chen

Shammie

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

Add code
Apr 13, 2026
Viaarxiv icon

Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks

Add code
Apr 13, 2026
Viaarxiv icon

Vero: An Open RL Recipe for General Visual Reasoning

Add code
Apr 07, 2026
Viaarxiv icon

DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs

Add code
Feb 25, 2026
Viaarxiv icon

Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search

Add code
Oct 21, 2025
Figure 1 for Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search
Figure 2 for Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search
Figure 3 for Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search
Figure 4 for Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search
Viaarxiv icon

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting

Add code
Oct 21, 2025
Figure 1 for Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Figure 2 for Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Figure 3 for Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Figure 4 for Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Viaarxiv icon

Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking

Add code
Jun 11, 2025
Viaarxiv icon

Precise Information Control in Long-Form Text Generation

Add code
Jun 06, 2025
Viaarxiv icon

The Deployment of End-to-End Audio Language Models Should Take into Account the Principle of Least Privilege

Add code
Mar 21, 2025
Figure 1 for The Deployment of End-to-End Audio Language Models Should Take into Account the Principle of Least Privilege
Figure 2 for The Deployment of End-to-End Audio Language Models Should Take into Account the Principle of Least Privilege
Figure 3 for The Deployment of End-to-End Audio Language Models Should Take into Account the Principle of Least Privilege
Figure 4 for The Deployment of End-to-End Audio Language Models Should Take into Account the Principle of Least Privilege
Viaarxiv icon

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

Add code
Feb 11, 2025
Viaarxiv icon