Picture for Yiran Jenny Shen

Yiran Jenny Shen

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

Add code
May 11, 2026
Viaarxiv icon

SAND: Boosting LLM Agents with Self-Taught Action Deliberation

Add code
Jul 10, 2025
Viaarxiv icon