Picture for Uma Kona

Uma Kona

Ada-RS: Adaptive Rejection Sampling for Selective Thinking

Add code
Feb 23, 2026
Viaarxiv icon

Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

Add code
Feb 18, 2026
Viaarxiv icon

NEMO-4-PAYPAL: Leveraging NVIDIA's Nemo Framework for empowering PayPal's Commerce Agent

Add code
Dec 25, 2025
Viaarxiv icon