Picture for Gauri Joshi

Gauri Joshi

PubSwap: Public-Data Off-Policy Coordination for Federated RLVR

Add code
Apr 14, 2026
Viaarxiv icon

Not All Turns Are Equally Hard: Adaptive Thinking Budgets For Efficient Multi-Turn Reasoning

Add code
Apr 06, 2026
Viaarxiv icon

Federate the Router: Learning Language Model Routers with Sparse and Decentralized Evaluations

Add code
Jan 29, 2026
Viaarxiv icon

LOCUS: Low-Dimensional Model Embeddings for Efficient Model Exploration, Comparison, and Selection

Add code
Jan 28, 2026
Viaarxiv icon

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Add code
Jan 20, 2026
Viaarxiv icon

Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning

Add code
Jun 05, 2025
Viaarxiv icon

Navigating the Accuracy-Size Trade-Off with Flexible Model Merging

Add code
May 29, 2025
Viaarxiv icon

Natural Policy Gradient for Average Reward Non-Stationary RL

Add code
Apr 23, 2025
Figure 1 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 2 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 3 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 4 for Natural Policy Gradient for Average Reward Non-Stationary RL
Viaarxiv icon

Initialization Matters: Unraveling the Impact of Pre-Training on Federated Learning

Add code
Feb 11, 2025
Viaarxiv icon

The Cost of Shuffling in Private Gradient Based Optimization

Add code
Feb 05, 2025
Figure 1 for The Cost of Shuffling in Private Gradient Based Optimization
Figure 2 for The Cost of Shuffling in Private Gradient Based Optimization
Figure 3 for The Cost of Shuffling in Private Gradient Based Optimization
Figure 4 for The Cost of Shuffling in Private Gradient Based Optimization
Viaarxiv icon