Picture for Milind Tambe

Milind Tambe

Efficient Ensemble Selection from Binary and Pairwise Feedback

Add code
May 10, 2026
Viaarxiv icon

Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health

Add code
Apr 08, 2026
Viaarxiv icon

Many Preferences, Few Policies: Towards Scalable Language Model Personalization

Add code
Apr 05, 2026
Viaarxiv icon

Incentive-Aware AI Safety via Strategic Resource Allocation: A Stackelberg Security Games Perspective

Add code
Feb 06, 2026
Viaarxiv icon

LLM Active Alignment: A Nash Equilibrium Perspective

Add code
Feb 06, 2026
Viaarxiv icon

Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective

Add code
Jan 31, 2026
Viaarxiv icon

Latent Spherical Flow Policy for Reinforcement Learning with Combinatorial Actions

Add code
Jan 29, 2026
Viaarxiv icon

Policy-Embedded Graph Expansion: Networked HIV Testing with Diffusion-Driven Network Samples

Add code
Jan 20, 2026
Viaarxiv icon

Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning

Add code
Jan 16, 2026
Viaarxiv icon

Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information

Add code
Oct 01, 2025
Viaarxiv icon