Tien Mai

Learning What to Do and What Not To Do: Offline Imitation from Expert and Undesirable Demonstrations

May 27, 2025
Via arXiv

MisoDICE: Multi-Agent Imitation from Unlabeled Mixed-Quality Demonstrations

May 24, 2025
Via arXiv

O-MAPL: Offline Multi-agent Preference Learning

Jan 31, 2025
Via arXiv

UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations

Oct 10, 2024
Via arXiv

ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization

Oct 02, 2024
Via arXiv

Outer Approximation and Super-modular Cuts for Constrained Assortment Optimization under Mixed-Logit Model

Jul 26, 2024
Via arXiv

Competitive Facility Location under Random Utilities and Routing Constraints

Mar 09, 2024
Via arXiv

SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations

Feb 20, 2024
Via arXiv

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

Dec 26, 2023
Via arXiv

Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning

Oct 10, 2023
Via arXiv