Laixi Shi

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

May 30, 2025
Via arXiv

KL-regularization Itself is Differentially Private in Bandits and RLHF

May 23, 2025
Via arXiv

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Feb 27, 2025
Via arXiv

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data

Nov 06, 2024
Via arXiv

Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning?

Sep 30, 2024
Via arXiv

BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning

Jul 15, 2024
Via arXiv

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

Jun 22, 2024
Via arXiv

Tractable Equilibrium Computation in Markov Games through Risk Aversion

Jun 20, 2024
Via arXiv

Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation

May 31, 2024
Via arXiv

Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty

Apr 29, 2024
Via arXiv