Picture for Laixi Shi

Laixi Shi

Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation

Add code
May 04, 2026
Viaarxiv icon

Pushing Forward Pareto Frontiers of Proactive Agents with Behavioral Agentic Optimization

Add code
Feb 11, 2026
Viaarxiv icon

Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity

Add code
Feb 03, 2026
Viaarxiv icon

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

Add code
May 30, 2025
Figure 1 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 2 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 3 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 4 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Viaarxiv icon

KL-regularization Itself is Differentially Private in Bandits and RLHF

Add code
May 23, 2025
Viaarxiv icon

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Add code
Feb 27, 2025
Figure 1 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 2 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 3 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 4 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Viaarxiv icon

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data

Add code
Nov 06, 2024
Figure 1 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 2 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 3 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Figure 4 for Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Viaarxiv icon

Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning?

Add code
Sep 30, 2024
Figure 1 for Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning?
Viaarxiv icon

BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning

Add code
Jul 15, 2024
Figure 1 for BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Figure 2 for BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Figure 3 for BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Figure 4 for BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Viaarxiv icon

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

Add code
Jun 22, 2024
Figure 1 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Figure 2 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Figure 3 for Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Viaarxiv icon