Picture for Longbo Huang

Longbo Huang

Finite-Time Convergence Analysis of ODE-based Generative Models for Stochastic Interpolants

Add code
Aug 10, 2025
Viaarxiv icon

Reparameterization Proximal Policy Optimization

Add code
Aug 08, 2025
Viaarxiv icon

OM2P: Offline Multi-Agent Mean-Flow Policy

Add code
Aug 08, 2025
Viaarxiv icon

Proxy-Free GFlowNet

Add code
May 26, 2025
Viaarxiv icon

Continuous K-Max Bandits

Add code
Feb 19, 2025
Viaarxiv icon

Few is More: Task-Efficient Skill-Discovery for Multi-Task Offline Multi-Agent Reinforcement Learning

Add code
Feb 13, 2025
Viaarxiv icon

Finite-Time Analysis of Discrete-Time Stochastic Interpolants

Add code
Feb 13, 2025
Viaarxiv icon

Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration

Add code
Oct 25, 2024
Figure 1 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 2 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 3 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 4 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Viaarxiv icon

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Add code
Oct 04, 2024
Viaarxiv icon

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Add code
Oct 03, 2024
Figure 1 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 2 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 3 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 4 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Viaarxiv icon