Picture for Longbo Huang

Longbo Huang

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Add code
Mar 07, 2024
Figure 1 for RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Figure 2 for RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Figure 3 for RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Figure 4 for RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Viaarxiv icon

Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

Add code
Feb 28, 2024
Viaarxiv icon

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Add code
Feb 28, 2024
Viaarxiv icon

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Add code
Nov 09, 2023
Figure 1 for LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Figure 2 for LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Figure 3 for LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Figure 4 for LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Viaarxiv icon

One is More: Diverse Perspectives within a Single Network for Efficient DRL

Add code
Oct 29, 2023
Viaarxiv icon

A Quadratic Synchronization Rule for Distributed Deep Learning

Add code
Oct 22, 2023
Figure 1 for A Quadratic Synchronization Rule for Distributed Deep Learning
Figure 2 for A Quadratic Synchronization Rule for Distributed Deep Learning
Figure 3 for A Quadratic Synchronization Rule for Distributed Deep Learning
Figure 4 for A Quadratic Synchronization Rule for Distributed Deep Learning
Viaarxiv icon

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Add code
Oct 06, 2023
Figure 1 for Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Figure 2 for Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Figure 3 for Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Figure 4 for Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Viaarxiv icon

Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation

Add code
Jul 06, 2023
Figure 1 for Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation
Figure 2 for Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation
Figure 3 for Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation
Figure 4 for Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation
Viaarxiv icon

Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning

Add code
Jul 04, 2023
Figure 1 for Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Figure 2 for Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Figure 3 for Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Figure 4 for Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Viaarxiv icon

Why does Local SGD Generalize Better than SGD?

Add code
Mar 09, 2023
Figure 1 for Why  does Local SGD Generalize Better than SGD?
Figure 2 for Why  does Local SGD Generalize Better than SGD?
Figure 3 for Why  does Local SGD Generalize Better than SGD?
Figure 4 for Why  does Local SGD Generalize Better than SGD?
Viaarxiv icon