Picture for Qinghua Liu

Qinghua Liu

On Limitation of Transformer for Learning HMMs

Add code
Jun 06, 2024
Viaarxiv icon

PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network

Add code
Sep 13, 2023
Figure 1 for PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Figure 2 for PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Figure 3 for PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Figure 4 for PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network
Viaarxiv icon

Is RLHF More Difficult than Standard RL?

Add code
Jun 25, 2023
Figure 1 for Is RLHF More Difficult than Standard RL?
Viaarxiv icon

Context-lumpable stochastic bandits

Add code
Jun 22, 2023
Viaarxiv icon

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Add code
May 18, 2023
Figure 1 for Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
Viaarxiv icon

Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation

Add code
Mar 02, 2023
Viaarxiv icon

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

Add code
Sep 29, 2022
Figure 1 for Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making
Figure 2 for Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making
Viaarxiv icon

Dive into Big Model Training

Add code
Jul 25, 2022
Figure 1 for Dive into Big Model Training
Figure 2 for Dive into Big Model Training
Figure 3 for Dive into Big Model Training
Figure 4 for Dive into Big Model Training
Viaarxiv icon

A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games

Add code
Jul 18, 2022
Figure 1 for A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Figure 2 for A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Figure 3 for A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Figure 4 for A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Viaarxiv icon

Policy Optimization for Markov Games: Unified Framework and Faster Convergence

Add code
Jun 06, 2022
Figure 1 for Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Figure 2 for Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Viaarxiv icon