Picture for Qiyang Li

Qiyang Li

LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design

Add code
May 28, 2024
Viaarxiv icon

Learning Visuotactile Skills with Two Multifingered Hands

Add code
Apr 25, 2024
Viaarxiv icon

REFACTOR: Learning to Extract Theorems from Proofs

Add code
Feb 26, 2024
Figure 1 for REFACTOR: Learning to Extract Theorems from Proofs
Figure 2 for REFACTOR: Learning to Extract Theorems from Proofs
Figure 3 for REFACTOR: Learning to Extract Theorems from Proofs
Figure 4 for REFACTOR: Learning to Extract Theorems from Proofs
Viaarxiv icon

Accelerating Exploration with Unlabeled Prior Data

Add code
Nov 21, 2023
Figure 1 for Accelerating Exploration with Unlabeled Prior Data
Figure 2 for Accelerating Exploration with Unlabeled Prior Data
Figure 3 for Accelerating Exploration with Unlabeled Prior Data
Figure 4 for Accelerating Exploration with Unlabeled Prior Data
Viaarxiv icon

Efficient Deep Reinforcement Learning Requires Regulating Overfitting

Add code
Apr 20, 2023
Figure 1 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 2 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 3 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Figure 4 for Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Viaarxiv icon

Understanding the Complexity Gains of Single-Task RL with a Curriculum

Add code
Dec 24, 2022
Figure 1 for Understanding the Complexity Gains of Single-Task RL with a Curriculum
Figure 2 for Understanding the Complexity Gains of Single-Task RL with a Curriculum
Figure 3 for Understanding the Complexity Gains of Single-Task RL with a Curriculum
Figure 4 for Understanding the Complexity Gains of Single-Task RL with a Curriculum
Viaarxiv icon

AdaCat: Adaptive Categorical Discretization for Autoregressive Models

Add code
Aug 03, 2022
Figure 1 for AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Figure 2 for AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Figure 3 for AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Figure 4 for AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Viaarxiv icon

Reinforcement Learning as One Big Sequence Modeling Problem

Add code
Jun 03, 2021
Figure 1 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 2 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 3 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 4 for Reinforcement Learning as One Big Sequence Modeling Problem
Viaarxiv icon

Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks

Add code
Nov 09, 2019
Figure 1 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 2 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 3 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Figure 4 for Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
Viaarxiv icon

TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer

Add code
Nov 22, 2018
Figure 1 for TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Figure 2 for TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Figure 3 for TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Figure 4 for TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Viaarxiv icon