Quanquan Gu

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Jul 19, 2024
Via arXiv

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Jun 24, 2024
Via arXiv

Self-Play Preference Optimization for Language Model Alignment

May 01, 2024
Via arXiv

Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

Apr 18, 2024
Via arXiv

Guided Discrete Diffusion for Electronic Health Record Generation

Apr 18, 2024
Via arXiv

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Apr 16, 2024
Via arXiv

Settling Constant Regrets in Linear Markov Decision Processes

Apr 16, 2024
Via arXiv

Feel-Good Thompson Sampling for Contextual Dueling Bandits

Apr 09, 2024
Via arXiv

Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization

Mar 25, 2024
Via arXiv

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

Mar 21, 2024
Via arXiv