
Quanquan Gu

General Preference Modeling with Preference Representations for Aligning Language Models

Oct 03, 2024
Via arXiv

ProteinBench: A Holistic Evaluation of Protein Foundation Models

Sep 10, 2024
Via arXiv

Relative-Translation Invariant Wasserstein Distance

Sep 04, 2024
Via arXiv

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Jul 19, 2024
Via arXiv

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Jun 24, 2024
Via arXiv

Self-Play Preference Optimization for Language Model Alignment

May 01, 2024
Via arXiv

Guided Discrete Diffusion for Electronic Health Record Generation

Apr 18, 2024
Via arXiv

Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

Apr 18, 2024
Via arXiv

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Apr 16, 2024
Via arXiv

Settling Constant Regrets in Linear Markov Decision Processes

Apr 16, 2024
Via arXiv