Picture for Quanquan Gu

Quanquan Gu

General Preference Modeling with Preference Representations for Aligning Language Models

Add code
Oct 03, 2024
Figure 1 for General Preference Modeling with Preference Representations for Aligning Language Models
Figure 2 for General Preference Modeling with Preference Representations for Aligning Language Models
Figure 3 for General Preference Modeling with Preference Representations for Aligning Language Models
Figure 4 for General Preference Modeling with Preference Representations for Aligning Language Models
Viaarxiv icon

Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

Add code
Oct 03, 2024
Figure 1 for Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis
Figure 2 for Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis
Viaarxiv icon

LLaVA-Critic: Learning to Evaluate Multimodal Models

Add code
Oct 03, 2024
Viaarxiv icon

ProteinBench: A Holistic Evaluation of Protein Foundation Models

Add code
Sep 10, 2024
Viaarxiv icon

Relative-Translation Invariant Wasserstein Distance

Add code
Sep 04, 2024
Figure 1 for Relative-Translation Invariant Wasserstein Distance
Figure 2 for Relative-Translation Invariant Wasserstein Distance
Figure 3 for Relative-Translation Invariant Wasserstein Distance
Figure 4 for Relative-Translation Invariant Wasserstein Distance
Viaarxiv icon

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Add code
Jul 19, 2024
Viaarxiv icon

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Add code
Jun 24, 2024
Figure 1 for Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Figure 2 for Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Figure 3 for Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Figure 4 for Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Viaarxiv icon

Self-Play Preference Optimization for Language Model Alignment

Add code
May 01, 2024
Figure 1 for Self-Play Preference Optimization for Language Model Alignment
Figure 2 for Self-Play Preference Optimization for Language Model Alignment
Figure 3 for Self-Play Preference Optimization for Language Model Alignment
Figure 4 for Self-Play Preference Optimization for Language Model Alignment
Viaarxiv icon

Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

Add code
Apr 18, 2024
Viaarxiv icon

Guided Discrete Diffusion for Electronic Health Record Generation

Add code
Apr 18, 2024
Figure 1 for Guided Discrete Diffusion for Electronic Health Record Generation
Figure 2 for Guided Discrete Diffusion for Electronic Health Record Generation
Figure 3 for Guided Discrete Diffusion for Electronic Health Record Generation
Figure 4 for Guided Discrete Diffusion for Electronic Health Record Generation
Viaarxiv icon