Picture for Quanquan Gu

Quanquan Gu

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Add code
Feb 14, 2024
Figure 1 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Figure 2 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Figure 3 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Viaarxiv icon

Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance

Add code
Feb 13, 2024
Viaarxiv icon

TrustLLM: Trustworthiness in Large Language Models

Add code
Jan 25, 2024
Figure 1 for TrustLLM: Trustworthiness in Large Language Models
Figure 2 for TrustLLM: Trustworthiness in Large Language Models
Figure 3 for TrustLLM: Trustworthiness in Large Language Models
Figure 4 for TrustLLM: Trustworthiness in Large Language Models
Viaarxiv icon

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Add code
Jan 02, 2024
Figure 1 for Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Figure 2 for Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Figure 3 for Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Figure 4 for Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Viaarxiv icon

Sparse PCA with Oracle Property

Add code
Dec 28, 2023
Viaarxiv icon

Fast Sampling via De-randomization for Discrete Diffusion Models

Add code
Dec 14, 2023
Viaarxiv icon

A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation

Add code
Nov 26, 2023
Viaarxiv icon

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Add code
Nov 23, 2023
Viaarxiv icon

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Add code
Nov 07, 2023
Viaarxiv icon

Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

Add code
Oct 29, 2023
Figure 1 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 2 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 3 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 4 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Viaarxiv icon