Quanquan Gu

Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems

Feb 29, 2024
Via arXiv

Diffusion Language Models Are Versatile Protein Learners

Feb 28, 2024
Via arXiv

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Feb 15, 2024
Via arXiv

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Feb 15, 2024
Via arXiv

Reinforcement Learning from Human Feedback with Active Queries

Feb 14, 2024
Via arXiv

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Feb 14, 2024
Via arXiv

Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance

Feb 13, 2024
Via arXiv

TrustLLM: Trustworthiness in Large Language Models

Jan 25, 2024
Via arXiv

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Jan 02, 2024
Via arXiv

Sparse PCA with Oracle Property

Dec 28, 2023
Via arXiv