Picture for Zhiqing Sun

Zhiqing Sun

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild

Add code
Mar 07, 2024
Viaarxiv icon

Instruction-tuned Language Models are Better Knowledge Learners

Add code
Feb 20, 2024
Figure 1 for Instruction-tuned Language Models are Better Knowledge Learners
Figure 2 for Instruction-tuned Language Models are Better Knowledge Learners
Figure 3 for Instruction-tuned Language Models are Better Knowledge Learners
Figure 4 for Instruction-tuned Language Models are Better Knowledge Learners
Viaarxiv icon

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Add code
Jan 30, 2024
Figure 1 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 2 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 3 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Figure 4 for Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Viaarxiv icon

SALMON: Self-Alignment with Principle-Following Reward Models

Add code
Oct 09, 2023
Figure 1 for SALMON: Self-Alignment with Principle-Following Reward Models
Figure 2 for SALMON: Self-Alignment with Principle-Following Reward Models
Figure 3 for SALMON: Self-Alignment with Principle-Following Reward Models
Figure 4 for SALMON: Self-Alignment with Principle-Following Reward Models
Viaarxiv icon

Aligning Large Multimodal Models with Factually Augmented RLHF

Add code
Sep 25, 2023
Viaarxiv icon

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation

Add code
Aug 22, 2023
Viaarxiv icon

Active Retrieval Augmented Generation

Add code
May 11, 2023
Figure 1 for Active Retrieval Augmented Generation
Figure 2 for Active Retrieval Augmented Generation
Figure 3 for Active Retrieval Augmented Generation
Figure 4 for Active Retrieval Augmented Generation
Viaarxiv icon

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

Add code
May 04, 2023
Figure 1 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 2 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 3 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 4 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Viaarxiv icon

DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization

Add code
Feb 16, 2023
Viaarxiv icon

A Neural PDE Solver with Temporal Stencil Modeling

Add code
Feb 16, 2023
Viaarxiv icon