Alert button
Picture for Zhiqing Sun

Zhiqing Sun

Alert button

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Bookmark button
Alert button
Apr 02, 2024
Ruohong Zhang, Liangke Gui, Zhiqing Sun, Yihao Feng, Keyang Xu, Yuanhan Zhang, Di Fu, Chunyuan Li, Alexander Hauptmann, Yonatan Bisk, Yiming Yang

Viaarxiv icon

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Add code
Bookmark button
Alert button
Mar 14, 2024
Zhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan

Figure 1 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 2 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 3 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Figure 4 for Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Viaarxiv icon

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild

Add code
Bookmark button
Alert button
Mar 07, 2024
Zhiying Zhu, Zhiqing Sun, Yiming Yang

Figure 1 for HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Figure 2 for HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Figure 3 for HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Figure 4 for HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Viaarxiv icon

Instruction-tuned Language Models are Better Knowledge Learners

Add code
Bookmark button
Alert button
Feb 20, 2024
Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer

Viaarxiv icon

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Add code
Bookmark button
Alert button
Jan 30, 2024
Shun Zhang, Zhenfang Chen, Sunli Chen, Yikang Shen, Zhiqing Sun, Chuang Gan

Viaarxiv icon

SALMON: Self-Alignment with Principle-Following Reward Models

Add code
Bookmark button
Alert button
Oct 09, 2023
Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan

Viaarxiv icon

Aligning Large Multimodal Models with Factually Augmented RLHF

Add code
Bookmark button
Alert button
Sep 25, 2023
Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell

Figure 1 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 2 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 3 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 4 for Aligning Large Multimodal Models with Factually Augmented RLHF
Viaarxiv icon

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation

Add code
Bookmark button
Alert button
Aug 22, 2023
Junwei Huang, Zhiqing Sun, Yiming Yang

Figure 1 for Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
Viaarxiv icon

Active Retrieval Augmented Generation

Add code
Bookmark button
Alert button
May 11, 2023
Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig

Figure 1 for Active Retrieval Augmented Generation
Figure 2 for Active Retrieval Augmented Generation
Figure 3 for Active Retrieval Augmented Generation
Figure 4 for Active Retrieval Augmented Generation
Viaarxiv icon