Situo Zhang

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Apr 07, 2024
Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu

Multi: Multimodal Understanding Leaderboard with Text and Images

Feb 05, 2024
Zichen Zhu, Yang Xu, Lu Chen, Jingkai Yang, Yichuan Ma, Yiming Sun, Hailin Wen, Jiaqi Liu, Jinyu Cai, Yingzi Ma, Situo Zhang, Zihan Zhao, Liangtai Sun, Kai Yu

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Jun 09, 2023
Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu
