Picture for Situo Zhang

Situo Zhang

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Add code
Apr 07, 2024
Viaarxiv icon

Multi: Multimodal Understanding Leaderboard with Text and Images

Add code
Feb 05, 2024
Viaarxiv icon

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Add code
Jun 09, 2023
Viaarxiv icon