Hassan Mansoor

Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

Mar 19, 2024
Victor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma

Via arXiv

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Mar 15, 2024
Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

Via arXiv

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Feb 19, 2024
Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Cărbune, Jason Lin, Jindong Chen, Abhanshu Sharma

Via arXiv

LLMs cannot find reasoning errors, but can correct them!

Nov 14, 2023
Gladys Tyen, Hassan Mansoor, Peter Chen, Tony Mak, Victor Cărbune

Via arXiv

The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization

Nov 02, 2023
Sian Gooding, Hassan Mansoor

Via arXiv

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Sep 01, 2023
Harrison Lee, Samrat Phatale, Hassan Mansoor, Kellie Lu, Thomas Mesnard, Colton Bishop, Victor Carbune, Abhinav Rastogi

Via arXiv