Samrat Phatale

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Mar 15, 2024
Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Sep 01, 2023
Harrison Lee, Samrat Phatale, Hassan Mansoor, Kellie Lu, Thomas Mesnard, Colton Bishop, Victor Carbune, Abhinav Rastogi

Conversational Recommendation as Retrieval: A Simple, Strong Baseline

May 23, 2023
Raghav Gupta, Renat Aksitov, Samrat Phatale, Simral Chaudhary, Harrison Lee, Abhinav Rastogi

Prose for a Painting

Oct 08, 2019
Prerna Kashyap, Samrat Phatale, Iddo Drori
