Alert button
Picture for Karthik Narasimhan

Karthik Narasimhan

Alert button

Can Language Models Solve Olympiad Programming?

Add code
Bookmark button
Alert button
Apr 16, 2024
Quan Shi, Michael Tang, Karthik Narasimhan, Shunyu Yao

Viaarxiv icon

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Add code
Bookmark button
Alert button
Apr 16, 2024
Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

Viaarxiv icon

Language-Guided World Models: A Model-Based Approach to AI Control

Add code
Bookmark button
Alert button
Jan 24, 2024
Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

Viaarxiv icon

QualEval: Qualitative Evaluation for Model Improvement

Add code
Bookmark button
Alert button
Nov 06, 2023
Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik Narasimhan, Ashwin Kalyan

Viaarxiv icon

Progressively Efficient Learning

Add code
Bookmark button
Alert button
Oct 13, 2023
Ruijie Zheng, Khanh Nguyen, Hal Daumé III, Furong Huang, Karthik Narasimhan

Figure 1 for Progressively Efficient Learning
Figure 2 for Progressively Efficient Learning
Figure 3 for Progressively Efficient Learning
Figure 4 for Progressively Efficient Learning
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Bookmark button
Alert button
Oct 10, 2023
Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

Viaarxiv icon

FireAct: Toward Language Agent Fine-tuning

Add code
Bookmark button
Alert button
Oct 09, 2023
Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik Narasimhan, Shunyu Yao

Viaarxiv icon

Cognitive Architectures for Language Agents

Add code
Bookmark button
Alert button
Sep 05, 2023
Theodore Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths

Viaarxiv icon

Scaling Laws for Imitation Learning in NetHack

Add code
Bookmark button
Alert button
Jul 18, 2023
Jens Tuyls, Dhruv Madeka, Kari Torkkola, Dean Foster, Karthik Narasimhan, Sham Kakade

Figure 1 for Scaling Laws for Imitation Learning in NetHack
Figure 2 for Scaling Laws for Imitation Learning in NetHack
Figure 3 for Scaling Laws for Imitation Learning in NetHack
Figure 4 for Scaling Laws for Imitation Learning in NetHack
Viaarxiv icon