Adish Singla

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

Mar 04, 2024
Andi Nika, Debmalya Mandal, Adish Singla, Goran Radanović

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Mar 04, 2024
Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Georgios Tzannetos, Goran Radanović, Adish Singla

Informativeness of Reward Functions in Reinforcement Learning

Feb 10, 2024
Rati Devidze, Parameswaran Kamalaruban, Adish Singla

Corruption Robust Offline Reinforcement Learning with Human Feedback

Feb 09, 2024
Debmalya Mandal, Andi Nika, Parameswaran Kamalaruban, Adish Singla, Goran Radanović

Generative AI for Education (GAIED): Advances, Opportunities, and Challenges

Feb 07, 2024
Paul Denny, Sumit Gulwani, Neil T. Heffernan, Tanja Käser, Steven Moore, Anna N. Rafferty, Adish Singla

Active Third-Person Imitation Learning

Dec 27, 2023
Timo Klein, Susanna Weinberger, Adish Singla, Sebastian Tschiatschek

Optimally Teaching a Linear Behavior Cloning Agent

Nov 26, 2023
Shubham Kumar Bharti, Stephen Wright, Adish Singla, Xiaojin Zhu

Large Language Models for In-Context Student Modeling: Synthesizing Student's Behavior in Visual Programming from One-Shot Observation

Oct 15, 2023
Manh Hung Nguyen, Sebastian Tschiatschek, Adish Singla

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Oct 05, 2023
Tung Phung, Victor-Alexandru Pădurean, Anjali Singh, Christopher Brooks, José Cambronero, Sumit Gulwani, Adish Singla, Gustavo Soares
