Aman Madaan

In-Context Principle Learning from Mistakes

Feb 09, 2024
Tianjun Zhang, Aman Madaan, Luyu Gao, Steven Zheng, Swaroop Mishra, Yiming Yang, Niket Tandon, Uri Alon

Via arXiv

Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination

Jan 16, 2024
Syeda Nahida Akter, Aman Madaan, Sangwu Lee, Yiming Yang, Eric Nyberg

Via arXiv

Program-Aided Reasoners (better) Know What They Know

Nov 16, 2023
Anubha Kabra, Sanketh Rangreji, Yash Mathur, Aman Madaan, Emmy Liu, Graham Neubig

Via arXiv

AutoMix: Automatically Mixing Language Models

Oct 19, 2023
Aman Madaan, Pranjal Aggarwal, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Mausam, Manaal Faruqui

Via arXiv

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Oct 04, 2023
Pei Zhou, Aman Madaan, Srividya Pranavi Potharaju, Aditya Gupta, Kevin R. McKee, Ari Holtzman, Jay Pujara, Xiang Ren, Swaroop Mishra, Aida Nematzadeh, Shyam Upadhyay, Manaal Faruqui

Via arXiv

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs

May 19, 2023
Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam

Via arXiv

RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs

May 15, 2023
Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon

Via arXiv

Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

May 01, 2023
Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

Via arXiv

Self-Refine: Iterative Refinement with Self-Feedback

Mar 30, 2023
Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark

Via arXiv