Picture for Yangyi Chen

Yangyi Chen

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

Add code
Nov 16, 2023
Figure 1 for R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Figure 2 for R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Figure 3 for R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Figure 4 for R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Viaarxiv icon

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Add code
Nov 16, 2023
Figure 1 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 2 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 3 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 4 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Viaarxiv icon

Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown

Add code
Nov 16, 2023
Figure 1 for Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown
Figure 2 for Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown
Figure 3 for Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown
Figure 4 for Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown
Viaarxiv icon

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

Add code
Sep 29, 2023
Figure 1 for CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Figure 2 for CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Figure 3 for CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Figure 4 for CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Viaarxiv icon

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Add code
Sep 19, 2023
Figure 1 for MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Figure 2 for MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Figure 3 for MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Figure 4 for MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Viaarxiv icon

Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Add code
Sep 08, 2023
Viaarxiv icon

Making Pre-trained Language Models both Task-solvers and Self-calibrators

Add code
Jul 21, 2023
Viaarxiv icon

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations

Add code
Jun 07, 2023
Figure 1 for Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Figure 2 for Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Figure 3 for Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Figure 4 for Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Viaarxiv icon

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework

Add code
May 29, 2023
Figure 1 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 2 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 3 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Figure 4 for From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Viaarxiv icon

A Close Look into the Calibration of Pre-trained Language Models

Add code
Oct 31, 2022
Viaarxiv icon