Xifeng Yan

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Jul 06, 2024
Via arXiv

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Mar 05, 2024
Via arXiv

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Feb 16, 2024
Via arXiv

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023
Via arXiv

Do you really follow me? Adversarial Instructions for Evaluating the Robustness of Large Language Models

Aug 17, 2023
Via arXiv

Augmenting Language Models with Long-Term Memory

Jun 12, 2023
Via arXiv

STEPS: A Benchmark for Order Reasoning in Sequential Tasks

Jun 07, 2023
Via arXiv

Graph Reasoning for Question Answering with Triplet Retrieval

May 30, 2023
Via arXiv

Bot or Human? Detecting ChatGPT Imposters with A Single Question

May 16, 2023
Via arXiv

Guiding Large Language Models via Directional Stimulus Prompting

Feb 22, 2023
Via arXiv