Picture for Chonghua Wang

Chonghua Wang

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Add code
Apr 10, 2024
Viaarxiv icon

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues

Add code
Oct 20, 2023
Figure 1 for BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues
Figure 2 for BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues
Figure 3 for BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues
Figure 4 for BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues
Viaarxiv icon