Picture for Abhilasha Ravichander

Abhilasha Ravichander

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Add code
Jun 07, 2024
Viaarxiv icon

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Add code
Feb 19, 2024
Figure 1 for Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Figure 2 for Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Figure 3 for Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Figure 4 for Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Jan 31, 2024
Figure 1 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 2 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 3 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 4 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Viaarxiv icon

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Add code
Dec 04, 2023
Figure 1 for The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Figure 2 for The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Figure 3 for The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Figure 4 for The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Viaarxiv icon

MacGyver: Are Large Language Models Creative Problem Solvers?

Add code
Nov 16, 2023
Figure 1 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 2 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 3 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 4 for MacGyver: Are Large Language Models Creative Problem Solvers?
Viaarxiv icon

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Add code
Nov 09, 2023
Figure 1 for Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Figure 2 for Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Figure 3 for Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Figure 4 for Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Viaarxiv icon

What's In My Big Data?

Add code
Oct 31, 2023
Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

The Generative AI Paradox: "What It Can Create, It May Not Understand"

Add code
Oct 31, 2023
Figure 1 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 2 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 3 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 4 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Viaarxiv icon

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

Add code
May 24, 2023
Figure 1 for Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Figure 2 for Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Figure 3 for Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Figure 4 for Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
Viaarxiv icon