Picture for Chaitanya Malaviya

Chaitanya Malaviya

Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models

Add code
Jun 05, 2025
Viaarxiv icon

LogiCoL: Logically-Informed Contrastive Learning for Set-based Dense Retrieval

Add code
May 26, 2025
Viaarxiv icon

EvalAgent: Discovering Implicit Evaluation Criteria from the Web

Add code
Apr 21, 2025
Viaarxiv icon

On Reference (In-)Determinacy in Natural Language Inference

Add code
Feb 09, 2025
Figure 1 for On Reference (In-)Determinacy in Natural Language Inference
Figure 2 for On Reference (In-)Determinacy in Natural Language Inference
Figure 3 for On Reference (In-)Determinacy in Natural Language Inference
Figure 4 for On Reference (In-)Determinacy in Natural Language Inference
Viaarxiv icon

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Add code
Nov 11, 2024
Figure 1 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 2 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 3 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Figure 4 for Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Viaarxiv icon

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Add code
Jul 22, 2024
Viaarxiv icon

DOLOMITES: Domain-Specific Long-Form Methodical Tasks

Add code
May 09, 2024
Viaarxiv icon

Calibrating Large Language Models with Sample Consistency

Add code
Feb 21, 2024
Viaarxiv icon

Pachinko: Patching Interpretable QA Models through Natural Language Feedback

Add code
Nov 16, 2023
Viaarxiv icon

ExpertQA: Expert-Curated Questions and Attributed Answers

Add code
Sep 14, 2023
Figure 1 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 2 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 3 for ExpertQA: Expert-Curated Questions and Attributed Answers
Figure 4 for ExpertQA: Expert-Curated Questions and Attributed Answers
Viaarxiv icon