Picture for Siddhartha Jain

Siddhartha Jain

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Add code
Jun 27, 2024
Viaarxiv icon

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

Add code
Jun 11, 2024
Viaarxiv icon

Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

Add code
Jan 31, 2024
Figure 1 for Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM
Figure 2 for Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM
Figure 3 for Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM
Figure 4 for Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM
Viaarxiv icon

Self-consistency for open-ended generations

Add code
Jul 11, 2023
Figure 1 for Self-consistency for open-ended generations
Figure 2 for Self-consistency for open-ended generations
Figure 3 for Self-consistency for open-ended generations
Figure 4 for Self-consistency for open-ended generations
Viaarxiv icon

Multi-lingual Evaluation of Code Generation Models

Add code
Oct 26, 2022
Figure 1 for Multi-lingual Evaluation of Code Generation Models
Figure 2 for Multi-lingual Evaluation of Code Generation Models
Figure 3 for Multi-lingual Evaluation of Code Generation Models
Figure 4 for Multi-lingual Evaluation of Code Generation Models
Viaarxiv icon

Overinterpretation reveals image classification model pathologies

Add code
Mar 19, 2020
Figure 1 for Overinterpretation reveals image classification model pathologies
Figure 2 for Overinterpretation reveals image classification model pathologies
Figure 3 for Overinterpretation reveals image classification model pathologies
Figure 4 for Overinterpretation reveals image classification model pathologies
Viaarxiv icon

Information Condensing Active Learning

Add code
Feb 20, 2020
Figure 1 for Information Condensing Active Learning
Figure 2 for Information Condensing Active Learning
Figure 3 for Information Condensing Active Learning
Figure 4 for Information Condensing Active Learning
Viaarxiv icon

Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles

Add code
Jun 18, 2019
Figure 1 for Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
Figure 2 for Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
Figure 3 for Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
Figure 4 for Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles
Viaarxiv icon

What made you do this? Understanding black-box decisions with sufficient input subsets

Add code
Oct 09, 2018
Figure 1 for What made you do this? Understanding black-box decisions with sufficient input subsets
Figure 2 for What made you do this? Understanding black-box decisions with sufficient input subsets
Figure 3 for What made you do this? Understanding black-box decisions with sufficient input subsets
Figure 4 for What made you do this? Understanding black-box decisions with sufficient input subsets
Viaarxiv icon