Picture for Kshitij Gupta

Kshitij Gupta

WHODUNIT: Evaluation benchmark for culprit detection in mystery stories

Add code
Feb 11, 2025
Viaarxiv icon

Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark

Add code
Jan 16, 2025
Figure 1 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 2 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 3 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Figure 4 for Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Figure 1 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 2 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 3 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 4 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Viaarxiv icon

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Add code
Mar 26, 2024
Figure 1 for Simple and Scalable Strategies to Continually Pre-train Large Language Models
Figure 2 for Simple and Scalable Strategies to Continually Pre-train Large Language Models
Figure 3 for Simple and Scalable Strategies to Continually Pre-train Large Language Models
Figure 4 for Simple and Scalable Strategies to Continually Pre-train Large Language Models
Viaarxiv icon

Continual Pre-Training of Large Language Models: How to (re)warm your model?

Add code
Aug 08, 2023
Figure 1 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 2 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 3 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 4 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Viaarxiv icon

ARB: Advanced Reasoning Benchmark for Large Language Models

Add code
Jul 28, 2023
Figure 1 for ARB: Advanced Reasoning Benchmark for Large Language Models
Figure 2 for ARB: Advanced Reasoning Benchmark for Large Language Models
Figure 3 for ARB: Advanced Reasoning Benchmark for Large Language Models
Figure 4 for ARB: Advanced Reasoning Benchmark for Large Language Models
Viaarxiv icon

Broken Neural Scaling Laws

Add code
Nov 10, 2022
Figure 1 for Broken Neural Scaling Laws
Figure 2 for Broken Neural Scaling Laws
Figure 3 for Broken Neural Scaling Laws
Figure 4 for Broken Neural Scaling Laws
Viaarxiv icon

Data Augmentation for Automated Essay Scoring using Transformer Models

Add code
Oct 29, 2022
Figure 1 for Data Augmentation for Automated Essay Scoring using Transformer Models
Figure 2 for Data Augmentation for Automated Essay Scoring using Transformer Models
Figure 3 for Data Augmentation for Automated Essay Scoring using Transformer Models
Viaarxiv icon

MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation

Add code
Oct 01, 2022
Figure 1 for MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation
Figure 2 for MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation
Viaarxiv icon

cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation

Add code
Jun 09, 2022
Figure 1 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 2 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 3 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 4 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Viaarxiv icon