Picture for Yash Kumar Lal

Yash Kumar Lal

MuSciClaims: Multimodal Scientific Claim Verification

Add code
Jun 05, 2025
Viaarxiv icon

$\texttt{DIAMONDs}$: A Dataset for $\mathbb{D}$ynamic $\mathbb{I}$nformation $\mathbb{A}$nd $\mathbb{M}$ental modeling $\mathbb{O}$f $\mathbb{N}$umeric $\mathbb{D}$iscussions

Add code
May 19, 2025
Viaarxiv icon

Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis

Add code
Nov 21, 2024
Figure 1 for Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Figure 2 for Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Figure 3 for Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Figure 4 for Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
Viaarxiv icon

Can Stories Help LLMs Reason? Curating Information Space Through Narrative

Add code
Oct 25, 2024
Figure 1 for Can Stories Help LLMs Reason? Curating Information Space Through Narrative
Figure 2 for Can Stories Help LLMs Reason? Curating Information Space Through Narrative
Figure 3 for Can Stories Help LLMs Reason? Curating Information Space Through Narrative
Figure 4 for Can Stories Help LLMs Reason? Curating Information Space Through Narrative
Viaarxiv icon

Automated Adversarial Discovery for Safety Classifiers

Add code
Jun 24, 2024
Viaarxiv icon

CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans

Add code
Jun 22, 2024
Figure 1 for CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans
Figure 2 for CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans
Figure 3 for CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans
Figure 4 for CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans
Viaarxiv icon

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

Add code
Feb 03, 2024
Viaarxiv icon

One Size Does Not Fit All: Customizing Open-Domain Procedures

Add code
Nov 16, 2023
Viaarxiv icon

Evaluating Paraphrastic Robustness in Textual Entailment Models

Add code
Jun 29, 2023
Viaarxiv icon

Systematic Evaluation of GPT-3 for Zero-Shot Personality Estimation

Add code
Jun 01, 2023
Viaarxiv icon