Picture for Pegah Alipoormolabashi

Pegah Alipoormolabashi

Shammie

$\texttt{DIAMONDs}$: A Dataset for $\mathbb{D}$ynamic $\mathbb{I}$nformation $\mathbb{A}$nd $\mathbb{M}$ental modeling $\mathbb{O}$f $\mathbb{N}$umeric $\mathbb{D}$iscussions

Add code
May 19, 2025
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks

Add code
Apr 16, 2022
Figure 1 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 2 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 3 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 4 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Viaarxiv icon

Understanding Procedural Knowledge by Sequencing Multimodal Instructional Manuals

Add code
Oct 16, 2021
Figure 1 for Understanding Procedural Knowledge by Sequencing Multimodal Instructional Manuals
Figure 2 for Understanding Procedural Knowledge by Sequencing Multimodal Instructional Manuals
Figure 3 for Understanding Procedural Knowledge by Sequencing Multimodal Instructional Manuals
Figure 4 for Understanding Procedural Knowledge by Sequencing Multimodal Instructional Manuals
Viaarxiv icon

COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences

Add code
Jun 02, 2021
Viaarxiv icon