Karishma Malkan

Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning

Jun 13, 2024
Via arXiv

Do Transformer Modifications Transfer Across Implementations and Applications?

Feb 23, 2021
Via arXiv

WT5?! Training Text-to-Text Models to Explain their Predictions

Apr 30, 2020
Via arXiv