Picture for Shir Ashury-Tahan

Shir Ashury-Tahan

ErrorMap and ErrorAtlas: Charting the Failure Landscape of Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

The Mighty ToRR: A Benchmark for Table Reasoning and Robustness

Add code
Feb 26, 2025
Figure 1 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 2 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 3 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 4 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Viaarxiv icon

Data-driven Coreference-based Ontology Building

Add code
Oct 22, 2024
Viaarxiv icon

Label-Efficient Model Selection for Text Generation

Add code
Feb 12, 2024
Figure 1 for Label-Efficient Model Selection for Text Generation
Figure 2 for Label-Efficient Model Selection for Text Generation
Figure 3 for Label-Efficient Model Selection for Text Generation
Figure 4 for Label-Efficient Model Selection for Text Generation
Viaarxiv icon