Rotem Dror

State of What Art? A Call for Multi-Prompt LLM Evaluation

Dec 31, 2023
Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky

Via arXiv

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Nov 21, 2023
Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, Ahmed Alaa, Adji Bousso Dieng, Natasha Noy, Vijay Janapa Reddi, James Zou, Praveen Paritosh, Mihaela van der Schaar, Kurt Bollacker, Lora Aroyo, Ce Zhang, Joaquin Vanschoren, Isabelle Guyon, Peter Mattson

Via arXiv

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics

Oct 30, 2023
Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger

Via arXiv

Human-in-the-Loop Schema Induction

Feb 25, 2023
Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Liyang Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Brown, Reece Suchocki, Chris Callison-Burch

Via arXiv

On the Limitations of Reference-Free Evaluations of Generated Text

Oct 22, 2022
Daniel Deutsch, Rotem Dror, Dan Roth

Via arXiv

Zero-Shot On-the-Fly Event Schema Induction

Oct 12, 2022
Rotem Dror, Haoyu Wang, Dan Roth

Via arXiv

Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics

Apr 21, 2022
Daniel Deutsch, Rotem Dror, Dan Roth

Via arXiv

A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods

Mar 31, 2021
Daniel Deutsch, Rotem Dror, Dan Roth

Via arXiv

The Structured Weighted Violations MIRA

May 09, 2020
Dor Ringel, Rotem Dror, Roi Reichart

Via arXiv

Appendix - Recommended Statistical Significance Tests for NLP Tasks

Sep 05, 2018
Rotem Dror, Roi Reichart

Via arXiv