Mohammad Aliannejadi

Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics

Jun 21, 2024

Can We Use Large Language Models to Fill Relevance Judgment Holes?

May 09, 2024

TREC iKAT 2023: A Test Collection for Evaluating Conversational and Interactive Knowledge Assistants

May 04, 2024

Are We Really Achieving Better Beyond-Accuracy Performance in Next Basket Recommendation?

May 02, 2024

Ranked List Truncation for Large Language Model-based Re-Ranking

Apr 28, 2024

Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs

Apr 19, 2024

Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems

Apr 15, 2024

Query Performance Prediction using Relevance Judgments Generated by Large Language Models

Apr 01, 2024

Generate then Retrieve: Conversational Response Retrieval Using LLMs as Answer and Query Generators

Mar 28, 2024

CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems

Mar 27, 2024