Mohammad Aliannejadi

LLMJudge: LLMs for Relevance Judgments

Aug 09, 2024
Via arXiv

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Aug 09, 2024
Via arXiv

Generative Retrieval with Few-shot Indexing

Aug 04, 2024
Via arXiv

Interactions with Generative Information Retrieval Systems

Jul 16, 2024
Via arXiv

Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics

Jun 21, 2024
Via arXiv

Can We Use Large Language Models to Fill Relevance Judgment Holes?

May 09, 2024
Via arXiv

TREC iKAT 2023: A Test Collection for Evaluating Conversational and Interactive Knowledge Assistants

May 04, 2024
Via arXiv

Are We Really Achieving Better Beyond-Accuracy Performance in Next Basket Recommendation?

May 02, 2024
Via arXiv

Ranked List Truncation for Large Language Model-based Re-Ranking

Apr 28, 2024
Via arXiv

Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs

Apr 19, 2024
Via arXiv