Joachim Daiber

Evaluating Cost-Accuracy Trade-offs in Multimodal Search Relevance Judgements

Oct 25, 2024

Silvia Terragni, Hoang Cuong, Joachim Daiber, Pallavi Gudipati, Pablo N. Mendes

Abstract: Large Language Models (LLMs) have demonstrated potential as effective search relevance evaluators. However, there is a lack of comprehensive guidance on which models consistently perform optimally across various contexts or within specific use cases. In this paper, we assess several LLMs and Multimodal Language Models (MLLMs) in terms of their alignment with human judgments across multiple multimodal search scenarios. Our analysis investigates the trade-offs between cost and accuracy, highlighting that model performance varies significantly depending on the context. Interestingly, in smaller models, the inclusion of a visual component may hinder performance rather than enhance it. These findings highlight the complexities involved in selecting the most appropriate model for practical applications.

* CIKM MMSR 2024
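
The cost-accuracy comparison described in this abstract can be illustrated with a small sketch. It is not the authors' evaluation code: the model names, costs, and labels below are invented toy values, and a plain agreement rate stands in for whatever alignment metric the paper actually uses. The sketch scores each hypothetical model against human labels and keeps only the models that are not dominated on both cost and agreement.

```python
from typing import NamedTuple


class ModelResult(NamedTuple):
    name: str         # hypothetical model identifier
    cost: float       # assumed cost per batch of judgments (arbitrary units)
    agreement: float  # fraction of judgments matching the human label


def agreement_rate(model_labels, human_labels):
    """Fraction of items where the model label equals the human label."""
    matches = sum(m == h for m, h in zip(model_labels, human_labels))
    return matches / len(human_labels)


def pareto_frontier(results):
    """Keep results that no other result beats on both cost and agreement."""
    frontier = []
    for r in results:
        dominated = any(
            o.cost <= r.cost
            and o.agreement >= r.agreement
            and (o.cost < r.cost or o.agreement > r.agreement)
            for o in results
        )
        if not dominated:
            frontier.append(r)
    return sorted(frontier, key=lambda r: r.cost)


# Toy data only: 1 = relevant, 0 = not relevant. Not results from the paper.
human = [1, 0, 1, 1, 0, 1, 0, 0]
predictions = {
    "small_text": ([1, 0, 1, 0, 0, 1, 0, 1], 1.0),
    "small_multimodal": ([1, 1, 0, 0, 0, 1, 0, 1], 3.0),
    "large_multimodal": ([1, 0, 1, 1, 0, 1, 0, 1], 10.0),
}

results = [
    ModelResult(name, cost, agreement_rate(labels, human))
    for name, (labels, cost) in predictions.items()
]
frontier = pareto_frontier(results)

for r in frontier:
    print(f"{r.name}: cost={r.cost}, agreement={r.agreement:.3f}")
```

A frontier view like this makes the selection question concrete: a model is worth considering only if no cheaper model matches or exceeds its agreement with human judges.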

MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering

Jul 30, 2020

Shayne Longpre, Yi Lu, Joachim Daiber

Abstract: Progress in cross-lingual modeling depends on challenging, realistic, and diverse evaluation sets. We introduce Multilingual Knowledge Questions and Answers (MKQA), an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. Answers are based on a language-independent data representation, making results comparable across languages and independent of language-specific passages. With 26 languages, this dataset supplies the widest range of languages to date for evaluating question answering. We benchmark state-of-the-art extractive question answering baselines, trained on Natural Questions, including Multilingual BERT and XLM-RoBERTa, in zero-shot and translation settings. Results indicate this dataset is challenging, especially in low-resource languages.

Splitting Compounds by Semantic Analogy

Sep 15, 2015

Joachim Daiber, Lautaro Quiroz, Roger Wechsler, Stella Frank

Abstract: Compounding is a highly productive word-formation process in some languages that is often problematic for natural language processing applications. In this paper, we investigate whether distributional semantics in the form of word embeddings can enable a deeper, i.e., more knowledge-rich, processing of compounds than the standard string-based methods. We present an unsupervised approach that exploits regularities in the semantic vector space (based on analogies such as "bookshop is to shop as bookshelf is to shelf") to produce compound analyses of high quality. A subsequent compound splitting algorithm based on these analyses is highly effective, particularly for ambiguous compounds. German to English machine translation experiments show that this semantic analogy-based compound splitter leads to better translations than a commonly used frequency-based method.

* Proceedings of the 1st Deep Machine Translation Workshop. Prague, Czech Republic. 2015
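
The analogy "bookshop is to shop as bookshelf is to shelf" from this abstract can be sketched in a few lines. This is a simplified illustration, not the authors' algorithm: the three-dimensional embeddings are hand-built toy vectors, the single prototype pair and the `threshold` value are assumptions, and the function name `split_by_analogy` is hypothetical. A candidate split is accepted only when the offset between the compound and its candidate head points in nearly the same direction as the prototype offset.

```python
import math

# Hypothetical toy embeddings, hand-built for illustration only.
EMB = {
    "shop": [1.0, 0.0, 0.0],
    "shelf": [0.0, 1.0, 0.0],
    "bookshop": [1.0, 0.0, 1.0],
    "bookshelf": [0.0, 1.0, 1.0],
    "bishop": [0.3, 0.2, -0.9],
}


def sub(a, b):
    """Element-wise vector difference a - b."""
    return [x - y for x, y in zip(a, b)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_by_analogy(word, prototype, threshold=0.8):
    """Return (modifier, head, score) for the best analogy-supported split.

    `prototype` is a known (compound, head) pair such as ("bookshop", "shop").
    Returns None when no candidate split reaches the assumed threshold.
    """
    if word not in EMB:
        return None
    proto_offset = sub(EMB[prototype[0]], EMB[prototype[1]])
    best = None
    for i in range(1, len(word)):
        modifier, head = word[:i], word[i:]
        if head not in EMB:
            continue  # the candidate head must itself be a known word
        score = cosine(sub(EMB[word], EMB[head]), proto_offset)
        if score >= threshold and (best is None or score > best[2]):
            best = (modifier, head, score)
    return best


print(split_by_analogy("bookshelf", ("bookshop", "shop")))
print(split_by_analogy("bishop", ("bookshop", "shop")))
```

In this toy setup "bookshelf" splits into "book" and "shelf", while "bishop" is left unsplit even though it ends in the string "shop", which is the kind of ambiguous case where a purely string-based splitter could go wrong.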
