Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Souvick Das

Fine-grained Claim-level RAG Benchmark for Law

May 21, 2026

Souvick Das, Sallam Abualhaija, Domenico Bianculli

Abstract:The rapid progress of large language models (LLMs) is shifting semantic search toward a question-answering paradigm, where users ask questions and LLMs generate responses. In high-stake domains such as law, retrieval-augmented generation (RAG) is commonly used to mitigate hallucinations in generated responses. Nonetheless, prior work shows that RAG systems, whether general-purpose or legal-specific, still hallucinate at varying rates, making fine-grained evaluation essential. Despite the need, existing evaluation frameworks for legal RAG systems lack the granularity required to provide detailed analysis of retrieval and generation performance separately. Moreover, current benchmarks are largely English-only and centered on legal expert queries, overlooking non-expert needs. We introduce ClaimRAG-LAW, a comprehensive dataset for legal RAG that supports French and English, targets both experts and non-experts, and includes diverse question types reflecting realistic scenarios. We further apply a fine-grained evaluation framework of state-of-the-art legal RAG systems, revealing limitations in retrieval, generation, and claim-level analysis in the legal domain.

Via

Access Paper or Ask Questions

An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews"

Mar 08, 2023

Souvick Das, Novarun Deb, Agostino Cortesi, Nabendu Chaki

Figure 1 for An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews"

Figure 2 for An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews"

Figure 3 for An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews"

Figure 4 for An Annexure to the Paper "Driving the Technology Value Stream by Analyzing App Reviews"

Abstract:This paper presents a novel framework that utilizes Natural Language Processing (NLP) techniques to understand user feedback on mobile applications. The framework allows software companies to drive their technology value stream based on user reviews, which can highlight areas for improvement. The framework is analyzed in depth, and its modules are evaluated for their effectiveness. The proposed approach is demonstrated to be effective through an analysis of reviews for sixteen popular Android Play Store applications over a long period of time.

Via

Access Paper or Ask Questions