Dataset Of Legal Documents


Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese Regulations

Add code
Jun 16, 2025
Viaarxiv icon

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Add code
May 26, 2025
Viaarxiv icon

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

Add code
May 22, 2025
Viaarxiv icon

A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court

Add code
May 13, 2025
Viaarxiv icon

BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law

Add code
May 21, 2025
Viaarxiv icon

QBD-RankedDataGen: Generating Custom Ranked Datasets for Improving Query-By-Document Search Using LLM-Reranking with Reduced Human Effort

Add code
May 07, 2025
Viaarxiv icon

Labeling Case Similarity based on Co-Citation of Legal Articles in Judgment Documents with Empirical Dispute-Based Evaluation

Add code
Apr 29, 2025
Viaarxiv icon

SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning

Add code
Apr 29, 2025
Viaarxiv icon

Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

Add code
Apr 15, 2025
Viaarxiv icon

Improving the Accuracy and Efficiency of Legal Document Tagging with Large Language Models and Instruction Prompts

Add code
Apr 12, 2025
Viaarxiv icon