Yueyue Wu

Civil Court Simulation with Large Language Models

Jun 08, 2026
Via arXiv

LexRubric: A Rubric-Guided Diagnostic Benchmark for Open-Ended Legal Tasks

Jun 08, 2026
Via arXiv

Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization

May 03, 2026
Via arXiv

LegalOne: A Family of Foundation Models for Reliable Legal Reasoning

Feb 03, 2026
Via arXiv

Chinese Court Simulation with LLM-Based Agent System

Aug 24, 2025
Via arXiv

JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System

Mar 20, 2025
Via arXiv

LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation

Feb 28, 2025
Via arXiv

CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation

Feb 25, 2025
Via arXiv

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Dec 23, 2024
Via arXiv

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

Sep 30, 2024
Via arXiv