Picture for Faiz Surani

Faiz Surani

Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

Add code
May 30, 2024
Viaarxiv icon

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Add code
Aug 20, 2023
Figure 1 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 2 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 3 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Figure 4 for LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Viaarxiv icon

PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs

Add code
Mar 17, 2023
Figure 1 for PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
Figure 2 for PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
Figure 3 for PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
Figure 4 for PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
Viaarxiv icon