This repository contains the LegalBench-RAG benchmark, which can evaluate any retrieval system on the task of identifying the correct snippets that answer a given query. LegalBench-RAG computes precision and recall at the character level for any given retrieval system in a deterministic way.
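To illustrate the character-level scoring idea (this is a sketch of the metric, not the repository's own implementation), precision and recall can be computed from the overlap between retrieved character ranges and ground-truth character ranges within a file:

```python
# Sketch of character-level precision/recall over (start, end) spans.
# Illustrative only; benchmark.py may compute this differently.

def covered_chars(spans: list[tuple[int, int]]) -> set[int]:
    """Expand half-open [start, end) character ranges into a set of indices."""
    chars: set[int] = set()
    for start, end in spans:
        chars.update(range(start, end))
    return chars

def precision_recall(
    retrieved: list[tuple[int, int]],
    ground_truth: list[tuple[int, int]],
) -> tuple[float, float]:
    retrieved_chars = covered_chars(retrieved)
    truth_chars = covered_chars(ground_truth)
    overlap = len(retrieved_chars & truth_chars)
    precision = overlap / len(retrieved_chars) if retrieved_chars else 0.0
    recall = overlap / len(truth_chars) if truth_chars else 0.0
    return precision, recall

# Example: one retrieved span partially overlaps the ground-truth span.
print(precision_recall([(100, 200)], [(150, 300)]))  # -> (0.5, 0.333...)
```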
To download the existing benchmark and corpus, please visit this link.
- Create and activate your virtual environment
python3.12 -m venv .venv
source .venv/bin/activate
- Install the dependencies
pip install pip-tools
pip-sync && pip install -e .
- Create your credentials.toml and set your API keys
cp ./credentials/credentials.example.toml ./credentials/credentials.toml
vim ./credentials/credentials.toml
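The exact contents of credentials.toml are defined by credentials.example.toml. As a purely hypothetical sketch (the table and key names below are placeholders, not the real schema), the file might look like:

```toml
# Placeholder sketch only; follow the actual field names in credentials.example.toml.
[openai]
api_key = "sk-your-key-here"
```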
- Download or Generate the dataset
You can download the data using the download link provided above. From the repository root, the directory structure should contain a ./data/corpus folder and a ./data/benchmarks folder. The corpus folder is a set of raw text files, potentially organized in a directory hierarchy. The benchmarks folder is a set of benchmark JSON files. Each benchmark JSON contains a set of test cases; each test case has a query and a ground-truth array of snippets, and each snippet references a text file in the corpus via its file path within the corpus folder, along with a character index range into that file.
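To make that shape concrete, the sketch below loads one benchmark file and resolves its snippets back to corpus text. The file name and the JSON key names used here ("tests", "query", "snippets", "file_path", "span") are assumptions for illustration and should be checked against an actual benchmark file:

```python
import json
from pathlib import Path

# Hypothetical walk-through of one benchmark file. The file name and key
# names below are illustrative assumptions, not a guaranteed schema.
benchmark_path = Path("./data/benchmarks/cuad.json")
corpus_dir = Path("./data/corpus")

benchmark = json.loads(benchmark_path.read_text())
for test in benchmark["tests"]:
    query = test["query"]
    for snippet in test["snippets"]:
        start, end = snippet["span"]
        document = (corpus_dir / snippet["file_path"]).read_text()
        print(query[:60], snippet["file_path"], (start, end))
        print("  ground truth:", document[start:end][:80].replace("\n", " "))
```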
If you would instead like to re-generate the benchmark from the source datasets, all of the code to do so is also provided in this repository. Please ensure you agree to the usage policies of ContractNLI, CUAD, MAUD, and PrivacyQA before running this script. Once you have done that, simply execute the following:
python ./legalbenchrag/generate
Please note that LLMs are used in the process of creating the LegalBench-RAG benchmark, so running this generate script will not reproduce exactly the same benchmark as the one provided in the download link. However, the data in the download link was itself generated by this exact process.
- Run the benchmark script
python ./legalbenchrag/benchmark.py