lapl-directories

Download metadata and PDFs for phone directories from the Los Angeles Public Library:

https://rescarta.lapl.org/

Install

You'll need Python and uv installed. Then create a virtual environment and install the dependencies:

$ uv venv
$ source .venv/bin/activate
$ uv pip install -r requirements.txt

Crawl

To generate a CSV of metadata for the titles and download a PDF for each page:

$ ./crawl.py

NOTE: as a courtesy, please enter your email address when prompted, so that LAPL knows who is crawling this content.
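For readers wondering how a contact email can be passed along with a crawl, here is a minimal sketch using only the standard library. It is not taken from crawl.py: the `polite_request` helper, the User-Agent string, and the choice of the HTTP `From` header are all assumptions made for illustration.

```python
import urllib.request


def polite_request(url, email):
    """Build a request that identifies the crawl operator by email.

    Hypothetical helper: crawl.py may pass the address along differently.
    """
    headers = {
        # Assumed User-Agent format, chosen for this example only.
        "User-Agent": f"lapl-directories crawler (contact: {email})",
        # "From" is a standard HTTP request header for a contact address.
        "From": email,
    }
    return urllib.request.Request(url, headers=headers)


# Build the request without sending it.
req = polite_request("https://rescarta.lapl.org/", "you@example.org")
print(req.get_header("From"))
```

Passing `req` to `urllib.request.urlopen` would send it. A crawler that identifies itself this way gives the site operators someone to contact if the traffic causes problems.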

Build

Once the crawl is done, you can build a single PDF for each title:

$ ./build.py
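The build step can be pictured roughly as follows. This is a sketch under stated assumptions, not the contents of build.py: it assumes one directory per title with one PDF per page, and it uses the third-party pypdf package for merging, which may or may not be what the project depends on. The names `natural_key`, `group_pages`, and `merge_title` are hypothetical.

```python
import re
from collections import defaultdict
from pathlib import Path


def natural_key(path):
    """Sort key that orders 'page-2.pdf' before 'page-10.pdf'."""
    parts = re.split(r"(\d+)", Path(path).name)
    return [int(p) if p.isdigit() else p.lower() for p in parts]


def group_pages(paths):
    """Group page PDFs by parent directory name.

    Assumes one directory per title; the real layout may differ.
    """
    groups = defaultdict(list)
    for p in map(Path, paths):
        groups[p.parent.name].append(p)
    return {title: sorted(pages, key=natural_key) for title, pages in groups.items()}


def merge_title(pages, out_path):
    """Concatenate the page PDFs, in order, into a single file."""
    from pypdf import PdfWriter  # third-party; assumed, imported lazily

    writer = PdfWriter()
    for page in pages:
        writer.append(str(page))
    writer.write(str(out_path))
    writer.close()


# Example paths are made up; no files are read at this point.
groups = group_pages(
    ["data/a/page-10.pdf", "data/a/page-2.pdf", "data/b/page-1.pdf"]
)
```

Sorting with `natural_key` matters because a plain string sort would place page 10 before page 2. Calling `merge_title(groups["a"], "a.pdf")` would then write the merged file, provided the page files exist and pypdf is installed.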