- I generated a list of incorporated cities along with associated planning agency (SACOG, ABAG, etc), county, and downloaded links to housing element PDFs.
- Geojson parcel data from each county website along with cleanup, normalization, and transfer to my database.
- Download each PDF, use machine learning to extract the data. Then do a second pass to clean up the (very messy) data. Generate metadata (page count, thumbnail, etc). Transfer all data to my database.
- Created endpoints to query database from the UI.
- Extra: Created web UI viewer using nextjs and react.