Reimplement data initialization process; keeps ending early because of network issues #2

Open
MattTriano opened this issue Apr 9, 2022 · 0 comments


A full installation of TIGER data for all US states involves downloading tens of GB of data. With the current implementation, a single network hiccup or other issue causes the /docker-entrypoint-initdb.d/11_load_data.sh script to end early. Because that script is only called on the initial startup of the postgis_geocoder docker-compose app (with a given docker volume instance), I've had to tear down the volume, point my VPN to a different server, and try again. Not only is this unacceptable in terms of ease of use, it's also rude of me to waste Census Bureau bandwidth redownloading the same files.

So my plan is to rework the initialization to download data to a volume/mount_point on the host machine before moving on to the 11_load_data.sh script.
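
A minimal sketch of what that pre-download step might look like, assuming the files to fetch are listed one URL per line in a text file and that `wget` is available. The names `fetch_one`, `fetch_all`, `FETCH_CMD`, and the `.done` marker files are hypothetical and are not part of the current repo. The idea is that a failed file can be retried later without redownloading the ones that already succeeded.

```shell
#!/usr/bin/env bash
# Hypothetical pre-download helper; not the repo's actual implementation.

# fetch_one URL DEST_DIR
# Download a single file into DEST_DIR. --continue resumes a partial file,
# and --tries/--waitretry retry on transient network failures.
fetch_one() {
    wget --continue --tries=5 --waitretry=10 --directory-prefix="$2" "$1"
}

# fetch_all URL_LIST DEST_DIR
# Download every URL listed in URL_LIST (one per line) into DEST_DIR.
# A "<name>.done" marker is written after each successful download, so a
# rerun skips finished files instead of requesting them again.
# Succeeds only if every file is present at the end.
fetch_all() {
    local url_list="$1" dest_dir="$2" failed=0 url name
    mkdir -p "$dest_dir"
    while IFS= read -r url; do
        [ -z "$url" ] && continue
        name="$(basename "$url")"
        if [ -e "$dest_dir/$name.done" ]; then
            continue  # already downloaded on an earlier run
        fi
        # FETCH_CMD is an assumed override hook (e.g. for curl or a test stub).
        if "${FETCH_CMD:-fetch_one}" "$url" "$dest_dir"; then
            touch "$dest_dir/$name.done"
        else
            echo "failed: $url" >&2
            failed=$((failed + 1))
        fi
    done < "$url_list"
    [ "$failed" -eq 0 ]
}
```

With something like this, `fetch_all urls.txt /path/to/mounted/data` could be rerun until it succeeds, and only then would the stack be started so that 11_load_data.sh reads from the local copy. The URL list file and the mount path here are placeholders.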

Maybe I'll try to work this into the /Dockerfiles/Dockerfile_postgis implementation, or maybe I'll make it an external shell script that the user has to run. The former seems more convenient, but I will have to learn more new things to go that route.
