Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Archive EPA eGrid #517

Closed
1 of 7 tasks
cmgosnell opened this issue Jan 17, 2025 · 0 comments · Fixed by #549
Closed
1 of 7 tasks

Archive EPA eGrid #517

cmgosnell opened this issue Jan 17, 2025 · 0 comments · Fixed by #549
Assignees
Labels

Comments

@cmgosnell
Copy link
Member

cmgosnell commented Jan 17, 2025

Motivation and context:

Briefly describe the dataset. What is it, and why do we want to archive it regularly?
Include a link to the dataset webpage and any metadata documentation.

The links to all of the files show up on these two pages above, but the urls where the data is actually stored all seem to follow this pattern:

https://www.epa.gov/system/files/documents/{year of publication}-{month}/egrid{data year}{file name}

Note that the publication date is different than the data year. The data year is what we want to reference when we archive this data.

Also note that there are several files per year. We want to grab all of them so you'll need to use add_to_archive.

Requirements for archiving

To be archived on Zenodo, a dataset must be:

  • published under an open license that permits reuse and redistribution
  • less than 50Gb in size (when zipped)
  • relevant to energy modelling and research

Checklist for archive creation

Based on the README documentation on creating a new archive:

Tasks

Preview Give feedback

Links to published archives:

Include a link to the published sandbox archive for review.

@cmgosnell cmgosnell self-assigned this Jan 24, 2025
@cmgosnell cmgosnell moved this from New to In progress in Catalyst Megaproject Jan 27, 2025
@cmgosnell cmgosnell moved this from In progress to In review in Catalyst Megaproject Jan 28, 2025
@cmgosnell cmgosnell changed the title Write an archiver for EPA eGrid Archive EPA eGrid Jan 28, 2025
@github-project-automation github-project-automation bot moved this from In review to Done in Catalyst Megaproject Jan 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

1 participant