Archive EPA eGrid #517

cmgosnell · 2025-01-17T19:24:12Z

Motivation and context:

Briefly describe the dataset. What is it, and why do we want to archive it regularly?
Include a link to the dataset webpage and any metadata documentation.

The links to all of the files show up on these two pages above, but the urls where the data is actually stored all seem to follow this pattern:

https://www.epa.gov/system/files/documents/{year of publication}-{month}/egrid{data year}{file name}

Note that the publication date is different than the data year. The data year is what we want to reference when we archive this data.

Also note that there are several files per year. We want to grab all of them so you'll need to use add_to_archive.

Requirements for archiving

To be archived on Zenodo, a dataset must be:

published under an open license that permits reuse and redistribution
less than 50Gb in size (when zipped)
relevant to energy modelling and research

Checklist for archive creation

Based on the README documentation on creating a new archive:

Tasks

Give feedback

Define the dataset's metadata
Implement archiver interface
Test archiver locally
Test uploading to Zenodo
Manually review archive before publication
Finalize archive (only core Catalyst developers can complete this step)
Automate archiving
Options

Links to published archives:

Include a link to the published sandbox archive for review.

The text was updated successfully, but these errors were encountered:

cmgosnell added the new-data label Jan 17, 2025

github-project-automation bot added this to Catalyst Megaproject Jan 17, 2025

github-project-automation bot moved this to New in Catalyst Megaproject Jan 17, 2025

cmgosnell self-assigned this Jan 24, 2025

cmgosnell moved this from New to In progress in Catalyst Megaproject Jan 27, 2025

cmgosnell mentioned this issue Jan 27, 2025

Add new archiver for EPA eGRID #549

Merged

cmgosnell moved this from In progress to In review in Catalyst Megaproject Jan 28, 2025

cmgosnell changed the title ~~Write an archiver for EPA eGrid~~ Archive EPA eGrid Jan 28, 2025

cmgosnell closed this as completed in #549 Jan 29, 2025

github-project-automation bot moved this from In review to Done in Catalyst Megaproject Jan 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Archive EPA eGrid #517

Archive EPA eGrid #517

cmgosnell commented Jan 17, 2025 •

edited

Loading

Tasks

Archive EPA eGrid #517

Archive EPA eGrid #517

Comments

cmgosnell commented Jan 17, 2025 • edited Loading

Motivation and context:

Requirements for archiving

Checklist for archive creation

Tasks

Links to published archives:

cmgosnell commented Jan 17, 2025 •

edited

Loading