Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplicate Datasets #4012

Closed
btylerburton opened this issue Oct 14, 2022 · 0 comments
Closed

Duplicate Datasets #4012

btylerburton opened this issue Oct 14, 2022 · 0 comments
Labels
component/catalog Related to catalog component playbooks/roles component/harvest component/solr-service Related to Solr-as-a-Service, a brokered Solr offering Epic

Comments

@btylerburton
Copy link
Contributor

Feature/what we're after

As a datagov team member, I want to understand why packages are getting duplicated during the harvest process

Anticipated/hypothesized benefits

  • Our count of datasets would be accurate
  • We can rely on the harvest process being atomic and idempotent

Measurements/metrics

  • CKAN package count will only change when corresponding harvest sources add / remove datasets

References/background

List of fixes that have been implemented to mitigate this issue:

@btylerburton btylerburton added component/catalog Related to catalog component playbooks/roles component/harvest component/solr-service Related to Solr-as-a-Service, a brokered Solr offering Epic labels Oct 14, 2022
@btylerburton btylerburton moved this to ✔ Done in data.gov team board Oct 17, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/catalog Related to catalog component playbooks/roles component/harvest component/solr-service Related to Solr-as-a-Service, a brokered Solr offering Epic
Projects
None yet
Development

No branches or pull requests

1 participant