Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplicate datasets from two organizations #4049

Closed
FuhuXia opened this issue Nov 8, 2022 · 1 comment
Closed

Duplicate datasets from two organizations #4049

FuhuXia opened this issue Nov 8, 2022 · 1 comment
Labels
bug Software defect or bug harvest-duplicates Issues related to Duplicated Datasets

Comments

@FuhuXia
Copy link
Member

FuhuXia commented Nov 8, 2022

Same datasets are harvest twice in two organizations, for example

https://catalog.data.gov/dataset/nndss-table-ii-invasive-pneumococcal-to-legionellosis-79d5f
https://catalog.data.gov/dataset/nndss-table-ii-invasive-pneumococcal-to-legionellosis-f6b5a

there are 785 found between organization hhs-gov and centers-for-disease-control-and-prevention.

$ curl -s "https://catalog.data.gov/api/action/package_search?fq=(organization:centers-for-disease-control-and-prevention%20OR%20organization:hhs-gov)&facet.field=[%22identifier%22]&facet.limit=-1&facet.mincount=2" | jq '.result.facets.identifier | length'

785

Sketch

[Notes or a checklist reflecting our understanding of the selected approach]

@FuhuXia FuhuXia added the bug Software defect or bug label Nov 8, 2022
@FuhuXia
Copy link
Member Author

FuhuXia commented Nov 21, 2022

close this one since #4073 has better description,

@FuhuXia FuhuXia closed this as completed Nov 21, 2022
@FuhuXia FuhuXia moved this to ✔ Done in data.gov team board Nov 21, 2022
@btylerburton btylerburton added the harvest-duplicates Issues related to Duplicated Datasets label Dec 21, 2023
@hkdctol hkdctol moved this from ✔ Done to 🗄 Closed in data.gov team board Dec 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Software defect or bug harvest-duplicates Issues related to Duplicated Datasets
Projects
Archived in project
Development

No branches or pull requests

2 participants