Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TWDH-specific: metadata fields blanked out when running 'datapusher resubmit' #102

Open
twdbben opened this issue Jun 26, 2023 · 4 comments

Comments

@twdbben
Copy link

twdbben commented Jun 26, 2023

Describe the bug

In the specific context of the Texas Water Data Hub schema, we run the datapusher resubmit command nightly to update resources using the following command:

ckan -c /etc/ckan/default/ckan.ini datapusher resubmit --yes

When we run this command, the 'primary_tags' and 'secondary_tags' scheming-based metadata fields on the datasets that it operates on are always blanked out when the operation completes.

To reproduce this bug, in the TWDH environment, first make sure the tag fields are set on a given dataset, then view the dataset via the API, for instance:

https://txwaterdatahub.org/api/3/action/package_show?id=surface-water-rights-water-use

Then, run the resubmit command listed above.

Wait a few seconds after the command completes - the tag-blanking does tend to lag a few seconds for some reason - and re-request the dataset from the API. Notice that the 'primary_tags' and 'secondary_tags' fields will now be empty.

@TomeCirun
Copy link
Collaborator

TomeCirun commented Jun 29, 2023

Hey @twdbben, I cannot reproduce the bug locally, I did as you suggested and everything works fine.
I worked on dev-1.0 branch + qsv 0.91.0

image image

@jqnatividad
Copy link
Contributor

Hi @TomeCirun ,
Thanks for trying to reproduce the bug on the dev-v1.0 branch usingqsv 0.91.0.

Since it will be a while till @twdbben uses dev-v1.0, can you try to repro in on DP+ 0.15.0?

It will be useful too as I'm aiming to feature-freeze 0.15.x, and that will be the basis for synching the functionality with DP+ v1.x.

@twdbben
Copy link
Author

twdbben commented Jun 30, 2023

Thanks for investigating this @TomeCirun

Not sure if it means anything, but I find it interesting that the primary_tags and secondary_tags in your screenshot appear to be a quoted blob whereas when I look at them on my instance they are formatted as a proper list.

https://dev.txwaterdatahub.org/api/3/action/package_show?id=data-dan-s-data

Screenshot 2023-06-30 at 1 17 55 PM

Not suggesting this is the root if the problem, but perhaps this indicates something is different between our installs?

LMK if there is anything from my end that I can provide to help in replicating this issue. We are currently seeing the issue on all of our installs.

@TomeCirun
Copy link
Collaborator

TomeCirun commented Jul 1, 2023

I tested it against DP+ 0.15 and got the same result as with the dev-1.0 branch.

Regarding the type of tags, you probably have some converter for it in some of the extensions you are using. It would be useful to me if I know all the extensions(with the proper version) you are using so I can try again to reproduce the bug.

I think that the bug is not related to DP+, as the logic behind DP+ is not changing the package's metadata.

@twdbben @jqnatividad

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants