Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Addition of ins_serial_no requirement of validate_glatos_receivers negates use of hfx_deployments.csv as test data for read_otn_deployments` #202

Open
mhpob opened this issue Jan 14, 2024 · 6 comments

Comments

@mhpob
Copy link
Contributor

mhpob commented Jan 14, 2024

The addition of ins_serial_no as a required column in validate_glatos_receivers seems to have broken test-read_otn_deployments (commit 5f6b727). Seems to run through glatos_check_col_names, also introduced in 5f6b727.

The test data set, hfx_deployments.csv, does not contain an ins_serial_no column.

@mhpob mhpob changed the title ins_serial_no requirement negates use of hfx_deployments.csv as a test data set Addition of ins_serial_no requirement of validate_glatos_receivers negates use of hfx_deployments.csv as a test data set Jan 14, 2024
@mhpob mhpob changed the title Addition of ins_serial_no requirement of validate_glatos_receivers negates use of hfx_deployments.csv as a test data set Addition of ins_serial_no requirement of validate_glatos_receivers negates use of hfx_deployments.csv as test data for read_otn_deployments` Jan 14, 2024
@chrisholbrook
Copy link
Collaborator

I am not aware of any glatos function that requires ins_serial_no, so perhaps this requirement should be relaxed; but it also seems like a reasonable requirement for an object about devices. @jdpye, is hfx_deployments.csv still an active file structure in OTN?

@jdpye
Copy link
Member

jdpye commented Jan 24, 2024

Yep, it's sourced from our instrument metadata shortform: https://members.oceantrack.org/data/data-collection/otn-instrument-deployment-short-form.xlsx

So most if not all OTN projects will have a file like this. I don't have the ability to build this at present using our GeoServer, but i could try and build a layer for this in the near future, so we could default to a web-facing example of this data.

@jdpye
Copy link
Member

jdpye commented Jan 24, 2024

on further review, the existence of columns like off_set in this example means it was likely to be sourced from our longer-form metadata sheets, where we track things like how far up the riser is from the bottom mooring.

I still think we should accept ins_serial_no and agree to looking into whether we can soften its requirement, but this HFX example is even more than our basic deployment metadata requires.

@jdpye
Copy link
Member

jdpye commented Jan 24, 2024

further-further review - the true source looks to be our geoserver layer otn.stations_receivers. We're looking into if we can add ins_serial_no to that layer. https://members.oceantrack.org/geoserver/otn/ows?service=WFS&version=1.0.0&request=GetFeature&typeName=otn:stations_receivers&outputFormat=csv

@mhpob
Copy link
Contributor Author

mhpob commented Jan 29, 2024

Seems like two separate issues now: a fix of the OTN geoserver to include receiver number and a fix of the hfx_deployments.csv in glatos. The former seems more complicated due to the various permissions involved.

Is it correct, then, to say that GLATOS deployment data must include ins_serial_no, but there is no such requirement for OTN deployment data? If so, would it be faster to bake that distinction into read_otn_deployments by checking if the column exists and adding a dummy column if not before it gets validated rather than restructuring how OTN reports data?

If that's wrong and OTN usually reports a serial number that should be checked, hfx_detections has a collectornumber column that could be used to create a serial number and tack it onto hfx_deployments. The script to do this could be documented in the instdata and reverted/altered if something better comes along.

@jdpye
Copy link
Member

jdpye commented Jan 29, 2024

Instrument serial number is marked 'required' on our deployment sheets, we usually follow up or pull from manufacturer specifications if we don't get them from the data submitters. So we will be in good shape to require this, just have to express it in the datasets we're pushing out into the world via GeoServer.

https://ocean-tracking-network.github.io/node-manager-training/07_deploy_metadata/index.html

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants