Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Code analyses still coded as 'CHECK' #43

Closed
egouldo opened this issue Jun 16, 2024 · 10 comments
Closed

Code analyses still coded as 'CHECK' #43

egouldo opened this issue Jun 16, 2024 · 10 comments
Assignees
Labels
bug an unexpected problem or unintended behavior

Comments

@egouldo
Copy link
Owner

egouldo commented Jun 16, 2024

Hi Hannah or @parkerth,

Could you please check the master_data file (at "data-raw/anonymised_data/master_data.csv") and code as necessary or mark as NA if can't resolve? there are a few analyses remaining with values 'CHECK':

response_id submission_id analysis_id split_id test_variable Bayesian linear_model model_subclass exclusions_effect_analysis Conclusion data_cleaning_preprocessing_tool data_cleaning_preprocessing_version data_analysis_tool data_analysis_version
R_11787O3NmejXKAH 1 2 2 net_rearing_manipulation 0 generalised standard exclude_all CHECK R NA R NA
R_11787O3NmejXKAH 1 2 3 net_rearing_manipulation 0 generalised standard exclude_all CHECK R NA R NA
R_11787O3NmejXKAH 1 2 1 net_rearing_manipulation 0 generalised standard exclude_all CHECK R NA R NA
R_1eXlFKlQdiD2F59 2 2 1 rear_Cs_at_start_of_rearing 0 linear standard retain CHECK CHECK CHECK CHECK CHECK
R_1GJlffAgZv6SY4y 1 1 1 NA CHECK CHECK standard exclude_all none_c R 4.0.0 R 4.0.0
R_1M0cMZL2IPYWHoi 1 1 1 NA CHECK CHECK CHECK exclude_all CHECK CHECK CHECK CHECK CHECK
R_1M0cMZL2IPYWHoi 1 1 1 NA 1 generalised CHECK exclude_all NA R 3.6.3 R 3.6.3
R_1QlnXdW5tKuUQIr 1 1 1 CHECK CHECK CHECK CHECK CHECK neg_q R 3.6.1 R 3.6.1
R_21gmMa0uclrNoTP 2 1 1 NA CHECK linear CHECK exclude_all CHECK CHECK CHECK CHECK CHECK
R_2Pjoz1X4q5XRClO 2 1 1 NA CHECK CHECK standard exclude_all CHECK CHECK CHECK CHECK CHECK
R_2zNKAmJcWbM4QtY 1 1 1 NA 0 CHECK standard retain none_q R 3.6.1 R 3.6.1
R_3EbbZxcQ3gctVZu 1 1 1 NA 0 CHECK hurdle exclude_all neg_q R 3.6.1 R 3.6.1
R_3Kvy0h01LXHWniT 2 2 1 NA 0 generalised standard retain CHECK R NA R NA
R_3nBCE4hMLh5s3qt 3 1 2 NA CHECK CHECK standard NA CHECK CHECK CHECK CHECK CHECK
R_3nBCE4hMLh5s3qt 3 1 1 NA CHECK CHECK standard NA CHECK CHECK CHECK CHECK CHECK
R_AzL6RdNTHtPjxzX 1 1 2 NA CHECK CHECK standard exclude_all CHECK CHECK CHECK CHECK CHECK
R_es2jrrN9CTGwl5D 1 1 1 NA CHECK CHECK CHECK exclude_all neg_q R 4.0.0 R 4.0.0
@egouldo egouldo added the bug an unexpected problem or unintended behavior label Jun 16, 2024
@egouldo egouldo assigned egouldo, hannahsfraser and parkerth and unassigned egouldo Jun 17, 2024
@egouldo egouldo pinned this issue Jun 17, 2024
@parkerth
Copy link
Collaborator

parkerth commented Jun 17, 2024

@egouldo
not sure what to do here. The rows with "CHECK" in the Bayesian and/or linear_model column are all 'exclude' (or in one case, 'check.mixed.team').
I guess I will go ahead and change them to NA, but it seems like there might be a bigger problem with the code if this matters. Is the 'exclude' code working?

@parkerth
Copy link
Collaborator

... also true for 'model_subclass' variable (all checks are also exclusions)

@parkerth
Copy link
Collaborator

parkerth commented Jun 17, 2024

... also true for these columns:

Conclusion | software | data_cleaning_preprocessing_tool | data_cleaning_preprocessing_version | data_analysis_tool | data_analysis_version

@parkerth
Copy link
Collaborator

I'm uploading a version of the master_data.csv file with all "CHECK" replaced by "NA". Just let me know if, instead, we need to revert to the former version with the "CHECK" values

@parkerth
Copy link
Collaborator

That file is now on branch(?) with a pull request(?) (if my use of jargon is correct)

@hannahsfraser
Copy link
Collaborator

Awesome stuff Tim. You're fast like a fox!

@egouldo
Copy link
Owner Author

egouldo commented Jun 18, 2024 via email

@parkerth
Copy link
Collaborator

That makes sense. Where do we stand at this point?

egouldo referenced this issue Jul 15, 2024
This is "master_data.csv" with all values of "CHECK" replaced with "NA". However, I think this shouldn't be needed, since all these rows were to have been excluded anyway.
egouldo added a commit that referenced this issue Jul 15, 2024
- fix: `checkmate -> NAmat`

7c0dfe5#r144244899
@egouldo egouldo unpinned this issue Jul 15, 2024
@egouldo egouldo added this to the Respond Reviewer Comments milestone Aug 1, 2024
egouldo added a commit that referenced this issue Aug 5, 2024
any strings that had additional text in them after CHECK retained the additional text rather than having the whole string be replaced.
@egouldo
Copy link
Owner Author

egouldo commented Aug 5, 2024

Note that Capella-1-1-1 was marked as CHECK.mixed.team for exclusions_all. which was replaced with NA.mixed.team, and then I just did a replacement with NA across multiple columns for this entry in prev. commit, b00d31f. But that probably means we should check whether we want to include it at all @parkerth ? Note that I did also notice that the response_variable_name was coded incorrectly as "euc_sdlgs>2m" in the course of checking #80 and have replaced it. So I think this entry probably needs double checking anyways.

@egouldo
Copy link
Owner Author

egouldo commented Aug 10, 2024

Looks like has been fixed, row has been marked as 'exclude' and 'exclude_all' under relevant exclusion columns.

@egouldo egouldo closed this as completed Aug 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug an unexpected problem or unintended behavior
Projects
None yet
Development

No branches or pull requests

3 participants