This repository has been archived by the owner on Aug 1, 2024. It is now read-only.
Availability of the validation dataset #563
dvryaboy started this conversation in ESM Metagenomic Atlas
Replies: 0 comments
Hi folks, thanks for making this amazing resource available. In the paper, you describe the process you used to separate out the validation dataset as follows:
Is it possible to make the training and validation datasets prepared this way available, or to share the random seed and command you used to select the validation set? This would help avoid any training set contamination concerns for folks trying to reproduce and iterate on this work.