You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
If no validation is given, the PathIterableDataset will try to split files (rather than patches) between train and validation. However, with a single file this will always throw an error.
This is only applicable for when the data does not fit in memory (according to CAREamics definition), which is an impossible case: if the data does not fit in memory, then we cannot train from this single file.
This issue cannot really be fixed, unless we complexify even further the dataset (e.g. keep the validation set in memory and extract it randomly in a first pass).
I am leaning towards waiting for the Zarr dataset, and then just retire the PathIterableDataset. We could then provide a convenience function to convert train/validation/test files into a single Zarr archive and use it for training/prediction.
The text was updated successfully, but these errors were encountered:
This is only applicable for when the data does not fit in memory (according to CAREamics definition), which is an impossible case: if the data does not fit in memory, then we cannot train from this single file.
Probably the error that the file is too big should be raised before we get to this point!
Describe the bug
If no validation is given, the
PathIterableDataset
will try to split files (rather than patches) between train and validation. However, with a single file this will always throw an error.Error:
This is only applicable for when the data does not fit in memory (according to CAREamics definition), which is an impossible case: if the data does not fit in memory, then we cannot train from this single file.
This issue cannot really be fixed, unless we complexify even further the dataset (e.g. keep the validation set in memory and extract it randomly in a first pass).
I am leaning towards waiting for the Zarr dataset, and then just retire the
PathIterableDataset
. We could then provide a convenience function to convert train/validation/test files into a single Zarr archive and use it for training/prediction.The text was updated successfully, but these errors were encountered: