The data load/save mechanism and formats work, but they are very slow and inefficient. For instance, saving and loading lists of dicts or dicts of dicts is extremely slow.
Figure out whether the format that pupil saves to is the right choice at run time, or whether simple pandas pickling is the better option.
Which data formats should be preserved and which should be changed, e.g. the pupil and marker position dictionaries?
What about the earlier test code that uses the HDF5 format?
[ Evaluate the performance of the different formats, at least the top 2-3 candidates ]
[ Implement saving first ]
[ Implement reading next ]
[ Test the failure cases, e.g. a mid-analysis failure followed by resuming where the analysis left off ]
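As a starting point for the evaluation and resume items above, here is a minimal sketch, not code from the repository. All function names (`time_roundtrip`, `run_step`, and the save/load helpers) are hypothetical, and only `.npz` and pickle are shown. Other candidates such as HDF5 would plug in as further save/load pairs.

```python
import os
import pickle
import time

import numpy as np


def save_npz(path, arraydict):
    # path should end in ".npz"; np.savez appends the suffix otherwise
    np.savez(path, **arraydict)


def load_npz(path):
    with np.load(path) as npz:
        return {key: npz[key] for key in npz.files}


def save_pickle(path, obj):
    with open(path, "wb") as f:
        pickle.dump(obj, f, protocol=pickle.HIGHEST_PROTOCOL)


def load_pickle(path):
    with open(path, "rb") as f:
        return pickle.load(f)


def time_roundtrip(save, load, data, path, repeats=3):
    """Return best-of-N (save_seconds, load_seconds) for one format."""
    save_times, load_times = [], []
    for _ in range(repeats):
        t0 = time.perf_counter()
        save(path, data)
        t1 = time.perf_counter()
        load(path)
        t2 = time.perf_counter()
        save_times.append(t1 - t0)
        load_times.append(t2 - t1)
    return min(save_times), min(load_times)


def run_step(path, compute, save=save_npz, load=load_npz):
    """Reuse a step's saved output if present; otherwise compute and save it."""
    if os.path.exists(path):
        return load(path)
    result = compute()
    # Write to a temporary name first so a crash mid-write never leaves
    # a truncated file under the final name.
    root, ext = os.path.splitext(path)
    tmp = root + ".partial" + ext
    save(tmp, result)
    os.replace(tmp, path)
    return result
```

Calling `time_roundtrip(save_npz, load_npz, data, "gaze.npz")` and `time_roundtrip(save_pickle, load_pickle, data, "gaze.pkl")` on the same real pipeline output gives directly comparable numbers. Wrapping each pipeline step in `run_step` means that a rerun after a failure skips every step whose output file already exists.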
If loading the outputs of the gaze pipeline is slow, the problem may lie with the loader. The following code should efficiently load the .npz files that are saved by some steps of gaze analysis. This takes ~1.5 seconds on the file I tested it on; it is faster (~650 ms) if the gaze pipeline converts its output to a dict of arrays before saving with np.savez() (the existing function data_analysis.gaze.gaze_utils.dictlist_to_arraydict converts lists of dicts to dicts of arrays). Loading the same data from a csv file with pandas takes ~550 ms, which is slightly faster, likely because csv files are not compressed; they are also about 60% bigger on disk (~78 MB vs ~49 MB).
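A minimal sketch of this kind of loader, which is not the original snippet. `dictlist_to_arraydict_sketch` is a hypothetical stand-in for `gaze_utils.dictlist_to_arraydict`, whose exact signature is not shown here, and the field names in the usage note are made up for illustration.

```python
import numpy as np


def dictlist_to_arraydict_sketch(records):
    """Convert a list of dicts with identical keys into a dict of arrays.

    Hypothetical stand-in for gaze_utils.dictlist_to_arraydict.
    """
    keys = records[0].keys()
    return {key: np.asarray([rec[key] for rec in records]) for key in keys}


def save_arraydict(path, arraydict):
    # One named array per key. np.savez_compressed is the compressed variant.
    np.savez(path, **arraydict)


def load_arraydict(path):
    # np.load on an .npz file is lazy; indexing each key reads that array.
    # allow_pickle=False works because plain numeric arrays need no pickling.
    with np.load(path, allow_pickle=False) as npz:
        return {key: npz[key] for key in npz.files}
```

For example, `save_arraydict("gaze.npz", dictlist_to_arraydict_sketch(records))` followed by `load_arraydict("gaze.npz")` returns a dict such as `{"timestamp": array, "norm_pos": array}`. If a DataFrame is preferred downstream, the 1-D arrays can be passed straight to `pandas.DataFrame`.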
The final decision for format should optimize the utility of the output for whatever analyses come next. DataFrames are a fine format.