Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Probably incorrect unit of measure in the table column "Raw Size" #58

Open
Artanias opened this issue Jan 25, 2025 · 0 comments
Open

Probably incorrect unit of measure in the table column "Raw Size" #58

Artanias opened this issue Jan 25, 2025 · 0 comments

Comments

@Artanias
Copy link

Artanias commented Jan 25, 2025

Hello, thank you very much for your contribution!

I have started to recalculate your results and in the table column "Raw Size" I think that some of the results must have unit of measure GiB instead of GB.

For example for datasets Thunderbird, HDFS and BGL:

>>> import os
>>> bytes = os.path.getsize("bgl/BGL.log") 
>>> bytes
743185031
>>> bytes_to_MiB = bytes / 1024 / 1024
>>> bytes_to_MiB
708.7564764022827
# In the table 708.76MB

>>> bytes = os.path.getsize("thunderbird/Thunderbird.log")
>>> bytes
31788301041
>>> bytes_to_GiB = bytes / 1024 / 1024 / 1024
>>> bytes_to_GiB
29.605162368156016
# In the table 29.60GB

>>> bytes = os.path.getsize("hdfs/HDFS.log")
>>> bytes
1577982906
>>> bytes_to_GiB = bytes / 1024 / 1024 / 1024
>>> bytes_to_GiB
1.4696111027151346
# In the table 1.47GB
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant