This repository has been archived by the owner on Nov 10, 2023. It is now read-only.

Digital Preservation System compliance #27

Open
trym-b opened this issue Aug 14, 2023 · 1 comment

Comments


trym-b commented Aug 14, 2023

Currently, the checksum file generated by veidemann-warcvalidator uses the pre-DPS format <32-char-md5> <filepath>\n

DPS requires checksums in the following format: <32-char-md5> *<filepath>\n. The checksum files should also be named checksum_transferred.md5.

The checksum format should be changed as soon as the downstream implementations can support it. Bear in mind that both the rclone and veidemann-c-potet projects would need to be robustly tested with the new files created by warcvalidator.
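To make the downstream testing concrete, here is a rough sketch of a parser that accepts both line formats mentioned above. It is not taken from rclone or veidemann-c-potet; `parse_checksum_line` is a hypothetical name, and the sketch assumes the pre-DPS lines follow the md5sum text-mode convention of two spaces between digest and path.

```python
import re

# Hypothetical helper for illustration; not code from any of the projects named here.
# Assumption: after the 32-char digest comes one space, then either a second
# space (md5sum text mode) or '*' (md5sum binary mode), then the file path.
_LINE = re.compile(r"^([0-9a-fA-F]{32}) ([ *])(.+)$")


def parse_checksum_line(line):
    """Return (digest, is_binary_marker, path) or raise ValueError."""
    match = _LINE.match(line.rstrip("\n"))
    if match is None:
        raise ValueError(f"unrecognised checksum line: {line!r}")
    digest, marker, path = match.groups()
    return digest.lower(), marker == "*", path
```

A consumer written this way would keep working while producers switch from one separator to the other.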

In addition, there is an argument that the creation of the checksums should not be done by this tool, especially if we want others to use veidemann too.

Member

maeb commented Sep 25, 2023

When generating a checksum file we try to mimic the output of the md5sum tool. The md5sum tool signals that a file has been read in binary mode (-b flag) by substituting the last of the two space separators with a *.
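As an illustration of the md5sum convention described above (this is a Python sketch, not the warcvalidator code, and `md5sum_line` is a made-up name), producing one line in either mode could look like this:

```python
import hashlib


def md5sum_line(path, binary_marker=True):
    """Build one checksum line in the style of md5sum output.

    Illustrative only: the file is always read as bytes; binary_marker
    merely selects the separator (" *" for binary mode, two spaces for text mode).
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large WARC files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    separator = " *" if binary_marker else "  "
    return f"{digest.hexdigest()}{separator}{path}\n"
```

The digest itself is identical in both modes; only the separator changes.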

Since the code is already reading the file in binary mode the only thing we need to do to comply is changing the separator here:
