detection of reflinks not working on btrfs #590
Comments
There are a number of outstanding issues with --is-reflink that I have been meaning to release fixes for (see #531 and the issues it references). Could you show me the output of
Also, the best you can expect from running rmlint on reflinks right now is
Thanks for the quick response.
My main concern was that after running rmlint.sh with the reflink config and then running rmlint again, I was still seeing the same number of duplicate files. So does rmlint still count reflinked files and their content as duplicates?
Some files are small enough that they cannot be reflinked. This is because small files are stored using inline extents; by default, the threshold is 2 KB. Your filefrag output shows
As far as rmlint is concerned, reflinked files are still duplicates, because aside from the space savings, they are independent files. Reflinks are not detected early enough to affect any printed statistics, as reflinks should not (normally) prevent matching files from being identified in outputs like CSV or JSON, which have broader uses than freeing up disk space. You can tell that a previous rmlint run created reflinks by looking for
If you are looking for a quick heuristic that lets rmlint skip files that appear to be reflinked, this is tracked in issue #328. There is a work-in-progress implementation of --keep-reflinked available in my fork, but all of that code is experimental right now.
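As a rough illustration of the size cutoff described above, here is a minimal Python sketch that separates files likely to be stored inline (and so not reflinkable) from those that are large enough to be reflink candidates. The names `split_by_inline_threshold` and `INLINE_THRESHOLD` are hypothetical and not part of rmlint. The 2048-byte value is taken from the 2 KB default mentioned above; the real limit depends on how the filesystem is mounted, so treat this as a heuristic only.

```python
import os
import sys

# Assumption: 2048 bytes mirrors the 2 KB default inline-extent limit
# mentioned in the thread; the actual limit depends on mount options.
INLINE_THRESHOLD = 2048


def split_by_inline_threshold(paths, threshold=INLINE_THRESHOLD):
    """Split paths into (too_small, candidates) by file size.

    Files at or below the threshold are assumed to be stored inline and
    therefore unlikely to be reflinked; larger files are candidates.
    This looks only at size and does not inspect extents.
    """
    too_small, candidates = [], []
    for path in paths:
        if os.path.getsize(path) <= threshold:
            too_small.append(path)
        else:
            candidates.append(path)
    return too_small, candidates


if __name__ == "__main__":
    # Usage: python split_inline.py FILE [FILE ...]
    small, large = split_by_inline_threshold(sys.argv[1:])
    for path in small:
        print(f"probably inline, not reflinkable: {path}")
    for path in large:
        print(f"reflink candidate: {path}")
```

Duplicates that end up in the first group would be expected to keep showing up as separate copies after running rmlint.sh, which is one possible explanation for small files that never appear to be reflinked.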
Thanks for the clarification. I will play around with that some more and find out if it will work for me.
Alright. Feel free to reopen if you think there is something that should be improved.
I am experimenting with rmlint version 2.10.1 on a btrfs filesystem, running a brand-new 6.0.0 kernel.
I ran
rmlint -g -T "none +df" -o pretty -c sh:reflink .
on a folder with some duplicates and then ran the rmlint.sh script successfully. I have checked some of the former duplicates with a piece of software I found (https://github.com/pwaller/fienode) and it claims they are reflinked.
When I run the above command again, it shows me the same results as before, meaning it claims there are several duplicates.
When I run rmlint --is-reflink on one of the reflinked files, I get exit code 5.
So it seems rmlint does not detect the reflinked files due to 'fiemaps can't be read'.
What could be the reason for that?