TL;DR - I am looking for recommendations/feedback on how best to set up duplicacy to back up a bunch of disk snapshots from several machines.
So I am using Veeam agents to store volume-level snapshots of system drives from several machines (about 10 or so, perhaps more) on a NAS. If you’re not familiar with Veeam, it takes periodic snapshots of the whole disk and stores each one as a single file, with incremental files generated on other days that only cover changes (so they are usually much smaller).
But let’s ignore increments for now and pretend that we only need to handle full backups. What I want to do is dump these snapshots into cloud storage(s) via duplicacy. My primary goal is to minimize storage requirements on the NAS, with minimizing cloud storage and transfer time as a secondary goal.
Veeam does compress images, and has several levels of compression. Compression can be completely disabled, or it can use dedupe-friendly compression (I believe RLE) or the optimal level, which I believe is LZ4 (there are stronger compression levels, but these usually make little sense due to CPU usage). The differences in local storage requirements are considerable, something like 430GB / 324GB / 240GB respectively for a 500GB system drive.
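To put those numbers side by side, here is a quick sketch of the arithmetic. The three sizes are the ones quoted above; the machine count of 10 and the assumption that every machine looks like this one are mine, purely for illustration.

```python
# Image sizes quoted above for one 500GB system drive, per Veeam compression level.
SIZES_GB = {"none": 430, "dedupe-friendly": 324, "optimal": 240}


def relative_sizes(sizes_gb, baseline_key="none", machines=10):
    """Return, per level, the size relative to the baseline and a naive fleet total.

    `machines` is an assumed count; the fleet total simply multiplies one
    machine's image size and ignores any deduplication.
    """
    baseline = sizes_gb[baseline_key]
    return {
        level: {
            "ratio_vs_baseline": round(size / baseline, 3),
            "fleet_total_gb": size * machines,
        }
        for level, size in sizes_gb.items()
    }


if __name__ == "__main__":
    for level, stats in relative_sizes(SIZES_GB).items():
        print(level, stats)
```

So the optimal level takes a bit over half the space of the uncompressed image, and the open question is whether dedupe on the less compressed variants can win back more than that difference.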
The question is, how effective would duplicacy’s dedupe be on such a source? In theory, there should be a lot of overlap in data between different machines, as a lot of them would be regular Windows laptops with the bulk of the data being the same OS (e.g. Win 11). But given that our source is a disk image in a single file, would we see meaningful dedupe across machines? And how would RLE/LZ4 compression of such files impact it?
I am planning to use a separate repository and probably a separate storage for these snapshots, as this would be very different data from the regular file-based stuff, likely with a very different pruning policy.
If you don’t have experience with compressed images, we can start with regular uncompressed images (like ones from dd): would you see meaningful dedupe for such images across different machines with the same OS?
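For reference, here is a rough sketch of how the overlap between two image files could be measured before committing to a setup. It is not duplicacy's algorithm: as far as I understand, duplicacy uses variable-size (content-defined) chunks, about 4MB on average by default, while this sketch hashes fixed-size blocks as a simple stand-in. The function names and the block size are my own choices, so treat the result as a ballpark figure only.

```python
import hashlib

# Assumed block size, picked to be in the same range as duplicacy's default
# average chunk size; this is a stand-in, not duplicacy's real chunking.
DEFAULT_BLOCK_SIZE = 4 * 1024 * 1024


def block_hashes(path, block_size=DEFAULT_BLOCK_SIZE):
    """Return the set of SHA-256 digests of the fixed-size blocks of a file."""
    digests = set()
    with open(path, "rb") as f:
        while True:
            block = f.read(block_size)
            if not block:
                break
            digests.add(hashlib.sha256(block).digest())
    return digests


def shared_fraction(path_a, path_b, block_size=DEFAULT_BLOCK_SIZE):
    """Fraction of the unique blocks in file A that also occur in file B."""
    a = block_hashes(path_a, block_size)
    b = block_hashes(path_b, block_size)
    if not a:
        return 0.0
    return len(a & b) / len(a)
```

Calling `shared_fraction("laptop1.img", "laptop2.img")` on two images (file names are placeholders) gives the share of blocks that a block-level dedupe could reuse, and rerunning it with a smaller `block_size` shows how sensitive the overlap is to chunk size. Running the same comparison on the RLE and LZ4 variants of the same two machines would show how much compression hurts it.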