How Duplicacy compares to other backup solutions

Charles · 27 April 2017 16:07

I wanted to chime in on this for others who find this page, since I have been testing for months now. Please note that this is all my own experience: I have been looking for what best suits my needs and have not given thought to how these products might suit others’ needs. gchen, feel free to delete this post if you don’t like it here. I understand this is the “issues” section.

*skip to the bottom to hear my thoughts on Duplicacy.

I started with no backup even though I knew better. I finally kicked myself one day and went with the CrashPlan free app for site-to-site backup plus unlimited cloud storage. It seemed to do fine at first, but the deduplication would choke once the dataset size increased, and backups of new data would take weeks to months. I dealt with this for a long time and tried to manage it with multiple backup sets.

Duplicati looked nice, but on the first day of testing it also choked after the first few hundred GB or so of data, possibly due to the deduplication algorithms. I kept tweaking the settings, but I was never completely satisfied with any of the setups I was able to achieve. This might be fine for some, but it wasn’t for me.

Arq worked and is even multithreaded, but I hardly noticed any deduplication (I’ve had similar results with just compression). I could live with that, but there were no options to manage how many versions to keep or for how long, the Windows UI was clunky, and there was no Linux version. From what I can tell, it was developed for Mac, then ported to Windows. Overall it didn’t seem like a good fit and I didn’t feel like I had good control over the data. The pricing was pretty nice for personal use though. My experience with their support team for things that weren’t working wasn’t great. They had some documentation about the implementation.

CloudBacko gave me hope for a bit since it posts documentation about how it works. However, it kept failing, the logging system was a nightmare, and having to switch between the many different screens was a pain. One of the craziest things with this software was that you needed a backup of your backup settings in order to restore to another computer. It had no way of recognizing existing backups without that, short of recreating the backup set on the other computer. I particularly liked the pricing structure of CloudBacko. I could buy the simple file backup now and purchase other modules later as I needed them. Even though there were so many options, it wasn’t confusing the way these things often are when everything is split apart like that.

CloudBerry Backup did everything I needed. It supports many storage options, local encryption, versioning, deletion policies, and easy control over all of the settings for each backup set, and boy was it fast. There were no deduplication options unless you wanted to purchase separate software and set up a dedupe server. It does have block-level backup, but that was documented as a feature for diff comparisons, not dedupe. My major problem was that the pricing structure didn’t really seem to support advanced home users. There were all of these hard-coded restrictions preventing you from running certain versions of their software on certain machines, plus there was the data cap. Why in the world are they charging an additional cost based on how much data I back up when they aren’t the ones storing it? A sales rep said he would give me a deal in exchange for some social media activity and I said sure. I tested the software and decided to buy it. All of a sudden he came back with a $200 price plus a maintenance plan to keep the software updated. I tried explaining to him that I’m a personal home user and that I was led to believe he was going to give me a price somewhere between the personal edition and the server edition. This was just 100 shy of the most expensive package. All of a sudden he started playing dumb and asking things like “so do you want the home edition”. I suppose they may be a fine choice for a business, but the hidden fees don’t make sense for personal use. They had documentation, but the organization made much of it hard to find.

Richard mentioned qBackup. I am not familiar with it, but I like the promises it makes. I also like that it claims to have the same UI across all platforms and the ability to restore across platforms. I never confirmed that CloudBerry could restore across platforms, but it is one of my requirements. I will not test this software out, though, since the FAQ clearly states that it does not support VSS. This is a deal breaker, since I have had a mountain of issues come from backup services not using VSS while I am actively using the data.

After some initial tests, I am choosing to implement Duplicacy as my backup software. Why? It seems to excel at everything I’ve previously mentioned. There is plenty of design documentation which outlines what I believe to be a pretty clever implementation. It achieves deduplication (as far as I can tell) at a linear/constant speed regardless of data size, and does so across multiple backup sets and computers without a deduplication server as an intermediary (which I have seen as a solution in a few deduplication products now). It is moderately fast with single-threaded uploads and, from what I hear, will support multi-threading for all storages soon. Support has been fantastic even though I haven’t purchased anything yet. The pricing model is easy to understand and reasonable. The licensing is awesome and there are plans to release the source code.
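To illustrate why deduplication across backup sets and computers does not need an intermediary server, here is a minimal sketch of content-addressed chunk storage. It is only an illustration of the general idea, not Duplicacy's actual implementation: the fixed 4-byte chunk size, the in-memory dict standing in for cloud storage, and the function names backup and restore are all assumptions made for the example. Duplicacy's design documentation describes the real scheme.

```python
import hashlib

CHUNK_SIZE = 4  # unrealistically small, fixed-size chunks (assumption for the demo)


def backup(data: bytes, storage: dict) -> list:
    """Split data into chunks, store each chunk under its hash, return the hash list.

    `storage` is a plain dict standing in for a cloud bucket (hypothetical).
    Because a chunk's name is derived from its content, any client can tell
    whether a chunk already exists just by checking for that name.
    """
    snapshot = []
    for start in range(0, len(data), CHUNK_SIZE):
        chunk = data[start:start + CHUNK_SIZE]
        name = hashlib.sha256(chunk).hexdigest()
        # Only "upload" the chunk if no chunk with this name is stored yet.
        storage.setdefault(name, chunk)
        snapshot.append(name)
    return snapshot


def restore(snapshot: list, storage: dict) -> bytes:
    """Rebuild the original data by concatenating the chunks a snapshot lists."""
    return b"".join(storage[name] for name in snapshot)


if __name__ == "__main__":
    shared_storage = {}
    # Two "computers" back up overlapping data to the same storage.
    snap_a = backup(b"aaaabbbbcccc", shared_storage)
    snap_b = backup(b"aaaabbbbdddd", shared_storage)
    print(len(snap_a) + len(snap_b), "chunks referenced,",
          len(shared_storage), "chunks stored")
```

In this sketch the cost of checking for an existing chunk is a single name lookup, so it does not grow with the amount of data already stored, and no client needs to coordinate with another before uploading.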

Minor annoyances that may improve with time mainly come from the GUI. The GUI is nice and simple, but support for additional backup sets would be nice. The option to restore from additional backup repositories without switching the storage location would also be nice, though now that I trust the software more after testing and researching, maybe I will combine my backup repositories into the same location. It seemed odd at first that there was no folder selection tool, but since I normally select the root folder and then add a lot of excludes for what I don’t want (so that new folders get picked up automatically), it wasn’t really that bad. Not for a data drive anyway. For a desktop computer with multiple root-level directories, I ended up setting up a pseudo repository folder with symlinks in it.
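For anyone wanting to try the pseudo repository approach, here is a rough sketch of how such a folder could be built. It assumes the backup tool follows symlinks placed at the top level of the repository, which is what the setup described above relies on. The function name build_pseudo_repository and the example paths are hypothetical.

```python
import os


def build_pseudo_repository(repo_dir, source_dirs):
    """Create repo_dir and fill it with one symlink per source directory.

    Each link is named after the last component of the directory it points
    to, so every source needs a distinct, non-empty final name.
    Returns the list of link paths.
    """
    os.makedirs(repo_dir, exist_ok=True)
    links = []
    for source in source_dirs:
        name = os.path.basename(os.path.normpath(source))
        if not name:
            raise ValueError(f"cannot derive a link name from {source!r}")
        link_path = os.path.join(repo_dir, name)
        # Skip links that already exist so the function can be re-run safely.
        if not os.path.lexists(link_path):
            os.symlink(source, link_path, target_is_directory=True)
        links.append(link_path)
    return links


# Example call with hypothetical paths:
# build_pseudo_repository(r"C:\backup-repo", [r"C:\Users", r"D:\Projects"])
```

On Windows, creating symlinks typically requires an elevated prompt or Developer Mode, so the script may need to be run as administrator. The backup repository is then initialized in the folder that holds the links rather than in any of the real directories.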

gchen · 28 April 2017 01:38

Wow! I really appreciate you sharing your experience in such thorough detail. This can be very helpful to other users.

It is great that Duplicacy works for you. Duplicacy is unique because of the idea of Lock-Free Deduplication, and this should be the way backup is done in this cloud age. In my own opinion, any backup tool that does not follow this paradigm will have some flaws here and there. Of course we are still young and there is still room for improvement. In particular, the GUI version, which is a simple wrapper that relies on inter-process communication to talk to the CLI version, may not run as smoothly as a program with built-in backup/restore functionality.

Here is the short-term development plan:

Multi-threaded uploading and downloading (should be ready in a week or two)
A new backend for Google Cloud Storage based on the official Google client
Fair Source License
Rewrite the GUI version with a Go GUI library so it can run backup/restore without inter-process communication

By the way, your post deserves its own thread. Would you mind creating a new issue and pasting your post there?

DUser · 4 July 2018 18:40

Thank you, Charles, for that amazing post! It helps a lot!

I completely agree.

What is the best way to do that (setting up a pseudo repository folder with symlinks)? I’m a regular Windows 7 user and haven’t done much with symlinks yet.

leerspace · 6 July 2018 15:55

I’ve been happy with Link Shell Extension as a user-friendly option.

gchen · 6 July 2018 23:45

The current GUI version supports multiple repositories, so that workaround is no longer needed.