From: Dietmar Maurer <dietmar@proxmox.com>
To: "Proxmox VE development discussion" <pve-devel@lists.proxmox.com>,
"Carsten Härle" <Carsten.Haerle@straightec.de>
Subject: Re: [pve-devel] Improve container backup speed dramatically (factor 100-1000)
Date: Fri, 20 Nov 2020 05:59:53 +0100 (CET) [thread overview]
Message-ID: <901006600.10.1605848394705@webmail.proxmox.com> (raw)
In-Reply-To: <57EF5F8B433A6742AD548ABECE78FE48539302@hal9001.straightec.lokal>
> Container backup is very slow compared to VM backup. I have a 500 GB container (sftp server) with minimal changing files, but even the incremental bakcups take 2 hours with heavy disk activity. Almost nothing is transfered to the backup server. It seems that it it reads the whole container everytime, without any optimization. Before I did backup with zfs send it there it took only a couple of seconds or minutes for every didfferencal backup.
Yes, that is how the current variable sized chunking algorithm works.
> See discussion here: https://forum.proxmox.com/threads/no-differantial-container-backup-with-big-containers.75676/#post-338868 <https://forum.proxmox.com/threads/no-differantial-container-backup-with-big-containers.75676/#post-338868>
>
> PBS is not storage agnostic but uses underlying snapshot feature according to the documentation: For container, the underlying snapshot feature of the file system ARE used, it already uses ZFS feature.
> https://pve.proxmox.com/wiki/Backup_and_Restore <https://pve.proxmox.com/wiki/Backup_and_Restore>
Yes, we use the snapshot feature. But the backup code is totally storage agnostic.
> For zfs file systems the set of changed file between snapshots can easy be displayed with "zfs diff", so PBS should use this feature to speed up large container backups dramatically.
"zfs diff" does not provide the information needed for our deduplication
algorithm, so we cannot use that. But if you have ideas how to make that work,
please shared them here.
next prev parent reply other threads:[~2020-11-20 5:00 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-11-19 19:29 Carsten Härle
2020-11-20 4:59 ` Dietmar Maurer [this message]
2020-11-20 7:18 ` Carsten Härle
2020-11-20 8:27 ` Dominik Csapak
2020-11-20 8:29 ` Dominik Csapak
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=901006600.10.1605848394705@webmail.proxmox.com \
--to=dietmar@proxmox.com \
--cc=Carsten.Haerle@straightec.de \
--cc=pve-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox