From: "Max Carrara" <m.carrara@proxmox.com>
To: "Christian Ebner" <c.ebner@proxmox.com>,
"Proxmox Backup Server development discussion"
<pbs-devel@lists.proxmox.com>
Cc: Thomas Lamprecht <t.lamprecht@proxmox.com>
Subject: Re: [pbs-devel] [PATCH v4 proxmox proxmox-backup 0/7] fix #4182: concurrent group pull/push support for sync jobs
Date: Wed, 09 Apr 2025 12:22:46 +0200
Message-ID: <D9218ALM6I5W.1P29S3INCGLUT@proxmox.com>
In-Reply-To: <850bd324-9251-4f69-9526-5cf6759bebd6@proxmox.com>

On Sat Apr 5, 2025 at 11:31 AM CEST, Christian Ebner wrote:
> On 4/4/25 20:01, Max Carrara wrote:
> > On Fri Apr 4, 2025 at 3:49 PM CEST, Christian Ebner wrote:
> >> Syncing contents from/to a remote source via a sync job suffers from
> >> low throughput on high latency networks because of limitations by the
> >> HTTP/2 connection, as described in [0]. To improve, syncing multiple
> >> groups in parallel by establishing multiple reader instances has been
> >> suggested.
> >>
> >> This patch series implements the functionality by adding the sync job
> >> configuration property `parallel-groups`, allowing to define the
> >> number of concurrent groups pull/push futures to be instantiated and
> >> executed for each job.
> >> The property is currently not exposed on the UI, as intended to be
> >> set in the config directly for now.
> >>
> >> Exemplary configuration:
> >> ```
> >> sync: s-8764c440-3a6c
> >> ns
> >> owner root@pam
> >> remote local
> >> remote-ns
> >> remote-store push-target-store
> >> remove-vanished false
> >> store datastore
> >> sync-direction push
> >> parallel-groups 4
> >> ```
> >>
> >> Since log messages are now also written concurrently, prefix logs
> >> related to groups, snapshots and archives with their respective
> >> context prefix and add context to error messages.
> >>
> >> Further, improve logging especially for sync jobs in push direction,
> >> which only displayed limited information so far.
> >>
> >> [0] https://bugzilla.proxmox.com/show_bug.cgi?id=4182
> >
> > So, I've given the code a good look -- unfortunately it's too late to do
> > any additional testing, but I wanted to shoot this out regardless in the
> > meantime.
> >
> > Code Review
> > ===========
> >
> > As always, the code quality is pristine -- I like that you're factoring
> > things out into little helper functions where applicable instead of
> > letting the existing methods grow. Very nice. Also applies cleanly
> > and is formatted correctly, naturally. Really can't complain, the
> > changes are straightforward and easy to follow.
> >
> > There's only a couple little things I spotted; see my comments inline.
> >
> > Regarding that large comment about mutexes and atomics: That's something
> > I just wanted to mention, so just to make it clear, you don't need to
> > apply my suggestion :P It's probably something we should have a look at
> > tree-wide for other data structures, too.
> >
> > Splendid work as always, anyhow!
> >
> > For now, until I get to test this, consider:
> >
> > Reviewed-by: Max Carrara <m.carrara@proxmox.com>
>
> Thanks for your efforts on this one, appreciated!
>
> There are however 2 things which make me hesitate with bringing this
> patch series further in its current approach:
>
> - Thomas raised valid concerns about the feasibility of adding such
> parallelism parameters just for the sake of a quick fix [0].
>
> - There was a user report about sync jobs being slow over a VPN
> connection, which resulted in networking adjustments which did
> significantly increase the throughput on his side [1]. He documented
> this in the bugtracker issue upon my request [2].
>
> So I would like to rather investigate if congestion control settings can
> help increase performance for the sync jobs rather than adding the
> parallel group sync for now.
>
> What do you think? (CC'ing also Thomas asking for his opinion).
>
> [0]
> https://lore.proxmox.com/pbs-devel/e5a2dcac-630e-4797-bbbf-f38bc260c2ca@proxmox.com/
> [1] https://forum.proxmox.com/threads/164450/post-761198
> [2] https://bugzilla.proxmox.com/show_bug.cgi?id=4182#c12

Huh! That's very interesting! Yeah, it seems perfectly reasonable to focus
on congestion control settings instead, then.

If you recall, Gabriel and I had cooked up a scheduler prototype a
couple of months ago. We had actually managed to get parallel backup jobs
going, e.g. with the maximum number of jobs being adjustable. My plan was
to extend this to pretty much any type of job, but there were a *lot* of
open questions and design decisions to be made in general there, so we
didn't continue iterating on it, as there were more important things to
address.

So, should we unbury that prototype at some point in the future, perhaps
we can incorporate some of the things from this series. I can't give
you an estimate on when that will happen though, as Gabriel is working
on SDN and I'm working on Storage atm.