From: Dominik Csapak <d.csapak@proxmox.com>
To: pdm-devel@lists.proxmox.com
Subject: [PATCH datacenter-manager v2 0/4] implement back-off mechanism for connection errors for remotes
Date: Mon, 8 Jun 2026 15:25:28 +0200 [thread overview]
Message-ID: <20260608132539.2949407-1-d.csapak@proxmox.com> (raw)
When a remote is not reachable (e.g. network outage, crashes, etc), PDM
tries to connect on every attempt with a timeout. This leads to heavily
delayed api calls in the PDM UI. To counter that, this series implements
a basic back-off mechanism that increases the time between actual api
calls in an exponential way (up to a maximum).
For details on how the back-off mechanism works see patch 1/4
Possible Improvements/Future Work:
* We could expose the back-off values via a config (either global or per
remote) to give the admin some fine grained control over this behavior
* There is still quite a bit of logs after this, but this can be cleaned
up/improved upon later too.
changes from v1:
* rebased on master (dropped equivalent patches)
* rework most of the code
Dominik Csapak (4):
server: remote cache: prepare for back-off mechanism
server: remote cache: introduce canary remote when none is reachable
server: connection: multi-client: use back-off state from remote cache
tasks: remote node mapping: use host cache for PBS too
.../tasks/remote_node_mapping.rs | 34 ++-
server/src/connection.rs | 118 +++++++---
server/src/remote_cache/back_off.rs | 128 ++++++++++
server/src/remote_cache/mod.rs | 222 +++++++++++++++++-
4 files changed, 439 insertions(+), 63 deletions(-)
create mode 100644 server/src/remote_cache/back_off.rs
--
2.47.3
next reply other threads:[~2026-06-08 13:26 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-08 13:25 Dominik Csapak [this message]
2026-06-08 13:25 ` [PATCH datacenter-manager v2 1/4] server: remote cache: prepare for back-off mechanism Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 2/4] server: remote cache: introduce canary remote when none is reachable Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 3/4] server: connection: multi-client: use back-off state from remote cache Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 4/4] tasks: remote node mapping: use host cache for PBS too Dominik Csapak
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260608132539.2949407-1-d.csapak@proxmox.com \
--to=d.csapak@proxmox.com \
--cc=pdm-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.