all lists on lists.proxmox.com
 help / color / mirror / Atom feed
From: Dominik Csapak <d.csapak@proxmox.com>
To: pdm-devel@lists.proxmox.com
Subject: [PATCH datacenter-manager v2 0/4] implement back-off mechanism for connection errors for remotes
Date: Mon,  8 Jun 2026 15:25:28 +0200	[thread overview]
Message-ID: <20260608132539.2949407-1-d.csapak@proxmox.com> (raw)

When a remote is not reachable (e.g. network outage, crashes, etc), PDM
tries to connect on every attempt with a timeout. This leads to heavily
delayed api calls in the PDM UI. To counter that, this series implements
a basic back-off mechanism that increases the time between actual api
calls in an exponential way (up to a maximum).

For details on how the back-off mechanism works see patch 1/4

Possible Improvements/Future Work:
* We could expose the back-off values via a config (either global or per
  remote) to give the admin some fine grained control over this behavior
* There is still quite a bit of logs after this, but this can be cleaned
  up/improved upon later too.

changes from v1:
* rebased on master (dropped equivalent patches)
* rework most of the code

Dominik Csapak (4):
  server: remote cache: prepare for back-off mechanism
  server: remote cache: introduce canary remote when none is reachable
  server: connection: multi-client: use back-off state from remote cache
  tasks: remote node mapping: use host cache for PBS too

 .../tasks/remote_node_mapping.rs              |  34 ++-
 server/src/connection.rs                      | 118 +++++++---
 server/src/remote_cache/back_off.rs           | 128 ++++++++++
 server/src/remote_cache/mod.rs                | 222 +++++++++++++++++-
 4 files changed, 439 insertions(+), 63 deletions(-)
 create mode 100644 server/src/remote_cache/back_off.rs

-- 
2.47.3





             reply	other threads:[~2026-06-08 13:26 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-08 13:25 Dominik Csapak [this message]
2026-06-08 13:25 ` [PATCH datacenter-manager v2 1/4] server: remote cache: prepare for back-off mechanism Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 2/4] server: remote cache: introduce canary remote when none is reachable Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 3/4] server: connection: multi-client: use back-off state from remote cache Dominik Csapak
2026-06-08 13:25 ` [PATCH datacenter-manager v2 4/4] tasks: remote node mapping: use host cache for PBS too Dominik Csapak

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260608132539.2949407-1-d.csapak@proxmox.com \
    --to=d.csapak@proxmox.com \
    --cc=pdm-devel@lists.proxmox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal