From: "Daniel Kral" <d.kral@proxmox.com>
To: "Daniel Kral" <d.kral@proxmox.com>,
"Thomas Lamprecht" <t.lamprecht@proxmox.com>,
<pve-devel@lists.proxmox.com>
Subject: Re: [RFC PATCH-SERIES many 00/36] dynamic scheduler + load rebalancer
Date: Tue, 24 Mar 2026 09:51:47 +0100
Message-ID: <DHAVUREQ2HVP.2ZO6AXM6MHUNY@proxmox.com>
In-Reply-To: <DH6N64HXPUR8.2AVFPM6VJLLA2@proxmox.com>
On Thu Mar 19, 2026 at 10:12 AM CET, Daniel Kral wrote:
> On Wed Mar 18, 2026 at 5:54 PM CET, Thomas Lamprecht wrote:
>> ScoredMigration's Ord only compares imbalance, so two migrations with
>> the same imbalance but different source/target count as Equal, which
>> makes the BinaryHeap output order unpredictable. Maybe use the Migration
>> field, which is already Ord itself, to break any ties here as a secondary
>> key.
>
> Thanks! I forgot to add this as a FIXME there for the RFC series.
> I had a
>
>     impl Ord for ScoredMigration {
>         fn cmp(&self, other: &Self) -> Ordering {
>             self.imbalance
>                 .total_cmp(&other.imbalance)
>                 .reverse()
>                 .then(self.migration.cmp(&other.migration))
>         }
>     }
>
> before, but while testing it seemed to not sort as expected. I haven't
> looked into this yet, though I guess that different calculations might
> end up in different exponents, which totalOrder does define as unequal
> [1].
>
> I'll briefly test this again, but sorting here in some reasonable way is
> still better than letting the order of the input data decide.
>
> [1] https://en.wikipedia.org/wiki/IEEE_754#Total-ordering_predicate

Indeed, there was a subtle issue: the f64 imbalance values were not the
same even though they "seemed" to be.

I ran into the problem with a pve-ha-manager test case yesterday, which
was flaky and _sometimes_ ordered two migrations with seemingly
identical imbalance values differently:
{ sid: "vm:102", source_node: "node1", target_node: "node3", imbalance: 0.723174948891693 }
{ sid: "vm:102", source_node: "node1", target_node: "node2", imbalance: 0.723174948891693 }

Perl seems to truncate the 16th decimal place here; closer inspection
with some tracing showed that the actual values were:
ScoredMigration { sid: "vm:102", source_node: "node1", target_node: "node3", imbalance: 0.723174948891693 }
ScoredMigration { sid: "vm:102", source_node: "node1", target_node: "node2", imbalance: 0.7231749488916931 }
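
As a rough standalone illustration (not code from the series), the two
values from the trace can be checked directly. The helper `show_15` is
made up for this sketch and only approximates what I assume Perl's default
number formatting does, i.e. printing about 15 significant digits:

```rust
use std::cmp::Ordering;

/// Format with 15 decimal places; for values in [0.1, 1) this is the same
/// as 15 significant digits. Hypothetical helper, only for illustration.
fn show_15(x: f64) -> String {
    format!("{:.15}", x)
}

fn main() {
    // The two imbalance values from the trace output.
    let a: f64 = 0.723174948891693;
    let b: f64 = 0.7231749488916931;

    // Printed with 15 digits, both values look identical ...
    assert_eq!(show_15(a), show_15(b));

    // ... but they are distinct f64 bit patterns, so total_cmp orders them.
    assert_ne!(a.to_bits(), b.to_bits());
    assert_eq!(a.total_cmp(&b), Ordering::Less);

    println!("{} vs {} -> {:?}", show_15(a), show_15(b), a.total_cmp(&b));
}
```

So a comparison on the raw f64 never reaches a secondary key for such
a pair, because the primary key already differs.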

We make sure that any interaction with `Usage` in the HA Manager is
deterministic in the sense that we go through the services in the same
order every time, e.g., by sorting the keys. But I guess that the
iteration order of hashbrown's HashMap is not deterministic, as the
struct itself takes a RandomState, so we couldn't rely on the order of
the fp operations here.

I tried using BTreeMap and BTreeSet to make the iteration order
deterministic, but this increased the runtime by roughly 10x, which is
unacceptable for larger cluster sizes. It also doesn't fix the problem
directly, because such approximation errors caused by a changed order of
fp operations could occur even with deterministic calculations.
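
The order dependence itself is easy to reproduce in isolation. A minimal
sketch, unrelated to the actual imbalance calculation, that only shows
that f64 addition is not associative (`sum_in_order` is a made-up helper):

```rust
/// Sum the values left to right with plain f64 addition.
fn sum_in_order(values: &[f64]) -> f64 {
    values.iter().fold(0.0, |acc, v| acc + v)
}

fn main() {
    // Same inputs, different summation order.
    let forward = sum_in_order(&[0.1, 0.2, 0.3]);
    let backward = sum_in_order(&[0.3, 0.2, 0.1]);

    // The results differ in the last bits only ...
    assert_ne!(forward, backward);
    assert!((forward - backward).abs() < 1e-15);

    // ... which is enough for total_cmp to treat them as unequal.
    assert_ne!(forward.total_cmp(&backward), std::cmp::Ordering::Equal);

    println!("forward = {:?}, backward = {:?}", forward, backward);
}
```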

For the v2, I'll take the easy route and just truncate these numbers
before ordering them, as the 16th decimal place is not a significant
digit anymore and shouldn't be relied on for ordering.
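
Roughly, this could look like the sketch below. It is not the v2
implementation: the `Migration` and `ScoredMigration` types are
simplified stand-ins whose field names follow the trace output, and the
`SCALE` cut-off of 12 decimal places as well as the helpers `scored` and
`pop_targets` are assumptions made up for this example.

```rust
use std::cmp::Ordering;
use std::collections::BinaryHeap;

// Simplified stand-in; the derived Ord compares the fields in order.
#[derive(Clone, Debug, PartialEq, Eq, PartialOrd, Ord)]
struct Migration {
    sid: String,
    source_node: String,
    target_node: String,
}

#[derive(Clone, Debug)]
struct ScoredMigration {
    migration: Migration,
    imbalance: f64,
}

// Assumed cut-off: keep 12 decimal places for the comparison.
const SCALE: f64 = 1e12;

impl ScoredMigration {
    fn truncated_imbalance(&self) -> f64 {
        (self.imbalance * SCALE).trunc() / SCALE
    }
}

impl Ord for ScoredMigration {
    fn cmp(&self, other: &Self) -> Ordering {
        // Lower imbalance is "greater" so that BinaryHeap pops it first;
        // ties on the truncated value fall through to the migration.
        self.truncated_imbalance()
            .total_cmp(&other.truncated_imbalance())
            .reverse()
            .then_with(|| self.migration.cmp(&other.migration))
    }
}

impl PartialOrd for ScoredMigration {
    fn partial_cmp(&self, other: &Self) -> Option<Ordering> {
        Some(self.cmp(other))
    }
}

impl PartialEq for ScoredMigration {
    fn eq(&self, other: &Self) -> bool {
        self.cmp(other) == Ordering::Equal
    }
}

impl Eq for ScoredMigration {}

// Hypothetical constructor for the test data from the trace.
fn scored(target_node: &str, imbalance: f64) -> ScoredMigration {
    ScoredMigration {
        migration: Migration {
            sid: "vm:102".to_string(),
            source_node: "node1".to_string(),
            target_node: target_node.to_string(),
        },
        imbalance,
    }
}

// Drain a heap built from `items` and return the target nodes in pop order.
fn pop_targets(items: Vec<ScoredMigration>) -> Vec<String> {
    let mut heap = BinaryHeap::from(items);
    let mut out = Vec::new();
    while let Some(m) = heap.pop() {
        out.push(m.migration.target_node);
    }
    out
}

fn main() {
    let a = scored("node3", 0.723174948891693);
    let b = scored("node2", 0.7231749488916931);

    // The pop order no longer depends on the insertion order.
    let first = pop_targets(vec![a.clone(), b.clone()]);
    let second = pop_targets(vec![b, a]);
    assert_eq!(first, second);
    println!("{:?}", first);
}
```

Note that values sitting right at a truncation boundary can still end up
on different sides of it, so this only reduces the problem to cases where
the difference is larger than the chosen cut-off.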