public inbox for pbs-devel@lists.proxmox.com
 help / color / mirror / Atom feed
* [pbs-devel] [PATCH proxmox-backup 0/5] GC: avoid multiple atime updates
@ 2025-02-21 14:01 Christian Ebner
  2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 1/5] datastore: restrict datastores list_images method scope to module Christian Ebner
                   ` (6 more replies)
  0 siblings, 7 replies; 14+ messages in thread
From: Christian Ebner @ 2025-02-21 14:01 UTC (permalink / raw)
  To: pbs-devel

This patches implement the logic to greatly improve the performance
of phase 1 garbage collection by avoiding multiple atime updates on
the same chunk.

Currently, phase 1 GC iterates over all folders in the datastore
looking and collecting all image index files without taking any
logical assumptions (e.g. namespaces, groups, snapshots, ...). This
is to avoid accidentally missing image index files located in
unexpected paths and therefore not marking their chunks as in use,
leading to potential data losses.

This patches improve phase 1 by inserting encountered index image
paths into a data structure which allows to iterate the index files
in a more logical manner, following the same principle as for
incremental backup snapshots. The index files for the same namespace
and group as well as image filename can therefore be consecutevly
inspected.

Further, by keeping track of already seen and therefore updated chunk
atimes, it is now avoided to update the atime over and over again on the
chunks shared by consecutive backup snaphshots.

To give some ballpark figures, this reduced phase 1 garbage collection
on a real world datastore containing some of my backups from around
2 minutes to about 16 seconds.

Christian Ebner (5):
  datastore: restrict datastores list_images method scope to module
  garbage collection: refactor archive type based chunk marking logic
  garbage collection: add structure for optimized image iteration
  garbage collection: allow to keep track of already touched chunks
  fix #5331: garbage collection: avoid multiple chunk atime updates

 pbs-datastore/src/datastore.rs | 204 ++++++++++++++++++++++++++-------
 1 file changed, 160 insertions(+), 44 deletions(-)

-- 
2.39.5



_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel


^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2025-03-10 11:19 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-02-21 14:01 [pbs-devel] [PATCH proxmox-backup 0/5] GC: avoid multiple atime updates Christian Ebner
2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 1/5] datastore: restrict datastores list_images method scope to module Christian Ebner
2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 2/5] garbage collection: refactor archive type based chunk marking logic Christian Ebner
2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 3/5] garbage collection: add structure for optimized image iteration Christian Ebner
2025-03-05 13:47   ` Fabian Grünbichler
2025-03-07  8:24     ` Christian Ebner
2025-03-07  8:53       ` Fabian Grünbichler
2025-03-07  8:59         ` Christian Ebner
2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 4/5] garbage collection: allow to keep track of already touched chunks Christian Ebner
2025-02-21 14:01 ` [pbs-devel] [PATCH proxmox-backup 5/5] fix #5331: garbage collection: avoid multiple chunk atime updates Christian Ebner
2025-02-21 15:35 ` [pbs-devel] [PATCH proxmox-backup 0/5] GC: avoid multiple " Roland
2025-02-21 15:49   ` Christian Ebner
2025-02-22 17:50     ` Roland
2025-03-10 11:18 ` Christian Ebner

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal