public inbox for pbs-devel@lists.proxmox.com
 help / color / mirror / Atom feed
From: "Hannes Laimer" <h.laimer@proxmox.com>
To: "Proxmox Backup Server development discussion"
	<pbs-devel@lists.proxmox.com>
Cc: "pbs-devel" <pbs-devel-bounces@lists.proxmox.com>
Subject: Re: [pbs-devel] [PATCH proxmox{, -backup} v10 00/49] fix #2943: S3 storage backend for datastores
Date: Tue, 22 Jul 2025 10:14:33 +0200	[thread overview]
Message-ID: <DBIFMRPTT6KF.2UKFU0Q147M1Z@proxmox.com> (raw)
In-Reply-To: <20250721164507.1045869-1-c.ebner@proxmox.com>

Both the max S3 object key length check and the leftover `.unwrap()`
look good now!

The rather small changes from v9 -> v10 also look fine, consider this

Reviewed-by: Hannes Laimer <h.laimer@proxmox.com>


On Mon Jul 21, 2025 at 6:44 PM CEST, Christian Ebner wrote:
> Disclaimer: These patches are still in an experimental state and not
> intended for production use.
>
> This patch series aims to add S3 compatible object stores as storage
> backend for PBS datastores. A PBS local cache store using the regular
> datastore layout is used for faster operation, bypassing requests to
> the S3 api when possible. Further, the local cache store allows to
> keep frequently used chunks and is used to avoid expensive metadata
> updates on the object store, e.g. by using local marker file during
> garbage collection.
>
> Backups are created by upload chunks to the corresponding S3 bucket,
> while keeping the index files in the local cache store, on backup
> finish, the snapshot metadata are persisted to the S3 storage backend.
>
> Snapshot restores read chunks preferably from the local cache store,
> downloading and insterting them if not present from the S3 object
> store. Listing and snapsoht metadata operation currently rely soly on
> the local cache store.
>
> Currently chunks use a 1:1 mapping to S3 objects. An advanced packing
> mechanism for chunks to significantly reduce the number of api
> requests and therefore be more cost effective will be implemented as
> followup patches.
>
> Most notably changes since version 8 of the patches (thanks @Lukas for
> more testing and debugging and Hannes for review):
> - Moved s3 refresh button to the datastore content, and made it less
>   visible by placing it into a 'More' dropdown.
> - Added basic unit tests for s3 object key helpers
> - Extend missing public function and type documentation
> - Use pbs_buildcfg::configdir macro to build config paths
> - Mostly refactoring based on feedback, please refer to the per-patch
>   changelog for details.
> - Fixes typos in docs (thanks @Maximiliano)
>
> Most notably changes since version 8 of the patches (thanks @Lukas for
> code review):
> - Moved s3 refresh button to the datastore content, and made it less
>   visible by placing it into a 'More' dropdown.
> - Added basic unit tests for s3 object key helpers
> - Extend missing public function and type documentation
> - Use pbs_buildcfg::configdir macro to build config paths
> - Mostly refactoring based on feedback, please refer to the per-patch
>   changelog for details.
> - Fixes typos in docs (thanks @Maximiliano)
>
> Most notably changes since version 7 of the patches (thanks @Thomas
> and @Lukas for feedback, testing and debugging):
> - Improve self-signed certificate fingerprint check, verify valid
>   expected fingerprint is passed to client on instantiation.
> - Rename previously missed host to endpoint is s3 client selector
> - Use more specific `S3 Client ID` over ambiguous `Unique Identifier`
> - Implement missing cli commands for s3 client manipulation
> - Add in-use marker to s3 object stores to avoid accitental reuse of
>   object stores which are already used as datastore by another
>   instance, adding also flags to overwrite.
> - Automatically perform an s3-refresh when recreating a datastore,
>   pre-populating the contents without further user interaction.
> - Add documentation
> - Fix formatting issue in proxmox-s3-client
>
> Most notably changes since version 6 of the patches (thanks @Thomas
> for feedback):
> - Reworked uri encoding logic, instead of doing this in the
>   S3ObjectKey, perform this in the build_uri helper used by all
>   client api requests.
> - Add cache-size optional parameter to datastore backend config,
>   allows to define the local datastore LRU cache capacity.
> - Increase s3 client timeout, as otherwise delete objects operations
>   on Cloudflare R2 would run into a timeout error.
> - Also upload client log, previously not uploaded to s3 backend.
> - Add missing documentation to some pub types in the response reader
> - Use s3 object key generation helper for index file upload, which
>   fixes the missing key prefix.
> - Add basic regression tests for uri encoder and decoder helper
>   functions.
> - Include some baseline performance tests for garbage collection as
>   well as chunk up-/download when caching.
>
> Most notably changes since version 5 of the patches (thanks @Thomas
> for feedback):
> - Move s3 client into its own, dedicated crate in the proxmox repo
> - Factor out any directly PBS related code from the client
> - Guard implementation behind feature cfg, so api types can be used
>   independently
> - Add basic example and extend on crate documentation
>
> Most notably changes since version 4 of the patches:
> - Fix race between S3 backend upload and local cache store insert,
>   avoiding possibly chunk loss for concurrent backups.
> - Use the local datastore cache also for local chunk reader instances
> - Fallback to fetching chunks from S3 backend if they should be cached
>   but the local chunk file is missing or empty, instead of failing
> - Rename chunks detected as corrupt also on the S3 object store
> - Retry chunk uploads via put objects in case of errors.
> - Add possibility to add rate limits for the s3 client put requests, as
>   otherwise object stores can be overloaded.
> - Allow for Cloudflare R2 compatible `auto` region, as otherwise AWS
>   sign v4 request authentication will fail
> - Use `Async` instead of `Sync` variant for the api handler of the
>   s3-refresh command, as otherwise this fails.
> - Take into account that some type folders might not be present when
>   performing an s3-refresh.
> - Use `Local` instead of `Regular` to refer to normal datastores in the
>   creation window.
>
> Most notably changes since version 3 of the patches:
> - Rebased onto current master, fixed incompatibilities with upgraded
>   dependencies
> - Added method to uri decode s3 object keys, as they are required in
>   order to download contents to a local store
> - Added api endpoint to allow resyncing of the datastore contents to
>   the local cache store, introducing a new maintenance mode s3-refresh
>   to guarantee consistency.
>
> Most notably changes since RFC version 2 of the patches (thanks
> @Lukas for feedback):
> - Extend S3 client implementation to also support path style bucket
>   addressing.
> - Keep bucket name as config option for the datastore, allowing more
>   flexible reuse of a configured S3 client.
> - Use the datastore name as additional object key prefix to allow for
>   multiple datastores on the same bucket.
> - Allow bucket and region templating in S3 endpoint, making this more
>   flexible with respect to possible DNS records.
> - Rework datastore create window to be less overloaded.
> - Drop dead code in the S3 client implementation, since tagging and
>   object copying is currently not required.
> - Fix missing locking when deleting chunks from s3 store during
>   garbage collection, avoiding possible chunk loss for concurrent
>   backups.
> - Remove chunks from LRU cache when deleting chunks during garbage
>   collection, avoiding possible chunk loss for concurrent backups.
> - Add dedicated types for object prefix and relative s3 key paths to
>   avoid misuse.
> - Use more fitting icon for S3 client.
>
> Link to the bugtracker issue:
> https://bugzilla.proxmox.com/show_bug.cgi?id=2943
>
> Steps to setup a local S3 object store using RADOS gateway or MinIO
> can be found at (internal only, external users might use the steps
> outlined in the cover letter and comments of RFC version 2):
> https://wiki.intra.proxmox.com/PBS_Setup_S3_Object_Store
>
> proxmox:
>
> Christian Ebner (3):
>   pbs-api-types: extend datastore config by backend config enum
>   pbs-api-types: maintenance: add new maintenance mode S3 refresh
>   s3 client: Add missing S3 object key max length check
>
>  Cargo.toml                              |   1 +
>  pbs-api-types/Cargo.toml                |   1 +
>  pbs-api-types/debian/control            |   2 +
>  pbs-api-types/src/datastore.rs          | 114 +++++++++++++++++++++++-
>  pbs-api-types/src/maintenance.rs        |   4 +
>  proxmox-s3-client/examples/s3_client.rs |   4 +-
>  proxmox-s3-client/src/object_key.rs     |  26 ++++--
>  7 files changed, 142 insertions(+), 10 deletions(-)
>
>
> proxmox-backup:
>
> Christian Ebner (46):
>   datastore: add helpers for path/digest to s3 object key conversion
>   config: introduce s3 object store client configuration
>   api: config: implement endpoints to manipulate and list s3 configs
>   api: datastore: check s3 backend bucket access on datastore create
>   api/cli: add endpoint and command to check s3 client connection
>   datastore: allow to get the backend for a datastore
>   api: backup: store datastore backend in runtime environment
>   api: backup: conditionally upload chunks to s3 object store backend
>   api: backup: conditionally upload blobs to s3 object store backend
>   api: backup: conditionally upload indices to s3 object store backend
>   api: backup: conditionally upload manifest to s3 object store backend
>   api: datastore: conditionally upload client log to s3 backend
>   sync: pull: conditionally upload content to s3 backend
>   api: reader: fetch chunks based on datastore backend
>   datastore: local chunk reader: read chunks based on backend
>   verify worker: add datastore backed to verify worker
>   verify: implement chunk verification for stores with s3 backend
>   datastore: create namespace marker in s3 backend
>   datastore: create/delete protected marker file on s3 storage backend
>   datastore: prune groups/snapshots from s3 object store backend
>   datastore: get and set owner for s3 store backend
>   datastore: implement garbage collection for s3 backend
>   ui: add datastore type selector and reorganize component layout
>   ui: add s3 client edit window for configuration create/edit
>   ui: add s3 client view for configuration
>   ui: expose the s3 client view in the navigation tree
>   ui: add s3 client selector and bucket field for s3 backend setup
>   tools: lru cache: add removed callback for evicted cache nodes
>   tools: async lru cache: implement insert, remove and contains methods
>   datastore: add local datastore cache for network attached storages
>   api: backup: use local datastore cache on s3 backend chunk upload
>   api: reader: use local datastore cache on s3 backend chunk fetching
>   datastore: local chunk reader: get cached chunk from local cache store
>   backup writer: refactor parameters into backup writer options struct
>   api: backup: add no-cache flag to bypass local datastore cache
>   api/datastore: implement refresh endpoint for stores with s3 backend
>   cli: add dedicated subcommand for datastore s3 refresh
>   ui: render s3 refresh as valid maintenance type and task description
>   ui: expose s3 refresh button for datastores backed by object store
>   datastore: conditionally upload atime marker chunk to s3 backend
>   bin: implement client subcommands for s3 configuration manipulation
>   bin: expose reuse-datastore flag for proxmox-backup-manager
>   datastore: mark store as in-use by setting marker on s3 backend
>   datastore: run s3-refresh when reusing a datastore with s3 backend
>   api/ui: add flag to allow overwriting in-use marker for s3 backend
>   docs: Add section describing how to setup s3 backed datastore
>
>  Cargo.toml                                    |   2 +
>  docs/storage.rst                              |  73 ++
>  examples/upload-speed.rs                      |  17 +-
>  pbs-client/src/backup_writer.rs               |  47 +-
>  pbs-config/Cargo.toml                         |   1 +
>  pbs-config/src/lib.rs                         |   1 +
>  pbs-config/src/s3.rs                          |  89 +++
>  pbs-datastore/Cargo.toml                      |   5 +
>  pbs-datastore/src/backup_info.rs              |  63 +-
>  pbs-datastore/src/cached_chunk_reader.rs      |   6 +-
>  pbs-datastore/src/chunk_store.rs              |  29 +-
>  pbs-datastore/src/datastore.rs                | 696 ++++++++++++++++--
>  pbs-datastore/src/dynamic_index.rs            |   1 +
>  pbs-datastore/src/lib.rs                      |   5 +
>  pbs-datastore/src/local_chunk_reader.rs       |  66 +-
>  .../src/local_datastore_lru_cache.rs          | 180 +++++
>  pbs-datastore/src/s3.rs                       | 114 +++
>  pbs-tools/src/async_lru_cache.rs              |  46 +-
>  pbs-tools/src/lru_cache.rs                    |  42 +-
>  proxmox-backup-client/src/benchmark.rs        |  17 +-
>  proxmox-backup-client/src/main.rs             |  26 +-
>  src/api2/admin/datastore.rs                   |  97 ++-
>  src/api2/admin/mod.rs                         |   2 +
>  src/api2/admin/s3.rs                          |  83 +++
>  src/api2/backup/environment.rs                |  82 ++-
>  src/api2/backup/mod.rs                        | 131 ++--
>  src/api2/backup/upload_chunk.rs               | 114 ++-
>  src/api2/config/datastore.rs                  | 149 +++-
>  src/api2/config/mod.rs                        |   2 +
>  src/api2/config/s3.rs                         | 307 ++++++++
>  src/api2/node/disks/directory.rs              |   2 +-
>  src/api2/node/disks/zfs.rs                    |   2 +-
>  src/api2/reader/environment.rs                |  12 +-
>  src/api2/reader/mod.rs                        |  62 +-
>  src/backup/verify.rs                          | 138 +++-
>  src/bin/proxmox-backup-manager.rs             |   1 +
>  src/bin/proxmox_backup_manager/datastore.rs   |  42 ++
>  src/bin/proxmox_backup_manager/mod.rs         |   2 +
>  src/bin/proxmox_backup_manager/s3.rs          | 102 +++
>  src/server/pull.rs                            |  76 +-
>  src/server/push.rs                            |  27 +-
>  src/server/sync.rs                            |  22 +-
>  src/server/verify_job.rs                      |   2 +-
>  www/Makefile                                  |   3 +
>  www/NavigationTree.js                         |   6 +
>  www/Utils.js                                  |   4 +
>  www/config/S3ClientView.js                    | 141 ++++
>  www/datastore/Content.js                      |  48 ++
>  www/form/S3ClientSelector.js                  |  33 +
>  www/window/DataStoreEdit.js                   | 132 +++-
>  www/window/MaintenanceOptions.js              |   6 +-
>  www/window/S3ClientEdit.js                    | 148 ++++
>  52 files changed, 3151 insertions(+), 353 deletions(-)
>  create mode 100644 pbs-config/src/s3.rs
>  create mode 100644 pbs-datastore/src/local_datastore_lru_cache.rs
>  create mode 100644 pbs-datastore/src/s3.rs
>  create mode 100644 src/api2/admin/s3.rs
>  create mode 100644 src/api2/config/s3.rs
>  create mode 100644 src/bin/proxmox_backup_manager/s3.rs
>  create mode 100644 www/config/S3ClientView.js
>  create mode 100644 www/form/S3ClientSelector.js
>  create mode 100644 www/window/S3ClientEdit.js
>
>
> Summary over all repositories:
>   59 files changed, 3293 insertions(+), 363 deletions(-)



_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel


  parent reply	other threads:[~2025-07-22  8:13 UTC|newest]

Thread overview: 56+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-07-21 16:44 Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox v10 1/3] pbs-api-types: extend datastore config by backend config enum Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox v10 2/3] pbs-api-types: maintenance: add new maintenance mode S3 refresh Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox v10 3/3] s3 client: Add missing S3 object key max length check Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 01/46] datastore: add helpers for path/digest to s3 object key conversion Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 02/46] config: introduce s3 object store client configuration Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 03/46] api: config: implement endpoints to manipulate and list s3 configs Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 04/46] api: datastore: check s3 backend bucket access on datastore create Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 05/46] api/cli: add endpoint and command to check s3 client connection Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 06/46] datastore: allow to get the backend for a datastore Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 07/46] api: backup: store datastore backend in runtime environment Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 08/46] api: backup: conditionally upload chunks to s3 object store backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 09/46] api: backup: conditionally upload blobs " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 10/46] api: backup: conditionally upload indices " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 11/46] api: backup: conditionally upload manifest " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 12/46] api: datastore: conditionally upload client log to s3 backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 13/46] sync: pull: conditionally upload content " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 14/46] api: reader: fetch chunks based on datastore backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 15/46] datastore: local chunk reader: read chunks based on backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 16/46] verify worker: add datastore backed to verify worker Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 17/46] verify: implement chunk verification for stores with s3 backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 18/46] datastore: create namespace marker in " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 19/46] datastore: create/delete protected marker file on s3 storage backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 20/46] datastore: prune groups/snapshots from s3 object store backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 21/46] datastore: get and set owner for s3 " Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 22/46] datastore: implement garbage collection for s3 backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 23/46] ui: add datastore type selector and reorganize component layout Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 24/46] ui: add s3 client edit window for configuration create/edit Christian Ebner
2025-07-21 20:14   ` Thomas Lamprecht
2025-07-22  6:24     ` Christian Ebner
2025-07-22  7:00       ` Thomas Lamprecht
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 25/46] ui: add s3 client view for configuration Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 26/46] ui: expose the s3 client view in the navigation tree Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 27/46] ui: add s3 client selector and bucket field for s3 backend setup Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 28/46] tools: lru cache: add removed callback for evicted cache nodes Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 29/46] tools: async lru cache: implement insert, remove and contains methods Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 30/46] datastore: add local datastore cache for network attached storages Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 31/46] api: backup: use local datastore cache on s3 backend chunk upload Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 32/46] api: reader: use local datastore cache on s3 backend chunk fetching Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 33/46] datastore: local chunk reader: get cached chunk from local cache store Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 34/46] backup writer: refactor parameters into backup writer options struct Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 35/46] api: backup: add no-cache flag to bypass local datastore cache Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 36/46] api/datastore: implement refresh endpoint for stores with s3 backend Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 37/46] cli: add dedicated subcommand for datastore s3 refresh Christian Ebner
2025-07-21 16:44 ` [pbs-devel] [PATCH proxmox-backup v10 38/46] ui: render s3 refresh as valid maintenance type and task description Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 39/46] ui: expose s3 refresh button for datastores backed by object store Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 40/46] datastore: conditionally upload atime marker chunk to s3 backend Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 41/46] bin: implement client subcommands for s3 configuration manipulation Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 42/46] bin: expose reuse-datastore flag for proxmox-backup-manager Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 43/46] datastore: mark store as in-use by setting marker on s3 backend Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 44/46] datastore: run s3-refresh when reusing a datastore with " Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 45/46] api/ui: add flag to allow overwriting in-use marker for " Christian Ebner
2025-07-21 16:45 ` [pbs-devel] [PATCH proxmox-backup v10 46/46] docs: Add section describing how to setup s3 backed datastore Christian Ebner
2025-07-22  8:14 ` Hannes Laimer [this message]
2025-07-22  9:29 ` [pbs-devel] [PATCH proxmox{, -backup} v10 00/49] fix #2943: S3 storage backend for datastores Lukas Wagner
2025-07-22 10:13 ` [pbs-devel] superseded: " Christian Ebner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=DBIFMRPTT6KF.2UKFU0Q147M1Z@proxmox.com \
    --to=h.laimer@proxmox.com \
    --cc=pbs-devel-bounces@lists.proxmox.com \
    --cc=pbs-devel@lists.proxmox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal