From: Christian Ebner <c.ebner@proxmox.com>
To: Lukas Wagner <l.wagner@proxmox.com>,
Proxmox Backup Server development discussion
<pbs-devel@lists.proxmox.com>
Subject: Re: [pbs-devel] [PATCH proxmox-backup v8 14/45] api: reader: fetch chunks based on datastore backend
Date: Fri, 18 Jul 2025 12:10:13 +0200
Message-ID: <103d8839-6be4-477d-88e3-49f025d6c3b4@proxmox.com>
In-Reply-To: <da75f209-e056-4e51-8c2a-586f6d7e0784@proxmox.com>
On 7/18/25 12:03 PM, Lukas Wagner wrote:
> On 2025-07-18 11:58, Christian Ebner wrote:
>> On 7/18/25 10:38 AM, Lukas Wagner wrote:
>>> One comment inline, but nothing prohibitive of a R-b:
>>>
>>> Reviewed-by: Lukas Wagner <l.wagner@proxmox.com>
>>>
>>>
>>> On 2025-07-15 14:53, Christian Ebner wrote:
>>>> Read the chunk based on the datastores backend, reading from local
>>>> filesystem or fetching from S3 object store.
>>>>
>>>> Signed-off-by: Christian Ebner <c.ebner@proxmox.com>
>>>> ---
>>>> changes since version 7:
>>>> - no changes
>>>>
>>>> src/api2/reader/environment.rs | 12 ++++++----
>>>> src/api2/reader/mod.rs | 41 +++++++++++++++++++++++-----------
>>>> 2 files changed, 36 insertions(+), 17 deletions(-)
>>>>
>>>> diff --git a/src/api2/reader/environment.rs b/src/api2/reader/environment.rs
>>>> index 3b2f06f43..8924352b0 100644
>>>> --- a/src/api2/reader/environment.rs
>>>> +++ b/src/api2/reader/environment.rs
>>>> @@ -1,13 +1,14 @@
>>>> use std::collections::HashSet;
>>>> use std::sync::{Arc, RwLock};
>>>> +use anyhow::Error;
>>>> use serde_json::{json, Value};
>>>> use proxmox_router::{RpcEnvironment, RpcEnvironmentType};
>>>> use pbs_api_types::Authid;
>>>> use pbs_datastore::backup_info::BackupDir;
>>>> -use pbs_datastore::DataStore;
>>>> +use pbs_datastore::{DataStore, DatastoreBackend};
>>>> use proxmox_rest_server::formatter::*;
>>>> use proxmox_rest_server::WorkerTask;
>>>> use tracing::info;
>>>> @@ -23,6 +24,7 @@ pub struct ReaderEnvironment {
>>>> pub worker: Arc<WorkerTask>,
>>>> pub datastore: Arc<DataStore>,
>>>> pub backup_dir: BackupDir,
>>>> + pub backend: DatastoreBackend,
>>>> allowed_chunks: Arc<RwLock<HashSet<[u8; 32]>>>,
>>>> }
>>>> @@ -33,8 +35,9 @@ impl ReaderEnvironment {
>>>> worker: Arc<WorkerTask>,
>>>> datastore: Arc<DataStore>,
>>>> backup_dir: BackupDir,
>>>> - ) -> Self {
>>>> - Self {
>>>> + ) -> Result<Self, Error> {
>>>> + let backend = datastore.backend()?;
>>>> + Ok(Self {
>>>> result_attributes: json!({}),
>>>> env_type,
>>>> auth_id,
>>>> @@ -43,8 +46,9 @@ impl ReaderEnvironment {
>>>> debug: tracing::enabled!(tracing::Level::DEBUG),
>>>> formatter: JSON_FORMATTER,
>>>> backup_dir,
>>>> + backend,
>>>> allowed_chunks: Arc::new(RwLock::new(HashSet::new())),
>>>> - }
>>>> + })
>>>> }
>>>> pub fn log<S: AsRef<str>>(&self, msg: S) {
>>>> diff --git a/src/api2/reader/mod.rs b/src/api2/reader/mod.rs
>>>> index a77216043..997d9ca77 100644
>>>> --- a/src/api2/reader/mod.rs
>>>> +++ b/src/api2/reader/mod.rs
>>>> @@ -3,6 +3,7 @@
>>>> use anyhow::{bail, format_err, Context, Error};
>>>> use futures::*;
>>>> use hex::FromHex;
>>>> +use http_body_util::BodyExt;
>>>> use hyper::body::Incoming;
>>>> use hyper::header::{self, HeaderValue, CONNECTION, UPGRADE};
>>>> use hyper::http::request::Parts;
>>>> @@ -27,8 +28,9 @@ use pbs_api_types::{
>>>> };
>>>> use pbs_config::CachedUserInfo;
>>>> use pbs_datastore::index::IndexFile;
>>>> -use pbs_datastore::{DataStore, PROXMOX_BACKUP_READER_PROTOCOL_ID_V1};
>>>> +use pbs_datastore::{DataStore, DatastoreBackend, PROXMOX_BACKUP_READER_PROTOCOL_ID_V1};
>>>> use pbs_tools::json::required_string_param;
>>>> +use proxmox_s3_client::S3Client;
>>>> use crate::api2::backup::optional_ns_param;
>>>> use crate::api2::helpers;
>>>> @@ -162,7 +164,7 @@ fn upgrade_to_backup_reader_protocol(
>>>> worker.clone(),
>>>> datastore,
>>>> backup_dir,
>>>> - );
>>>> + )?;
>>>> env.debug = debug;
>>>> @@ -323,17 +325,10 @@ fn download_chunk(
>>>> ));
>>>> }
>>>> - let (path, _) = env.datastore.chunk_path(&digest);
>>>> - let path2 = path.clone();
>>>> -
>>>> - env.debug(format!("download chunk {:?}", path));
>>>> -
>>>> - let data =
>>>> - proxmox_async::runtime::block_in_place(|| std::fs::read(path)).map_err(move |err| {
>>>> - http_err!(BAD_REQUEST, "reading file {:?} failed: {}", path2, err)
>>>> - })?;
>>>> -
>>>> - let body = Body::from(data);
>>>> + let body = match &env.backend {
>>>> + DatastoreBackend::Filesystem => load_from_filesystem(env, &digest)?,
>>>> + DatastoreBackend::S3(s3_client) => fetch_from_object_store(s3_client, &digest).await?,
>>>> + };
>>>> // fixme: set other headers ?
>>>> Ok(Response::builder()
>>>> @@ -345,6 +340,26 @@ fn download_chunk(
>>>> .boxed()
>>>> }
>>>> +async fn fetch_from_object_store(s3_client: &S3Client, digest: &[u8; 32]) -> Result<Body, Error> {
>>>> + let object_key = pbs_datastore::s3::object_key_from_digest(digest)?;
>>>> + if let Some(response) = s3_client.get_object(object_key).await? {
>>>
>>> ^ Do we maybe want some kind of retry-logic for retrieving objects as well? Disregard
>>> in case you implement it in a later patch, I'm reviewing this series patch by patch.
>>
>> While a retry might be of interest in case of intermittent issues, for the time being I would like to refrain from adding one, for the reasons stated in my reply to proxmox-backup patch 0004. If the need for this truly arises, adding it later on should be rather simple. If you already see this as an issue now, I can of course add the retry logic right away.
>
> No, I'm fine with revisiting this later, e.g. after a potential rollout where we have some initial user feedback. It's still
> experimental after all :)
Yes, it could well be that this is needed and makes sense. After all, I
added the put retry logic and rate limiting option only after seeing
that this is indeed an issue: I encountered errors when uploading too
fast to Cloudflare R2 object stores, which seemed to be rather easily
overwhelmed.
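
For illustration only, the kind of retry discussed above could be sketched
roughly as follows. This is a standalone, synchronous sketch under stated
assumptions: `retry_with_backoff` and its parameters are hypothetical names,
not part of proxmox-backup or proxmox-s3-client, and the actual `get_object`
call in the patch is async, so a real implementation would have to await
between attempts instead of blocking the thread.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Hypothetical helper (an assumption for illustration, not an existing API).
/// Runs `op` at least once and at most `max_attempts` times, doubling the
/// delay after every failed attempt. The closure receives the 1-based
/// attempt number.
fn retry_with_backoff<T, E>(
    max_attempts: u32,
    initial_delay: Duration,
    mut op: impl FnMut(u32) -> Result<T, E>,
) -> Result<T, E> {
    let mut delay = initial_delay;
    let mut attempt = 1;
    loop {
        match op(attempt) {
            Ok(value) => return Ok(value),
            // Out of attempts: hand the last error back to the caller.
            Err(err) if attempt >= max_attempts => return Err(err),
            Err(_) => {
                // Back off before the next attempt (blocking sleep, see above).
                sleep(delay);
                delay = delay.saturating_mul(2);
                attempt += 1;
            }
        }
    }
}
```

A caller would wrap the fetch in the closure, e.g.
`retry_with_backoff(3, Duration::from_millis(100), |_| fetch())`, where
`fetch` stands in for whatever retrieves the object. Whether every error
should be retried, or only errors that look transient, is a separate design
decision that this sketch leaves open.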
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
Thread overview: 108+ messages
2025-07-15 12:52 [pbs-devel] [PATCH proxmox{, -backup} v8 00/54] fix #2943: S3 storage backend for datastores Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 1/9] s3 client: add crate for AWS s3 compatible object store client Christian Ebner
2025-07-15 21:13 ` [pbs-devel] partially-applied-series: " Thomas Lamprecht
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 2/9] s3 client: implement AWS signature v4 request authentication Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 3/9] s3 client: add dedicated type for s3 object keys Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 4/9] s3 client: add type for last modified timestamp in responses Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 5/9] s3 client: add helper to parse http date headers Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 6/9] s3 client: implement methods to operate on s3 objects in bucket Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 7/9] s3 client: add example usage for basic operations Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 8/9] pbs-api-types: extend datastore config by backend config enum Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox v8 9/9] pbs-api-types: maintenance: add new maintenance mode S3 refresh Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 01/45] datastore: add helpers for path/digest to s3 object key conversion Christian Ebner
2025-07-18 7:24 ` Lukas Wagner
2025-07-18 8:34 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 02/45] config: introduce s3 object store client configuration Christian Ebner
2025-07-18 7:22 ` Lukas Wagner
2025-07-18 8:37 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 03/45] api: config: implement endpoints to manipulate and list s3 configs Christian Ebner
2025-07-18 7:32 ` Lukas Wagner
2025-07-18 8:40 ` Christian Ebner
2025-07-18 9:07 ` Lukas Wagner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 04/45] api: datastore: check s3 backend bucket access on datastore create Christian Ebner
2025-07-18 7:40 ` Lukas Wagner
2025-07-18 8:55 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 05/45] api/cli: add endpoint and command to check s3 client connection Christian Ebner
2025-07-18 7:43 ` Lukas Wagner
2025-07-18 9:04 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 06/45] datastore: allow to get the backend for a datastore Christian Ebner
2025-07-18 7:52 ` Lukas Wagner
2025-07-18 9:10 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 07/45] api: backup: store datastore backend in runtime environment Christian Ebner
2025-07-18 7:54 ` Lukas Wagner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 08/45] api: backup: conditionally upload chunks to s3 object store backend Christian Ebner
2025-07-18 8:11 ` Lukas Wagner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 09/45] api: backup: conditionally upload blobs " Christian Ebner
2025-07-18 8:13 ` Lukas Wagner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 10/45] api: backup: conditionally upload indices " Christian Ebner
2025-07-18 8:20 ` Lukas Wagner
2025-07-18 9:24 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 11/45] api: backup: conditionally upload manifest " Christian Ebner
2025-07-18 8:26 ` Lukas Wagner
2025-07-18 9:33 ` Christian Ebner
2025-07-15 12:52 ` [pbs-devel] [PATCH proxmox-backup v8 12/45] api: datastore: conditionally upload client log to s3 backend Christian Ebner
2025-07-18 8:28 ` Lukas Wagner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 13/45] sync: pull: conditionally upload content " Christian Ebner
2025-07-18 8:35 ` Lukas Wagner
2025-07-18 9:43 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 14/45] api: reader: fetch chunks based on datastore backend Christian Ebner
2025-07-18 8:38 ` Lukas Wagner
2025-07-18 9:58 ` Christian Ebner
2025-07-18 10:03 ` Lukas Wagner
2025-07-18 10:10 ` Christian Ebner [this message]
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 15/45] datastore: local chunk reader: read chunks based on backend Christian Ebner
2025-07-18 8:45 ` Lukas Wagner
2025-07-18 10:11 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 16/45] verify worker: add datastore backed to verify worker Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 17/45] verify: implement chunk verification for stores with s3 backend Christian Ebner
2025-07-18 8:56 ` Lukas Wagner
2025-07-18 11:45 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 18/45] datastore: create namespace marker in " Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 19/45] datastore: create/delete protected marker file on s3 storage backend Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 20/45] datastore: prune groups/snapshots from s3 object store backend Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 21/45] datastore: get and set owner for s3 " Christian Ebner
2025-07-18 9:25 ` Lukas Wagner
2025-07-18 12:12 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 22/45] datastore: implement garbage collection for s3 backend Christian Ebner
2025-07-18 9:47 ` Lukas Wagner
2025-07-18 14:31 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 23/45] ui: add datastore type selector and reorganize component layout Christian Ebner
2025-07-18 9:55 ` Lukas Wagner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 24/45] ui: add s3 client edit window for configuration create/edit Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 25/45] ui: add s3 client view for configuration Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 26/45] ui: expose the s3 client view in the navigation tree Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 27/45] ui: add s3 client selector and bucket field for s3 backend setup Christian Ebner
2025-07-18 10:02 ` Lukas Wagner
2025-07-19 12:28 ` Christian Ebner
2025-07-22 9:25 ` Lukas Wagner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 28/45] tools: lru cache: add removed callback for evicted cache nodes Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 29/45] tools: async lru cache: implement insert, remove and contains methods Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 30/45] datastore: add local datastore cache for network attached storages Christian Ebner
2025-07-18 11:24 ` Lukas Wagner
2025-07-18 14:59 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 31/45] api: backup: use local datastore cache on s3 backend chunk upload Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 32/45] api: reader: use local datastore cache on s3 backend chunk fetching Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 33/45] datastore: local chunk reader: get cached chunk from local cache store Christian Ebner
2025-07-18 11:36 ` Lukas Wagner
2025-07-18 15:04 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 34/45] api: backup: add no-cache flag to bypass local datastore cache Christian Ebner
2025-07-18 11:41 ` Lukas Wagner
2025-07-18 15:37 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 35/45] api/datastore: implement refresh endpoint for stores with s3 backend Christian Ebner
2025-07-18 12:01 ` Lukas Wagner
2025-07-18 15:51 ` Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 36/45] cli: add dedicated subcommand for datastore s3 refresh Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 37/45] ui: render s3 refresh as valid maintenance type and task description Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 38/45] ui: expose s3 refresh button for datastores backed by object store Christian Ebner
2025-07-18 12:46 ` Lukas Wagner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 39/45] datastore: conditionally upload atime marker chunk to s3 backend Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 40/45] bin: implement client subcommands for s3 configuration manipulation Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 41/45] bin: expose reuse-datastore flag for proxmox-backup-manager Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 42/45] datastore: mark store as in-use by setting marker on s3 backend Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 43/45] datastore: run s3-refresh when reusing a datastore with " Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 44/45] api/ui: add flag to allow overwriting in-use marker for " Christian Ebner
2025-07-15 12:53 ` [pbs-devel] [PATCH proxmox-backup v8 45/45] docs: Add section describing how to setup s3 backed datastore Christian Ebner
2025-07-18 13:14 ` Maximiliano Sandoval
2025-07-18 14:38 ` Christian Ebner
2025-07-18 13:16 ` [pbs-devel] [PATCH proxmox{, -backup} v8 00/54] fix #2943: S3 storage backend for datastores Lukas Wagner
2025-07-19 12:52 ` [pbs-devel] superseded: " Christian Ebner