* [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async
@ 2025-11-26 16:28 Christian Ebner
2025-11-26 18:10 ` Thomas Lamprecht
0 siblings, 1 reply; 5+ messages in thread
From: Christian Ebner @ 2025-11-26 16:28 UTC (permalink / raw)
To: pbs-devel
Blocking the thread is problematic here and must be avoided, so
read the chunk data via tokio::fs::read() instead of std::fs::read()
and make the full loading from filesystem branch async.
Encountered while investigating a user provided backtrace looking for
possible causes of hanging backups reported in [0].
[0] https://forum.proxmox.com/threads/176444/post-819858
Signed-off-by: Christian Ebner <c.ebner@proxmox.com>
---
src/api2/reader/mod.rs | 7 ++++---
1 file changed, 4 insertions(+), 3 deletions(-)
diff --git a/src/api2/reader/mod.rs b/src/api2/reader/mod.rs
index f7adc366f..1e74b0758 100644
--- a/src/api2/reader/mod.rs
+++ b/src/api2/reader/mod.rs
@@ -321,7 +321,7 @@ fn download_chunk(
}
let body = match &env.backend {
- DatastoreBackend::Filesystem => load_from_filesystem(env, &digest)?,
+ DatastoreBackend::Filesystem => load_from_filesystem(env, &digest).await?,
DatastoreBackend::S3(s3_client) => match env.datastore.cache() {
None => fetch_from_object_store(s3_client, &digest).await?,
Some(cache) => {
@@ -357,13 +357,14 @@ async fn fetch_from_object_store(s3_client: &S3Client, digest: &[u8; 32]) -> Res
bail!("cannot find chunk with digest {}", hex::encode(digest));
}
-fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
+async fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
let (path, _) = env.datastore.chunk_path(digest);
let path2 = path.clone();
env.debug(format!("download chunk {path:?}"));
- let data = proxmox_async::runtime::block_in_place(|| std::fs::read(path))
+ let data = tokio::fs::read(path)
+ .await
.map_err(move |err| http_err!(BAD_REQUEST, "reading file {path2:?} failed: {err}"))?;
Ok(Body::from(data))
}
--
2.47.3
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async
2025-11-26 16:28 [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async Christian Ebner
@ 2025-11-26 18:10 ` Thomas Lamprecht
2025-11-27 8:55 ` Christian Ebner
2025-11-27 9:01 ` Fabian Grünbichler
0 siblings, 2 replies; 5+ messages in thread
From: Thomas Lamprecht @ 2025-11-26 18:10 UTC (permalink / raw)
To: Proxmox Backup Server development discussion, Christian Ebner
Am 26.11.25 um 17:28 schrieb Christian Ebner:
> Blocking the thread is problematic here and must be avoided, so
> read the chunk data via tokio::fs::read() instead of std::fs::read()
> and make the full loading from filesystem branch async.
Nothing against that, but "async" here comes a bit with a bigger asterisks,
as:
"This operation is implemented by running the equivalent blocking operation
on a separate thread pool using spawn_blocking."
-- https://docs.rs/tokio/latest/tokio/fs/fn.read.html
So technically async, but not really does any async IO (tokio io uring when? ;)).
The important thing is that it cannot block anything, so it _is_ an OK solution
here, might be nice to adapt the commit message slightly though, e.g. something
like:
...::read() to move the blocking file read in the "full loading from filesystem"
branch to it's own thread pool. Can be done on applying though.
>
> Encountered while investigating a user provided backtrace looking for
> possible causes of hanging backups reported in [0].
>
> [0] https://forum.proxmox.com/threads/176444/post-819858
>
> Signed-off-by: Christian Ebner <c.ebner@proxmox.com>
> ---
> src/api2/reader/mod.rs | 7 ++++---
> 1 file changed, 4 insertions(+), 3 deletions(-)
>
> diff --git a/src/api2/reader/mod.rs b/src/api2/reader/mod.rs
> index f7adc366f..1e74b0758 100644
> --- a/src/api2/reader/mod.rs
> +++ b/src/api2/reader/mod.rs
> @@ -321,7 +321,7 @@ fn download_chunk(
> }
>
> let body = match &env.backend {
> - DatastoreBackend::Filesystem => load_from_filesystem(env, &digest)?,
> + DatastoreBackend::Filesystem => load_from_filesystem(env, &digest).await?,
> DatastoreBackend::S3(s3_client) => match env.datastore.cache() {
> None => fetch_from_object_store(s3_client, &digest).await?,
> Some(cache) => {
> @@ -357,13 +357,14 @@ async fn fetch_from_object_store(s3_client: &S3Client, digest: &[u8; 32]) -> Res
> bail!("cannot find chunk with digest {}", hex::encode(digest));
> }
>
> -fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
> +async fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
> let (path, _) = env.datastore.chunk_path(digest);
> let path2 = path.clone();
>
> env.debug(format!("download chunk {path:?}"));
>
> - let data = proxmox_async::runtime::block_in_place(|| std::fs::read(path))
> + let data = tokio::fs::read(path)
> + .await
> .map_err(move |err| http_err!(BAD_REQUEST, "reading file {path2:?} failed: {err}"))?;
> Ok(Body::from(data))
> }
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async
2025-11-26 18:10 ` Thomas Lamprecht
@ 2025-11-27 8:55 ` Christian Ebner
2025-11-27 9:03 ` Thomas Lamprecht
2025-11-27 9:01 ` Fabian Grünbichler
1 sibling, 1 reply; 5+ messages in thread
From: Christian Ebner @ 2025-11-27 8:55 UTC (permalink / raw)
To: Thomas Lamprecht, Proxmox Backup Server development discussion
On 11/26/25 7:10 PM, Thomas Lamprecht wrote:
> Am 26.11.25 um 17:28 schrieb Christian Ebner:
>> Blocking the thread is problematic here and must be avoided, so
>> read the chunk data via tokio::fs::read() instead of std::fs::read()
>> and make the full loading from filesystem branch async.
>
> Nothing against that, but "async" here comes a bit with a bigger asterisks,
> as:
>
> "This operation is implemented by running the equivalent blocking operation
> on a separate thread pool using spawn_blocking."
> -- https://docs.rs/tokio/latest/tokio/fs/fn.read.html
>
> So technically async, but not really does any async IO (tokio io uring when? ;)).
>
> The important thing is that it cannot block anything, so it _is_ an OK solution
> here, might be nice to adapt the commit message slightly though, e.g. something
> like:
>
> ...::read() to move the blocking file read in the "full loading from filesystem"
> branch to it's own thread pool. Can be done on applying though.
True, fully async is indeed overreaching and incorrect.
Can send a v2 with the commit message adapted if requested. Just to
clarify as this came up in off-list discussion with Fabian. I do not
expect this to be the cause of the issues as reported by the users, so
finding that has priority.
>
>>
>> Encountered while investigating a user provided backtrace looking for
>> possible causes of hanging backups reported in [0].
>>
>> [0] https://forum.proxmox.com/threads/176444/post-819858
>>
>> Signed-off-by: Christian Ebner <c.ebner@proxmox.com>
>> ---
>> src/api2/reader/mod.rs | 7 ++++---
>> 1 file changed, 4 insertions(+), 3 deletions(-)
>>
>> diff --git a/src/api2/reader/mod.rs b/src/api2/reader/mod.rs
>> index f7adc366f..1e74b0758 100644
>> --- a/src/api2/reader/mod.rs
>> +++ b/src/api2/reader/mod.rs
>> @@ -321,7 +321,7 @@ fn download_chunk(
>> }
>>
>> let body = match &env.backend {
>> - DatastoreBackend::Filesystem => load_from_filesystem(env, &digest)?,
>> + DatastoreBackend::Filesystem => load_from_filesystem(env, &digest).await?,
>> DatastoreBackend::S3(s3_client) => match env.datastore.cache() {
>> None => fetch_from_object_store(s3_client, &digest).await?,
>> Some(cache) => {
>> @@ -357,13 +357,14 @@ async fn fetch_from_object_store(s3_client: &S3Client, digest: &[u8; 32]) -> Res
>> bail!("cannot find chunk with digest {}", hex::encode(digest));
>> }
>>
>> -fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
>> +async fn load_from_filesystem(env: &ReaderEnvironment, digest: &[u8; 32]) -> Result<Body, Error> {
>> let (path, _) = env.datastore.chunk_path(digest);
>> let path2 = path.clone();
>>
>> env.debug(format!("download chunk {path:?}"));
>>
>> - let data = proxmox_async::runtime::block_in_place(|| std::fs::read(path))
>> + let data = tokio::fs::read(path)
>> + .await
>> .map_err(move |err| http_err!(BAD_REQUEST, "reading file {path2:?} failed: {err}"))?;
>> Ok(Body::from(data))
>> }
>
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async
2025-11-26 18:10 ` Thomas Lamprecht
2025-11-27 8:55 ` Christian Ebner
@ 2025-11-27 9:01 ` Fabian Grünbichler
1 sibling, 0 replies; 5+ messages in thread
From: Fabian Grünbichler @ 2025-11-27 9:01 UTC (permalink / raw)
To: Christian Ebner, Proxmox Backup Server development discussion
On November 26, 2025 7:10 pm, Thomas Lamprecht wrote:
> Am 26.11.25 um 17:28 schrieb Christian Ebner:
>> Blocking the thread is problematic here and must be avoided, so
>> read the chunk data via tokio::fs::read() instead of std::fs::read()
>> and make the full loading from filesystem branch async.
>
> Nothing against that, but "async" here comes a bit with a bigger asterisks,
> as:
>
> "This operation is implemented by running the equivalent blocking operation
> on a separate thread pool using spawn_blocking."
> -- https://docs.rs/tokio/latest/tokio/fs/fn.read.html
>
> So technically async, but not really does any async IO (tokio io uring when? ;)).
https://github.com/tokio-rs/tokio/pull/7713
:-P
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async
2025-11-27 8:55 ` Christian Ebner
@ 2025-11-27 9:03 ` Thomas Lamprecht
0 siblings, 0 replies; 5+ messages in thread
From: Thomas Lamprecht @ 2025-11-27 9:03 UTC (permalink / raw)
To: Christian Ebner, Proxmox Backup Server development discussion
Am 27.11.25 um 09:54 schrieb Christian Ebner:
> True, fully async is indeed overreaching and incorrect.
>
> Can send a v2 with the commit message adapted if requested. Just to clarify
as this came up in off-list discussion with Fabian. I do not expect this to be
the cause of the issues as reported by the users, so finding that has priority.
Yeah, no worries, this is just a nit.
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2025-11-27 9:03 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-11-26 16:28 [pbs-devel] [PATCH proxmox-backup] api: chunk reader: make reading from filesystem fully async Christian Ebner
2025-11-26 18:10 ` Thomas Lamprecht
2025-11-27 8:55 ` Christian Ebner
2025-11-27 9:03 ` Thomas Lamprecht
2025-11-27 9:01 ` Fabian Grünbichler
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox