From: "Fabian Grünbichler" <f.gruenbichler@proxmox.com>
To: Wolfgang Bumiller <w.bumiller@proxmox.com>
Cc: pbs-devel@lists.proxmox.com
Subject: Re: [pbs-devel] [PATCH proxmox-backup] mapped loop device: use read loop instead of read_exact
Date: Mon, 27 Nov 2023 18:27:53 +0100 (CET) [thread overview]
Message-ID: <1541339416.519.1701106073073@webmail.proxmox.com> (raw)
In-Reply-To: <io2wan7ner6cxpc3i4apratcfrpjsl5rhsdqmfn2yk66y662xn@qqkgskjp5rrl>
> Wolfgang Bumiller <w.bumiller@proxmox.com> hat am 27.11.2023 14:22 CET geschrieben:
>
> On Thu, Jun 29, 2023 at 12:32:13PM +0200, Fabian Grünbichler wrote:
> > since read_exact does not support short reads, which can easily happen if the
> > mapped image's EOF is not aligned with the request size.
> >
> > Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
> > ---
> >
> > Notes:
> > reported on the forum:
> >
> > https://forum.proxmox.com/threads/problem-backing-up-using-backup-client.129347
> >
> > did a quick test reading from a mapped image full of random data, observed
> > no performance difference..
>
> Do you get one if we just drop the loop logic and *actually* just
> `read()` once? IMO this is more in line with what a read syscall
> *should* be doing.
> Further, we use a `CachedChunkReader` under it which actually does a
> read loop anyway, so AFAICT this *can't* make a difference.
with a plain read (+ optional truncate of the reply buf) performance is still the same. but (and I am unfortunately not sure if this is a regression in the meantime, or was also broken back when I originally wrote this patch) access via the loop device actually truncates the resulting data:
- my test input image is 1701838801 bytes long (arbitrary misaligned size, straight from /dev/urandom)
- the fuse session correctly gets this passed in as size
- a regular restore restores as many (correct) bytes
- reading via the loop device with bs=1024 or bs=512 or bs=32 only returns 1701838336 bytes (465 are missing)
-- the fuse requests quickly ramp up to 128k request size (no matter the block size used to read from the loop device)
-- the last fuse read request is for 16384 bytes, but the read from PBS (correctly!) only returns 16337
-- 16337 - 31*512 = 465
-- so it seems the short read result is lost somewhere?
-- reading with O_DIRECT doesn't help (in fact, it tanks performance while still reproducing the issue)
anyhow, this requires further analysis and fixing before being applied in whichever fashion..
next prev parent reply other threads:[~2023-11-27 17:27 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-29 10:32 Fabian Grünbichler
2023-06-29 11:03 ` Dominik Csapak
2023-06-29 11:35 ` Fabian Grünbichler
2023-06-29 11:46 ` Dominik Csapak
2023-11-27 9:53 ` Fabian Grünbichler
2023-11-27 13:22 ` Wolfgang Bumiller
2023-11-27 17:27 ` Fabian Grünbichler [this message]
2023-11-28 10:07 ` Wolfgang Bumiller
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1541339416.519.1701106073073@webmail.proxmox.com \
--to=f.gruenbichler@proxmox.com \
--cc=pbs-devel@lists.proxmox.com \
--cc=w.bumiller@proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox