From: "DERUMIER, Alexandre via pve-devel" <pve-devel@lists.proxmox.com>
To: "pve-devel@lists.proxmox.com" <pve-devel@lists.proxmox.com>,
	"f.ebner@proxmox.com" <f.ebner@proxmox.com>
Cc: "DERUMIER, Alexandre" <alexandre.derumier@groupe-cyllene.com>
Subject: Re: [pve-devel] [PATCH v2 pve-storage 1/2] lvmplugin: use blkdiscard when supported instead cstream to saferemove drive
Date: Wed, 22 Oct 2025 09:35:32 +0000	[thread overview]
Message-ID: <mailman.185.1761125773.362.pve-devel@lists.proxmox.com> (raw)
In-Reply-To: <c1cf0323-231b-44f4-be1e-cb629c35aa4d@proxmox.com>
[-- Attachment #1: Type: message/rfc822, Size: 17464 bytes --]
From: "DERUMIER, Alexandre" <alexandre.derumier@groupe-cyllene.com>
To: "pve-devel@lists.proxmox.com" <pve-devel@lists.proxmox.com>, "f.ebner@proxmox.com" <f.ebner@proxmox.com>
Subject: Re: [pve-devel] [PATCH v2 pve-storage 1/2] lvmplugin: use blkdiscard when supported instead cstream to saferemove drive
Date: Wed, 22 Oct 2025 09:35:32 +0000
Message-ID: <414f3fba72876b4138458b625275892e059378af.camel@groupe-cyllene.com>
> +           
> file_read_firstline("$sysdir/queue/write_zeroes_max_bytes") // 0;
> +        ($write_zeroes_max_bytes) = $write_zeroes_max_bytes =~
> m/^(\d+)$/; #untaint
> +
> +        if ($write_zeroes_max_bytes == 0) {
>>What if a storage supports discard, but this value is zero? Then we'd
>>fall back to the slow method even if we don't need to.
Do you think that they exist storages supporting discard but not write
zeroes ?
but ok , no problem, I'll change this to really use cstream as fallback
for both (but see my other comments, I think that we can't use on
discard anyway)
> +            }
> +
> +            my $discard_enabled = undef;
> +
> +            if ($scfg->{'saferemove-discard'}) {
> +                my $discard_zeroes_data =
> +                   
> file_read_firstline("$sysdir/queue/discard_zeroes_data") // 0;
>>Are you sure this works? See:
>>https://www.kernel.org/doc/html/v6.17/admin-guide/abi-
>>stable.html#abi-sys-block-disk-queue-discard-zeroes-data
>>
>>"[RO] Will always return 0. Don’t rely on any specific behavior for
>>discards, and don’t read this file."
ah, you are right, is has been removed some year ago
https://git.zx2c4.com/linux-rng/commit/?h=jd/vdso-test-harness&id=48920ff2a5a940cd07d12cc79e4a2c75f1185aee
>>
from what I understand, it was a hack because REQ_OP_WRITE_ZEROES was
not implemented, so it was using discard with zeroing (when it was
possible).
>>Isn't discard_max_hw_bytes the correct one, which also can be used to
>>determine the step size:
>>https://www.kernel.org/doc/html/v6.17/admin-guide/abi-
>>stable.html#abi-sys-block-disk-queue-discard-max-hw-bytes
>>"[RO] Devices that support discard functionality may have internal
>>limits on the number of bytes that can be trimmed or unmapped in a
>>single operation. The discard_max_hw_bytes parameter is set by the
>>device driver to the maximum number of bytes that can be discarded in
>>a
>>single operation. Discard requests issued to the device must not
>>exceed
>>this limit. A discard_max_hw_bytes value of 0 means that the device
>>does
>>not support discard functionality."
mmm, it's not that simple because the queue/discard_zeroes_data  was a
flag to known if the discarded block on the storage are also really
zero filled at same time (RZAT TRIM (return zero after trim).)
I known this was quite buggy because of storage implementation bug,
maybe this is why redhat have removed discard support later
https://access.redhat.com/errata/RHBA-2018:0135
"The kernel no longer supports the /sys/block/dm-
X/queue/discard_zeroes_data file in sysfs. It is therefore no longer
possible to determine whether discarded blocks from a block device
returns zeros or the actual data. Therefore, the virtual machine disk
properties "Wipe After Delete" and "Enable Discard" are no longer
supported at the same time. (BZ#1529305)"
So, maybe we don't need to use discard at all, REQ_OP_WRITE_ZEROES is
enough. (From my test, it's quite fast)
>>And I'm not sure a limit of 32 MiB makes sense then. If the hardware
>>supports much more, it should be fine to use that, or? 
The default of 32MB for zeroing was from redhat benchmark on real san
hardware, balance between zeroing speed and latency for the running vm
workload. 
>>Do we even >>want
>>to consider saferemove-stepsize if discard is supported? Of course
>>depending on what we decide on the description in the schema needs to
>>be
>>adapted.
it could still be possible to use the step-size under disk-queue-
discard-max-hw-bytes. It's really here to avoid pushing too much load
at once.
[-- Attachment #2: Type: text/plain, Size: 160 bytes --]
_______________________________________________
pve-devel mailing list
pve-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
next prev parent reply	other threads:[~2025-10-22  9:35 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20251021140054.3916467-1-alexandre.derumier@groupe-cyllene.com>
2025-10-21 14:00 ` Alexandre Derumier via pve-devel
2025-10-22  8:22   ` Fiona Ebner
2025-10-22  9:35     ` DERUMIER, Alexandre via pve-devel [this message]
     [not found]     ` <414f3fba72876b4138458b625275892e059378af.camel@groupe-cyllene.com>
2025-10-22  9:43       ` Fiona Ebner
2025-10-21 14:00 ` [pve-devel] [PATCH v2 pve-storage 2/2] fix #6941 : lvmplugin : fix activation on secure delete Alexandre Derumier via pve-devel
2025-10-22  9:12   ` Fiona Ebner
2025-10-22  9:35     ` Fiona Ebner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox
  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):
  git send-email \
    --in-reply-to=mailman.185.1761125773.362.pve-devel@lists.proxmox.com \
    --to=pve-devel@lists.proxmox.com \
    --cc=alexandre.derumier@groupe-cyllene.com \
    --cc=f.ebner@proxmox.com \
    /path/to/YOUR_REPLY
  https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
  Be sure your reply has a Subject: header at the top and a blank line
  before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox