From: Fiona Ebner <f.ebner@proxmox.com>
To: pve-devel@lists.proxmox.com
Subject: [pve-devel] [PATCH qemu-server v2 30/32] blockdev: add helpers to generate blockdev commandline
Date: Wed, 18 Jun 2025 15:02:07 +0200 [thread overview]
Message-ID: <20250618130209.90649-31-f.ebner@proxmox.com> (raw)
In-Reply-To: <20250618130209.90649-1-f.ebner@proxmox.com>
The drive device and node structure is:
front-end device {ide-hd,scsi-hd,virtio-blk-pci} (id=$drive_id)
- throttle node (node-name=$drive_id)
- format node (node-name=f$encoded-info)
- file node (node-name=e$encoded-info)
The node-name can only be 31 characters long and needs to start with a
letter. The throttle node will stay inserted below the front-end
device. The other nodes might change however, because of drive
mirroring and similar. There currently are no good helpers to
query/walk the block graph via QMP, x-debug-query-block-graph is
experimental and for debugging only. Therefore, necessary information
is encoded in the node name to be able to find it again. In
particular, this is the volume ID, the drive ID and optionally a
snapshot name. As long as the configuration file matches with the
running instance, this is enough to find the correct node for
block operations like mirror and resize.
The 'snapshot' option, for QEMU's snapshot mode, i.e. writes are only
temporary, is not yet supported.
Originally-by: Alexandre Derumier <alexandre.derumier@groupe-cyllene.com>
[FE: split up patch
expand commit message
explicitly test for drivers with aio setting
improve readonly handling
improve CD-ROM handling
fix failure for storage named 'nbd' by always using full regex
improve node name generation
fail when drive->{snapshot} is set]
Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
---
Changes in v2:
* Use json_bool helper.
* Coerce some additional strings by quoting for JSON, just to be sure.
src/PVE/QemuServer/Blockdev.pm | 171 ++++++++++++++++++++++++++++++++-
1 file changed, 170 insertions(+), 1 deletion(-)
diff --git a/src/PVE/QemuServer/Blockdev.pm b/src/PVE/QemuServer/Blockdev.pm
index 4d1266b8..76a00383 100644
--- a/src/PVE/QemuServer/Blockdev.pm
+++ b/src/PVE/QemuServer/Blockdev.pm
@@ -3,7 +3,41 @@ package PVE::QemuServer::Blockdev;
use strict;
use warnings;
-use PVE::QemuServer::Drive;
+use Digest::SHA;
+use Fcntl qw(S_ISBLK S_ISCHR);
+use File::stat;
+
+use PVE::JSONSchema qw(json_bool);
+use PVE::Storage;
+
+use PVE::QemuServer::Drive qw(drive_is_cdrom);
+
+my sub get_node_name {
+ my ($type, $drive_id, $volid, $snap) = @_;
+
+ my $info = "drive=$drive_id,";
+ $info .= "snap=$snap," if defined($snap);
+ $info .= "volid=$volid";
+
+ my $hash = substr(Digest::SHA::sha256_hex($info), 0, 30);
+
+ my $prefix = "";
+ if ($type eq 'fmt') {
+ $prefix = 'f';
+ } elsif ($type eq 'file') {
+ $prefix = 'e';
+ } else {
+ die "unknown node type '$type'";
+ }
+ # node-name must start with an alphabetical character
+ return "${prefix}${hash}";
+}
+
+my sub read_only_json_option {
+ my ($drive, $options) = @_;
+
+ return json_bool($drive->{ro} || drive_is_cdrom($drive) || $options->{'read-only'});
+}
sub generate_throttle_group {
my ($drive) = @_;
@@ -41,4 +75,139 @@ sub generate_throttle_group {
};
}
+sub generate_blockdev_drive_cache {
+ my ($drive, $scfg) = @_;
+
+ my $cache_direct = PVE::QemuServer::Drive::drive_uses_cache_direct($drive, $scfg);
+ return {
+ direct => json_bool($cache_direct),
+ 'no-flush' => json_bool($drive->{cache} && $drive->{cache} eq 'unsafe'),
+ };
+}
+
+sub generate_file_blockdev {
+ my ($storecfg, $drive, $options) = @_;
+
+ my $blockdev = {};
+ my $scfg = undef;
+
+ die "generate_file_blockdev called without volid/path\n" if !$drive->{file};
+ die "generate_file_blockdev called with 'none'\n" if $drive->{file} eq 'none';
+ # FIXME use overlay and new config option to define storage for temp write device
+ die "'snapshot' option is not yet supported for '-blockdev'\n" if $drive->{snapshot};
+
+ my $drive_id = PVE::QemuServer::Drive::get_drive_id($drive);
+
+ if ($drive->{file} eq 'cdrom') {
+ my $path = PVE::QemuServer::Drive::get_iso_path($storecfg, $drive->{file});
+ $blockdev = { driver => 'host_cdrom', filename => "$path" };
+ } elsif ($drive->{file} =~ m|^/|) {
+ my $path = $drive->{file};
+ # The 'file' driver only works for regular files. The check below is taken from
+ # block/file-posix.c:hdev_probe_device() in QEMU. To detect CD-ROM host devices, QEMU issues
+ # an ioctl, while the code here relies on the media=cdrom flag instead.
+ my $st = File::stat::stat($path) or die "stat for '$path' failed - $!\n";
+ my $driver = 'file';
+ if (S_ISCHR($st->mode) || S_ISBLK($st->mode)) {
+ $driver = drive_is_cdrom($drive) ? 'host_cdrom' : 'host_device';
+ }
+ $blockdev = { driver => "$driver", filename => "$path" };
+ } else {
+ my $volid = $drive->{file};
+ my ($storeid) = PVE::Storage::parse_volume_id($volid);
+
+ my $vtype = (PVE::Storage::parse_volname($storecfg, $drive->{file}))[0];
+ die "$drive_id: explicit media parameter is required for iso images\n"
+ if !defined($drive->{media}) && defined($vtype) && $vtype eq 'iso';
+
+ my $storage_opts = { hints => {} };
+ $storage_opts->{hints}->{'efi-disk'} = 1 if $drive->{interface} eq 'efidisk';
+ $storage_opts->{'snapshot-name'} = $options->{'snapshot-name'}
+ if defined($options->{'snapshot-name'});
+ $blockdev = PVE::Storage::qemu_blockdev_options($storecfg, $volid, $storage_opts);
+ $scfg = PVE::Storage::storage_config($storecfg, $storeid);
+ }
+
+ $blockdev->{cache} = generate_blockdev_drive_cache($drive, $scfg);
+
+ my $driver = $blockdev->{driver};
+ # only certain drivers have the aio setting
+ if ($driver eq 'file' || $driver eq 'host_cdrom' || $driver eq 'host_device') {
+ $blockdev->{aio} =
+ PVE::QemuServer::Drive::aio_cmdline_option($scfg, $drive, $blockdev->{cache}->{direct});
+ }
+
+ if (!drive_is_cdrom($drive)) {
+ $blockdev->{discard} = $drive->{discard} && $drive->{discard} eq 'on' ? 'unmap' : 'ignore';
+ $blockdev->{'detect-zeroes'} = PVE::QemuServer::Drive::detect_zeroes_cmdline_option($drive);
+ }
+
+ $blockdev->{'node-name'} =
+ get_node_name('file', $drive_id, $drive->{file}, $options->{'snapshot-name'});
+
+ $blockdev->{'read-only'} = read_only_json_option($drive, $options);
+
+ return $blockdev;
+}
+
+sub generate_format_blockdev {
+ my ($storecfg, $drive, $child, $options) = @_;
+
+ die "generate_format_blockdev called without volid/path\n" if !$drive->{file};
+ die "generate_format_blockdev called with 'none'\n" if $drive->{file} eq 'none';
+
+ my $scfg;
+ my $format;
+ my $volid = $drive->{file};
+ my $drive_id = PVE::QemuServer::Drive::get_drive_id($drive);
+ my ($storeid) = PVE::Storage::parse_volume_id($volid, 1);
+
+ # For PVE-managed volumes, use the format from the storage layer and prevent overrides via the
+ # drive's 'format' option. For unmanaged volumes, fallback to 'raw' to avoid auto-detection by
+ # QEMU.
+ if ($storeid) {
+ $scfg = PVE::Storage::storage_config($storecfg, $storeid);
+ $format = PVE::QemuServer::Drive::checked_volume_format($storecfg, $volid);
+ if ($drive->{format} && $drive->{format} ne $format) {
+ die "drive '$drive->{interface}$drive->{index}' - volume '$volid'"
+ . " - 'format=$drive->{format}' option different from storage format '$format'\n";
+ }
+ } else {
+ $format = $drive->{format} // 'raw';
+ }
+
+ # define cache option on both format && file node like libvirt does
+ my $cache = generate_blockdev_drive_cache($drive, $scfg);
+
+ my $node_name = get_node_name('fmt', $drive_id, $drive->{file}, $options->{'snapshot-name'});
+
+ return {
+ 'node-name' => "$node_name",
+ driver => "$format",
+ file => $child,
+ cache => $cache,
+ 'read-only' => read_only_json_option($drive, $options),
+ };
+}
+
+sub generate_drive_blockdev {
+ my ($storecfg, $drive, $options) = @_;
+
+ my $drive_id = PVE::QemuServer::Drive::get_drive_id($drive);
+
+ die "generate_drive_blockdev called without volid/path\n" if !$drive->{file};
+ die "generate_drive_blockdev called with 'none'\n" if $drive->{file} eq 'none';
+
+ my $child = generate_file_blockdev($storecfg, $drive, $options);
+ $child = generate_format_blockdev($storecfg, $drive, $child, $options);
+
+ # this is the top filter entry point, use $drive-drive_id as nodename
+ return {
+ driver => "throttle",
+ 'node-name' => "drive-$drive_id",
+ 'throttle-group' => "throttle-drive-$drive_id",
+ file => $child,
+ };
+}
+
1;
--
2.39.5
_______________________________________________
pve-devel mailing list
pve-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-devel
next prev parent reply other threads:[~2025-06-18 13:04 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-18 13:01 [pve-devel] [PATCH-SERIES common/qemu-server v2 00/32] preparation for switch to blockdev Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH common v2 01/32] schema: parse property string: support skipping keys Fiona Ebner
2025-06-20 11:00 ` Fabian Grünbichler
2025-06-18 13:01 ` [pve-devel] [PATCH common v2 02/32] json schema: add helper to convert to JSON boolean Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 03/32] buildsys: order Perl source files in QemuServer/Makefile Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 04/32] drive: code cleanup: drop unused $vmid parameter from get_path_and_format() Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 05/32] cfg2cmd: require at least QEMU binary version 6.0 Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 06/32] drive: parse: use hash argument for optional parameters Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 07/32] drive: parse: properly handle dropped properties for restore Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 08/32] drive: remove geometry options gone since QEMU 3.1 Fiona Ebner
2025-06-20 11:03 ` Fabian Grünbichler
2025-06-20 11:20 ` Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 09/32] clone disk: io uring check: fix call to determine cache direct Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 10/32] drive: move storage_allows_io_uring_default() and drive_uses_cache_direct() helpers to drive module Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 11/32] drive: introduce aio_cmdline_option() helper Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 12/32] drive: introduce detect_zeroes_cmdline_option() helper Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 13/32] vm start: assert that migration type is set for 'tcp' migration Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 14/32] introduce StateFile module for state file related helpers Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 15/32] vm start: move state file handling to dedicated module Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 16/32] vm start: move config_to_command() call further down Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 17/32] vm start/commandline: also clean up pci reservation when config_to_command() fails Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 18/32] vm start/commandline: activate volumes before config_to_command() Fiona Ebner
2025-06-20 11:33 ` Fabian Grünbichler
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 19/32] pci: add missing includes Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 20/32] test: add tests for PCI reservations Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 21/32] cfg2cmd: print vga: fix call to print_pcie_addr() Fiona Ebner
2025-06-18 13:01 ` [pve-devel] [PATCH qemu-server v2 22/32] pci: code cleanup: remove superfluous machine type paramater from print_pci_addr Fiona Ebner
2025-06-18 15:19 ` Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 23/32] cfg2cmd: collect optional parameters as a hash Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 24/32] qm: showcmd: never reserve PCI devices Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 25/32] vm devices list: prepare querying block device names for -blockdev Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 26/32] print drive device: explicitly set write-cache starting with machine version 10.0 Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 27/32] print drive device: set {r, w}error front-end properties " Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 28/32] print drive device: don't reference any drive for 'none' " Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [PATCH qemu-server v2 29/32] drive: create a throttle group for each drive " Fiona Ebner
2025-06-18 13:02 ` Fiona Ebner [this message]
2025-06-18 13:02 ` [pve-devel] [RFC qemu-server v2 31/32] blockdev: add support for NBD paths Fiona Ebner
2025-06-18 13:02 ` [pve-devel] [RFC qemu-server v2 32/32] command line: switch to blockdev starting with machine version 10.0 Fiona Ebner
2025-06-23 9:12 ` DERUMIER, Alexandre via pve-devel
2025-06-23 9:31 ` Fiona Ebner
2025-06-23 13:06 ` DERUMIER, Alexandre via pve-devel
2025-06-20 13:03 ` [pve-devel] partially-applied: [PATCH-SERIES common/qemu-server v2 00/32] preparation for switch to blockdev Fabian Grünbichler
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250618130209.90649-31-f.ebner@proxmox.com \
--to=f.ebner@proxmox.com \
--cc=pve-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal