public inbox for pve-devel@lists.proxmox.com
 help / color / mirror / Atom feed
From: "Fabian Grünbichler" <f.gruenbichler@proxmox.com>
To: "Dominic Jäger" <d.jaeger@proxmox.com>,
	"Proxmox VE development discussion" <pve-devel@lists.proxmox.com>
Subject: Re: [pve-devel] [PATCH v4 qemu-server] Add API for disk & VM import
Date: Fri, 12 Feb 2021 10:03:50 +0100	[thread overview]
Message-ID: <1613119192.onm1x6fg5c.astroid@nora.none> (raw)
In-Reply-To: <20210211103223.GA34013@mala>

On February 11, 2021 11:32 am, Dominic Jäger wrote:
> Thank you for looking so carefully!
> 
> On Wed, Feb 10, 2021 at 10:40:56AM +0100, Fabian Grünbichler wrote:
>> 
>> On February 5, 2021 11:04 am, Dominic Jäger wrote:
>> > Extend qm importdisk/importovf functionality to the API.
>> >  
>> > @@ -4325,4 +4324,378 @@ __PACKAGE__->register_method({
>> >  	return PVE::QemuServer::Cloudinit::dump_cloudinit_config($conf, $param->{vmid}, $param->{type});
>> >      }});
>> >  
>> > +# Raise exception if $format is not supported by $storageid
>> > +my $check_format_is_supported = sub {
>> > +    my ($format, $storageid) = @_;
>> > +
>> > +    return if !$format;
>> > +
>> > +    my $store_conf = PVE::Storage::config();
>> 
>> it probably makes sense to pass this in as parameter, all call sites 
>> probably have it already ;)
> 
> I just noticed that I have it in importdisk, but unused.  Does it make a
> relevant difference in terms of performance to call ::config() multiple times
> instead of passing it as parameter?

well, it should be the same and fairly cheap since it comes from a cache 
after the first request, but if you trigger a cfs_update() in-between 
you might get two different versions.

> 
>> 
>> > +    }
>> > +};
>> > +
>> > +# paths are returned as is
>> > +# volids are returned as paths
>> > +#
>> > +# Also checks if $original actually exists
>> > +my $convert_to_path = sub {
>> > +	my ($original) = @_;
>> > +	my $volid_as_path = eval { # Nonempty iff $original_source is a volid
>> > +	    PVE::Storage::path(PVE::Storage::config(), $original);
>> > +	};
>> > +	my $result = $volid_as_path || $original ;
>> > +	if (!-e $result) {
>> > +	    die "Could not import because source '$original' does not exist!";
>> > +	}
>> > +	return $result;
>> > +};
>> 
>> what does this do that PVE::Storage::abs_filesystem_path doesn't already 
>> do (except having 'import' in the error message ;))?
> 
> Haven't thought about that function in the last time... 
> I'll have a look at it.
> 
>> ah, further down below I see that -b might be missing for raw block 
>> devices.. maybe it makes sense to allow those in that helper as well? 
>> optionally? call-sites would need to be checked..
>  Would it make sense to not change helper for the moment, certainly not break
> other call-sites and refactor later in a separate patch?

optionally supporting block devices should not be able to break any 
existing call sites ;) but yeah, you can also replace your helper here 
with a fork of abs_filesystem_path that allows -b, that would allow 
simply switching it out if we want to handle that in pve-storage..

>> 
>> > +
>> > +# vmid ... target VM ID
>> > +# source ... absolute path of the source image (volid must be converted before)
>> 
>> that restriction is not actually needed, see below
>> 
>> > +# storage ... target storage for the disk image
>> > +# format ... target format for the disk image (optional)
>> > +#
>> > +# returns ... volid of the allocated disk image (e.g. local-lvm:vm-100-disk-2)
>> > +my $import_disk_image = sub {
>> > +    my ($param) = @_;
>> > +    my $vmid = $param->{vmid};
>> > +    my $requested_format = $param->{format};
>> > +    my $storage = $param->{storage};
>> > +    my $source = $param->{source};
>> > +
>> > +    my $vm_conf = PVE::QemuConfig->load_config($vmid);
>> 
>> could be passed in, the call sites already have it.. and it might allow 
>> to pull in some of the surrounding codes at the call sites that is 
>> similar (config updating, property string building) into this helper
> 
> It's actually unused. I'll remove it.
>> 
>> > +    my $store_conf = PVE::Storage::config();
>> 
>> could also be passed in here
> * see below
>> 
>> > +    if (!$source) {
>> > +	die "It is necessary to pass the source parameter";
>> > +    }
>> 
>> just a check for definedness
> Hmm but if someone does
> $import_disk_image->({vmid => 102, source => "", storage => "local-lvm"});
> then
>> 
>> > +    if ($source !~ m!^/!) {
>> > +	die "source must be an absolute path but is $source";
> will only say "... path but is  at /usr/share/..."?

no, because that check gets removed as well.. $source could be a volid 
at this point

> 
>> 
>> > +    if (!$storage) {
>> > +	die "It is necessary to pass the storage parameter";
>> > +    }
>> 
>> call PVE::Storage::storage_config() here to get both that and the check 
>> that the storage exists..
> Do you mean $store_conf->{ids}->{$storage} ?

no, I mean storage_config() ;)

it checks whether $storage is defined & non-empty, is an actually 
defined-in-storage.cfg storage, and returns that storage's config 
section.

> 
>> 
>> > +
>> > +__PACKAGE__->register_method ({
>> > +    name => 'importdisk',
>> > +    path => '{vmid}/importdisk',
>> > +    method => 'POST',
>> > +    protected => 1, # for worker upid file
>> 
>> huh? no - because we need elevated privileges to allocate disks on 
>> storages..
> Might be left over, I'll check.
>> 
>> > +    proxyto => 'node',
>> > +    description => "Import an external disk image into a VM. The image format ".
>> > +	"has to be supported by qemu-img.",
>> > +    parameters => {
>> > +	additionalProperties => 0,
>> > +	properties => {
>> > +	    node => get_standard_option('pve-node'),
>> > +	    vmid => get_standard_option('pve-vmid',
>> > +		{completion => \&PVE::QemuServer::complete_vmid}),
>> > +	    source => {
>> > +		description => "Disk image to import. Can be a volid ".
>> > +		    "(local-lvm:vm-104-disk-0), an image on a PVE storage ".
>> > +		    "(local:104/toImport.raw) or (for root only) an absolute ".
>> 
>> how does the second work here? and the whole thing is root-only, so that 
>> qualifier at the end is redundant ;)
> 
> The description is bad... I guess local:104/toImport.raw is a volid, too?
> I could just image it to be useful here to allow special volids like
> --source local:99/somedisk.qcow2

it's a volid, but not a valid VM-owned volume that we can do ACL checks 
on.. so it's important to think about where to use `parse_volid` vs 
`parse_volname` (the latter gives you all the checks per storage type, 
and returns type of volume, associated VMID, etc.).

for this series/patch, we can accept a proper vdisk volid if the user 
has access, or any volid/path for root. we can't allow arbitrary volids 
for unprivileged users, as that would allow reading ANY file on the 
storage.

> 
>> 
>> > +	    format => {
>> > +		type => 'string',
>> > +		description => 'Target format.',
>> > +		enum => [ 'raw', 'qcow2', 'vmdk' ],
>> > +		optional => 1,
>> > +	    },
>> 
>> not directly related to this series, but this enum is copied all over 
>> the place and might be defined in one place as standard option 
>> (qm-new-disk-format ?)
> Makes sense to me. Only
>> (qm-new-disk-format ?)
> raw, qcow2 and vmdk are not only for new disks? Those are allowed formats for every directory storage, so maybe something like dir-disk-format?

as separate parameter they are only used when creating/importing/.. NEW 
disks. of course, they are also part of the schema as part of property 
strings. also dir-disk-format would be wrong - just because 
non-dir-storages only support raw(/subvol) doesn't make the parameter 
invalid for them ;)

>> 
>> > +	    digest => get_standard_option('pve-config-digest'),
>> > +	},
>> > +    },
>> > +    returns => { type => 'string'},
>> > +    code => sub {
>> > +	my ($param) = @_;
>> > +	my $vmid = extract_param($param, 'vmid');
>> > +	my $node = extract_param($param, 'node');
>> > +	my $original_source = extract_param($param, 'source');
>> 
>> not sure why this gets this name, it's just passed once and not used 
>> afterwards..
> It was useful previously... Changed it now.
>> 
>> > +	my $digest = extract_param($param, 'digest');
>> > +	my $device_options = extract_param($param, 'device_options');
>> 
>> IMHO this one needs to be parsed - e.g., by adding a fake volume with 
>> the syntax used in importvm at the front and parsing it according to the 
>> disk schema..
>> 
>> both to catch bogus stuff early, and to make the handling below more 
>> robust w.r.t. future changes..
> 
> OK. The API currently allows any string. Would it be worth to change that then?

not sure whether that's trivially possible - the options are not uniform 
across bus / device types? you could have a 'everything optional 
superset of all disk device types' schema, that would be less strict 
than the final check but better than nothing? looking at 
PVE::QemuServer::Drive it almost exists in $alldrive_fmt, so cloning 
that and removing the file property could work?

>> > +	    if ($format_explicit && $format_device_option) {
>> > +		raise_param_exc({format => "Disk format may be specified only once!"});
>> 
>> format already specified in device_options?
> Also good for me.
> 
>> 
>> > +	    }
>> > +	}
>> > +	my $format = $format_explicit || $format_device_option;
>> 
>> since both of those are not needed after this point, this could just be 
>> returned / set by the above if, dropping the extra variables in the 
>> outer scope..
> Do you mean with a small function?
> my $get_format = sub {
> 	...
> 	return $explicit || $device_options
> }
> my $format = $get_format($device_options, extract_param($param, 'format'));

I think a function is overkill for a single use, just

my $format;
if (...) {
  // decide and set $format
}

>> 
>> > +	$check_format_is_supported->($format, $storeid);
>> > +
>> > +	my $locked = sub {
>> > +	    my $conf = PVE::QemuConfig->load_config($vmid);
>> > +	    PVE::Tools::assert_if_modified($conf->{digest}, $digest);
>> > +
>> > +	    if ($device && $conf->{$device}) {
>> > +		die "Could not import because device $device is already in ".
>> > +		"use in VM $vmid. Choose a different device!";
>> 
>> both of these checks might make sense outside of the worker already to 
>> give immediate feedback in the API response..
> I wanted to do this but didn't know what to do with the lock.
> If I check first and lock later then the config could already have changed?
> Or check twice?
> 1. outside the worker, because most times it is OK and gives a nice error and
> 2. inside the worker, to be really sure?
> 
> Or lock outside the worker? I'll have to try what is actually possible...

check before, compare digest after fork + lock + reload should handle any 
changes in-between?

then you only need to duplicate the digest check.

> 
>> > +	    }
>> 
>> > +
>> > +	my $msg = "There must be exactly as many devices specified as there " .
>> > +	    " are devices in the diskimage parameter.\n For example for " .
>> > +	    "--scsi0 local-lvm:0,discard=on --scsi1 local:0,cache=unsafe " .
>> > +	    "there must be --diskimages scsi0=/source/path,scsi1=/other/path";
>> > +	my $device_count = grep { $use_import->($_) } keys %$param;
>> 
>> why though? isn't it sufficient to say there must be a matching device for 
>> each diskimages entry and vice-versa?
> That's what I meant, but your formulation is better.

it also allows you to give concrete error messages if you do one-to-one 
mapping:

"scsi0 configured for disk import, but no matching import source given"
"virtio0 not configured for disk import, but import source '...' given"

> 
>> 
>> > +	}
>> > +
>> > +	my $worker = sub {
>> > +	    eval { PVE::QemuConfig->create_and_lock_config($vmid, 0, 'import') };
>> > +	    die "Unable to create config for VM import: $@" if $@;
>> 
>> should happen outside the worker (so that a VMID clash is detected 
>> early). inside the worker you just lock and load, then check that the 
>> 'import' lock is still there..
>> 
>> we could also filter out disk parameters, and call update_vm_api with 
>> the rest here (those should be fast, but potentially in the future could 
>> run into permission issues that would be nice to return early before 
>> doing a massive disk conversion that has to be undone afterwards..
>> 
>> > +
>> > +	    my @volids_of_imported = ();
>> > +	    eval { foreach my $opt (keys %$param) {
>> 
>> this could then just iterate over the disk parameters
> I thought about that, too. But I didn't take future permission issues into account. And then having a single loop and no filters seemed like the easier-to-read option to me.
> 
> I'll change it.
>> 
>> > +		next if ($opt eq 'start');
>> > +
>> > +		my $updated_value;
>> > +		if ($use_import->($opt)) {
>> > +		    # $opt is bus/device like ide0, scsi5
>> > +
>> > +		    my $device = PVE::QemuServer::parse_drive($opt, $param->{$opt});
>> 
>> $drive? $device is used in this very patch for something else ;)
> Right.
> 
> 
> "OK, will fix" to everything that I haven't mentioned. I certainly read it.




  reply	other threads:[~2021-02-12  9:04 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-02-05 10:04 Dominic Jäger
2021-02-05 10:04 ` [pve-devel] [PATCH v4 manager] gui: Add import wizard for disk & VM Dominic Jäger
2021-02-10  9:49   ` Fabian Grünbichler
2021-03-08 11:38     ` Dominic Jäger
2021-03-08 12:39       ` Fabian Grünbichler
2021-02-10  9:40 ` [pve-devel] [PATCH v4 qemu-server] Add API for disk & VM import Fabian Grünbichler
2021-02-11 10:32   ` Dominic Jäger
2021-02-12  9:03     ` Fabian Grünbichler [this message]
2021-02-12 11:13       ` Dominic Jäger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1613119192.onm1x6fg5c.astroid@nora.none \
    --to=f.gruenbichler@proxmox.com \
    --cc=d.jaeger@proxmox.com \
    --cc=pve-devel@lists.proxmox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal