From: Stefan Reiter
To: pve-devel@lists.proxmox.com
Cc: d.csapak@proxmox.com, w.bumiller@proxmox.com
Date: Mon, 19 Oct 2020 14:18:35 +0200
Message-Id: <20201019121842.20277-1-s.reiter@proxmox.com>
Subject: [pve-devel] [PATCH v2 0/7] Handle guest shutdown during backups
List-Id: Proxmox VE development discussion

Use QEMU's -no-shutdown argument so that the QEMU instance stays alive
even if the guest shuts down. This allows running backups to continue.

To handle cleanup of QEMU processes, this series extends qmeventd to
handle SHUTDOWN events not just for detecting guest-triggered shutdowns,
but also to clean up the QEMU process via SIGTERM (which quits it even
with -no-shutdown enabled).

A VZDump instance can then signal qmeventd (via /var/run/qmeventd.sock)
to keep certain VM processes alive while they are being backed up. Once
the backup is done, VZDump closes its connection to the socket, and
qmeventd knows that it can now safely kill the VM (as long as the guest
hasn't booted again, which is possible with some changes to the vm_start
code also done in this series).

This series requires a lot of testing, since there can be quite a few
edge cases lurking. So far it has been working well for me, aside from
the VNC GUI looking a bit confused when you shut down a guest during a
backup (i.e. the last image from the framebuffer stays in the VNC
window, which looks more like the guest has crashed than shut down). I
haven't found a solution for that yet.
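
To illustrate the keep-alive rule described above, here is a minimal
sketch in C. It is not the code from this series: the struct, the enum
and all function names are hypothetical, and real socket and QMP
handling is left out. It only models the decision of when a QEMU process
may be terminated: after the guest has shut down and once no backup
client holds a connection for that VM.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model of the per-VM state a daemon like qmeventd tracks. */
enum vm_action { VM_KEEP, VM_TERMINATE };

struct vm_state {
    bool guest_shut_down; /* SHUTDOWN seen and guest not started again */
    int backup_clients;   /* open keep-alive connections for this VM */
};

/* Terminate only if the guest is down and no backup still needs QEMU. */
enum vm_action decide(const struct vm_state *vm)
{
    assert(vm->backup_clients >= 0);
    if (!vm->guest_shut_down || vm->backup_clients > 0)
        return VM_KEEP;
    return VM_TERMINATE;
}

/* QEMU reported a SHUTDOWN event for this VM. */
enum vm_action on_shutdown_event(struct vm_state *vm)
{
    vm->guest_shut_down = true;
    return decide(vm);
}

/* A backup client closed its connection to the socket. */
enum vm_action on_backup_disconnect(struct vm_state *vm)
{
    if (vm->backup_clients > 0)
        vm->backup_clients--;
    return decide(vm);
}

/* The guest was started again while the QEMU process was kept alive. */
void on_guest_restart(struct vm_state *vm)
{
    vm->guest_shut_down = false;
}
```

In this model, a shutdown during a backup yields VM_KEEP, and the later
disconnect yields VM_TERMINATE unless on_guest_restart() was called in
between.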

v2:
* use a pidfd (see `man pidfd_open`, though the manpage does not seem to
  be available on Debian atm - I suppose since they don't support kernel
  5.3 yet?), fallback to regular racy kill() included for people running
  older kernels
* initialize client->type with CLIENT_NONE instead of client->state
* rebase on latest master


qemu-server:

Stefan Reiter (6):
  qmeventd: add handling for -no-shutdown QEMU instances
  qmeventd: add last-ditch effort SIGKILL cleanup
  vzdump: connect to qmeventd for duration of backup
  vzdump: use dirty bitmap for not running VMs too
  config_to_command: use -no-shutdown option
  fix vm_resume and allow vm_start with QMP status 'shutdown'

 PVE/QemuServer.pm                            |  25 +-
 PVE/VZDump/QemuServer.pm                     |  40 +-
 debian/control                               |   1 +
 qmeventd/Makefile                            |   4 +-
 qmeventd/qmeventd.c                          | 364 ++++++++++++++++--
 qmeventd/qmeventd.h                          |  67 +++-
 test/cfg2cmd/bootorder-empty.conf.cmd        |   1 +
 test/cfg2cmd/bootorder-legacy.conf.cmd       |   1 +
 test/cfg2cmd/bootorder.conf.cmd              |   1 +
 .../custom-cpu-model-defaults.conf.cmd       |   1 +
 .../custom-cpu-model-host-phys-bits.conf.cmd |   1 +
 test/cfg2cmd/custom-cpu-model.conf.cmd       |   1 +
 test/cfg2cmd/efi-raw-old.conf.cmd            |   1 +
 test/cfg2cmd/efi-raw.conf.cmd                |   1 +
 test/cfg2cmd/i440fx-win10-hostpci.conf.cmd   |   1 +
 test/cfg2cmd/minimal-defaults.conf.cmd       |   1 +
 test/cfg2cmd/netdev.conf.cmd                 |   1 +
 test/cfg2cmd/pinned-version.conf.cmd         |   1 +
 .../q35-linux-hostpci-multifunction.conf.cmd |   1 +
 test/cfg2cmd/q35-linux-hostpci.conf.cmd      |   1 +
 test/cfg2cmd/q35-win10-hostpci.conf.cmd      |   1 +
 test/cfg2cmd/simple-virtio-blk.conf.cmd      |   1 +
 test/cfg2cmd/simple1.conf.cmd                |   1 +
 test/cfg2cmd/spice-enhancments.conf.cmd      |   1 +
 test/cfg2cmd/spice-linux-4.1.conf.cmd        |   1 +
 test/cfg2cmd/spice-usb3.conf.cmd             |   1 +
 test/cfg2cmd/spice-win.conf.cmd              |   1 +
 27 files changed, 472 insertions(+), 50 deletions(-)

manager:

Stefan Reiter (1):
  ui: qemu: set correct disabled state for start button

 www/manager6/qemu/Config.js | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

-- 
2.20.1
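
For readers unfamiliar with the pidfd approach mentioned in the v2
notes, the following is a rough sketch in C of "signal via pidfd, fall
back to kill()". It is an illustration only, not the code from the
patches: the function names open_pidfd and signal_process are
hypothetical, and raw syscalls are used on the assumption that libc
wrappers may be missing. A pidfd refers to one specific process, so a
signal sent through it cannot hit an unrelated process that reused the
PID, which is the race a plain kill() is exposed to.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <errno.h>
#include <signal.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: open a pidfd for `pid`. Returns -1 with errno set
 * if the pidfd_open syscall (Linux 5.3+) is unknown or fails.
 */
int open_pidfd(pid_t pid)
{
#ifdef SYS_pidfd_open
    return (int)syscall(SYS_pidfd_open, pid, 0);
#else
    errno = ENOSYS;
    return -1;
#endif
}

/*
 * Hypothetical helper: send `sig` to the process. Uses the pidfd when
 * one is available, otherwise falls back to the racy kill() by PID.
 */
int signal_process(pid_t pid, int pidfd, int sig)
{
    assert(pid > 0);
#ifdef SYS_pidfd_send_signal
    if (pidfd >= 0) {
        int ret = (int)syscall(SYS_pidfd_send_signal, pidfd, sig, NULL, 0);
        /* Only fall through to kill() if the kernel lacks the syscall. */
        if (ret == 0 || errno != ENOSYS)
            return ret;
    }
#endif
    return kill(pid, sig);
}
```

A caller would open the pidfd once, when it first learns the PID of the
QEMU process, keep it for the lifetime of that process, and pass it to
signal_process() together with SIGTERM or SIGKILL during cleanup.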