From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from firstgate.proxmox.com (firstgate.proxmox.com [212.224.123.68]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by lists.proxmox.com (Postfix) with ESMTPS id 7DD8E6D33E for ; Thu, 4 Feb 2021 17:59:59 +0100 (CET) Received: from firstgate.proxmox.com (localhost [127.0.0.1]) by firstgate.proxmox.com (Proxmox) with ESMTP id 6997327A48 for ; Thu, 4 Feb 2021 17:59:29 +0100 (CET) Received: from proxmox-new.maurer-it.com (proxmox-new.maurer-it.com [212.186.127.180]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by firstgate.proxmox.com (Proxmox) with ESMTPS id DCD5327A39 for ; Thu, 4 Feb 2021 17:59:27 +0100 (CET) Received: from proxmox-new.maurer-it.com (localhost.localdomain [127.0.0.1]) by proxmox-new.maurer-it.com (Proxmox) with ESMTP id 9F108461E2; Thu, 4 Feb 2021 17:59:27 +0100 (CET) To: Sergey Korobkov Cc: Damir Chanyshev , Proxmox VE user list References: <9b662ee8-10c1-9a71-e598-674538535c51@gmail.com> From: Stefan Reiter Message-ID: <71fdbe8b-6212-e355-fdd3-ae2d69908307@proxmox.com> Date: Thu, 4 Feb 2021 17:59:26 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.6.1 MIME-Version: 1.0 In-Reply-To: <9b662ee8-10c1-9a71-e598-674538535c51@gmail.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-SPAM-LEVEL: Spam detection results: 0 AWL 0.052 Adjusted score from AWL reputation of From: address KAM_DMARC_STATUS 0.01 Test Rule for DKIM or SPF Failure with Strict Alignment NICE_REPLY_A -0.178 Looks like a legit reply (A) RCVD_IN_DNSWL_MED -2.3 Sender listed at https://www.dnswl.org/, medium trust SPF_HELO_NONE 0.001 SPF: HELO does not publish an SPF Record SPF_PASS -0.001 SPF: sender matches SPF record URIBL_BLOCKED 0.001 ADMINISTRATOR NOTICE: The query to URIBL was blocked. See http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block for more information. [nongnu.org] Subject: Re: [PVE-User] Live migration fails with "Mismatched RAM page size ram-node0 (local) 2097152 != 1526773257204281392" X-BeenThere: pve-user@lists.proxmox.com X-Mailman-Version: 2.1.29 Precedence: list List-Id: Proxmox VE user list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 04 Feb 2021 16:59:59 -0000 On 02/02/2021 15:50, Sergey Korobkov wrote: > Hello, > > Two exactly the same machines ( except ram size 380G and 1.5T ). > > Upgraded on Debian 10.7 from: > pve-manager/6.1-5/9bf06119 > Linux 5.3.13-1-pve #1 SMP PVE 5.3.13-1 (Thu, 05 Dec 2019 07:18:14 +0100) > QEMU emulator version 4.1.1 (pve-qemu-kvm_4.1.1) > > to: > pve-manager/6.3-3/eee5f901 > Linux 5.4.78-2-pve #1 SMP PVE 5.4.78-2 (Thu, 03 Dec 2020 14:26:17 +0100) > QEMU emulator version 5.1.0 (pve-qemu-kvm_5.1.0) > > We had enabled hugepages for virtual machines( "hugepages: 2" specified > in virtual machine description). > > Live migration fails with errors like this: > > Feb 02 16:26:13 QEMU[12090]: kvm7: load of migration failed: Invalid > argument > Feb 02 16:26:13 QEMU[12090]: kvm7: error while loading state for > instance 0x0 of device 'ram' > Feb 02 16:26:13 QEMU[12090]: kvm7: Mismatched RAM page size ram-node0 > (local) 2097152 != 1526773257204281392 > > We think it's some overflow issue. > Hi! After looking carefully I believe to have found the root cause of this issue in an upstream bug that we run into since pve-qemu-kvm 5.1.0-4, where we started migrating dirty bitmaps. I have sent a potential fix to the upstream qemu-devel mailing list: https://lists.nongnu.org/archive/html/qemu-devel/2021-02/msg01711.html If the resident experts on there agree that this is indeed the solution, we will most likely ship it once we release our QEMU 5.2 build :) Thanks for the report!