To: pve-user@lists.proxmox.com
From: Adam Weremczuk
Organization: Matrix Science Ltd
Message-ID: <39b5815a-c591-8c08-21ce-fb441dff8bb3@matrixscience.com>
Date: Wed, 18 Aug 2021 12:10:41 +0100
Subject: [PVE-User] stuck container

Hi all,

Host PVE version: 6.2-6,
Debian 10.4
Dual host setup using ZFS and replication.
Container: Debian 10.10

The container responds to ping and to telnet on the SSH port, but I can't SSH into it, access it via the PVE console, or stop/start it. I'm seeing all sorts of errors:

command 'lxc-stop -n 100 --timeout 5' failed: got timeout
TASK ERROR: received interrupt

command 'lxc-stop -n 100 --timeout 5' failed: got timeout
TASK ERROR: got unexpected control message:

trying to acquire lock...
TASK ERROR: can't lock file '/run/lock/lxc/pve-config-100.lock' - got timeout

malformed authentication string
TASK ERROR: command '/usr/bin/termproxy 5901 --path /vms/101 --perm VM.Console -- /usr/bin/dtach -A /var/run/dtach/vzctlconsole101 -r winch -z lxc-console -n 101 -e -1' failed: exit code 255

From the shell I tried deleting /run/lock/lxc/pve-config-100.lock, killing all processes referring to container 100, and starting it again, but no joy:

pct reboot 100 --timeout 5
trying to acquire lock...
can't lock file '/run/lock/lxc/pve-config-100.lock' - got timeout

pct stop 100 --skiplock 1
trying to acquire lock...
can't lock file '/run/lock/lxc/pve-config-100.lock' - got timeout

pct shutdown 100 --forceStop 1 --timeout 5
trying to acquire lock...
can't lock file '/run/lock/lxc/pve-config-100.lock' - got timeout

pct stop 100 --skiplock 1
[stalls here]

All the other containers appear to be running fine.

Any ideas?

Thanks,
Adam