From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path: <pve-user-bounces@lists.proxmox.com>
Received: from firstgate.proxmox.com (firstgate.proxmox.com [IPv6:2a01:7e0:0:424::9])
 by lore.proxmox.com (Postfix) with ESMTPS id 92CE71FF162
 for <inbox@lore.proxmox.com>; Sat, 10 Aug 2024 09:28:51 +0200 (CEST)
Received: from firstgate.proxmox.com (localhost [127.0.0.1])
 by firstgate.proxmox.com (Proxmox) with ESMTP id 39FD48611;
 Sat, 10 Aug 2024 09:28:57 +0200 (CEST)
Date: Sat, 10 Aug 2024 09:22:11 +0200
To: casati@kona.it
In-Reply-To: <f7806f1a-6604-453b-96c4-d127df37cd17@kona.it>
References: <f7806f1a-6604-453b-96c4-d127df37cd17@kona.it>
MIME-Version: 1.0
Message-ID: <mailman.171.1723274935.302.pve-user@lists.proxmox.com>
List-Id: Proxmox VE user list <pve-user.lists.proxmox.com>
List-Post: <mailto:pve-user@lists.proxmox.com>
From: Alwin Antreich via pve-user <pve-user@lists.proxmox.com>
Precedence: list
Cc: Alwin Antreich <alwin@antreich.com>,
 Proxmox VE user list <pve-user@lists.proxmox.com>
X-Mailman-Version: 2.1.29
X-BeenThere: pve-user@lists.proxmox.com
List-Subscribe: <https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user>,
 <mailto:pve-user-request@lists.proxmox.com?subject=subscribe>
List-Unsubscribe: <https://lists.proxmox.com/cgi-bin/mailman/options/pve-user>,
 <mailto:pve-user-request@lists.proxmox.com?subject=unsubscribe>
List-Archive: <http://lists.proxmox.com/pipermail/pve-user/>
Reply-To: Proxmox VE user list <pve-user@lists.proxmox.com>
List-Help: <mailto:pve-user-request@lists.proxmox.com?subject=help>
Subject: Re: [PVE-User] Dell R350, Proxmox VE 8.2.2, sas-megaraid error and system hang
Content-Type: multipart/mixed; boundary="===============1078843701354809579=="
Errors-To: pve-user-bounces@lists.proxmox.com
Sender: "pve-user" <pve-user-bounces@lists.proxmox.com>

--===============1078843701354809579==
Content-Type: message/rfc822
Content-Disposition: inline

Return-Path: <alwin@antreich.com>
X-Original-To: pve-user@lists.proxmox.com
Delivered-To: pve-user@lists.proxmox.com
Received: from firstgate.proxmox.com (firstgate.proxmox.com [212.224.123.68])
 (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
 key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256)
 (No client certificate requested)
 by lists.proxmox.com (Postfix) with ESMTPS id 24382C2EA4
 for <pve-user@lists.proxmox.com>; Sat, 10 Aug 2024 09:28:55 +0200 (CEST)
Received: from firstgate.proxmox.com (localhost [127.0.0.1])
 by firstgate.proxmox.com (Proxmox) with ESMTP id 0133384D6
 for <pve-user@lists.proxmox.com>; Sat, 10 Aug 2024 09:28:25 +0200 (CEST)
Received: from mx.antreich.com (mx.antreich.com [173.249.42.230])
 (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
 key-exchange X25519 server-signature RSA-PSS (2048 bits))
 (No client certificate requested)
 by firstgate.proxmox.com (Proxmox) with ESMTPS
 for <pve-user@lists.proxmox.com>; Sat, 10 Aug 2024 09:28:23 +0200 (CEST)
Received: from mail2.antreich.com (unknown [172.16.9.25])
 by mx.antreich.com (Postfix) with ESMTPS id 03EB26E2E02;
 Sat, 10 Aug 2024 09:22:11 +0200 (CEST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=antreich.com; s=2018;
 t=1723274532;
 h=from:from:reply-to:subject:subject:date:date:message-id:message-id:
 to:to:cc:cc:mime-version:mime-version:content-type:content-type:
 content-transfer-encoding:content-transfer-encoding:
 in-reply-to:in-reply-to:references:references;
 bh=r52u8srye6mMD4q7NGNiNFahhku1V0HOI7f+eyCZ51A=;
 b=l0W/YjnK2NlelR8I9hY+T446sY5M0vNLGKpFbEjVa5hQKpgzgbbyKdqXad1gh0PyCoDrvO
 YjfCEf7cDWXfxlKPss1MNuti84cPk1bp17RIdrRIDJVn6x2oG9ReBKRmBeq7t5vA/c7Nt+
 28K7qU0Fana4x3hKer8U3nNuGymgOv6wktuDKb+D5Sf3KXmY3e4rO/5MT4WbegyIRFi3e+
 0nESUKl3gl5BS4xKIrv8+lW/sHJTWisHszCThWLHkPyev05tbPJ7NiGUYoK6WPjeBk5Z3k
 ELwZs4ov06GA/8uhxgEno9CuzjjudJpuLlBXuMrPSO9OR+d2plEZHWZQ3oxynA==
Date: Sat, 10 Aug 2024 09:22:11 +0200
From: Alwin Antreich <alwin@antreich.com>
To: casati@kona.it
CC: Proxmox VE user list <pve-user@lists.proxmox.com>
Subject:
 Re: [PVE-User] Dell R350, Proxmox VE 8.2.2, sas-megaraid error and system hang
In-Reply-To: <f7806f1a-6604-453b-96c4-d127df37cd17@kona.it>
References: <f7806f1a-6604-453b-96c4-d127df37cd17@kona.it>
Message-ID: <B601AE4A-F3C4-44A1-9F48-C116F8208BD8@antreich.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit

Hi Andrea,

On August 9, 2024 1:30:22 PM GMT+02:00, Andrea Casati <casati@kona.it> wrote:
>Hello
>
>Dell R350 with PERC H755.
>Tried with kernel 6.8.4, 6.8.8 and 6.5.13.
>System hangs (need to physically power off/on the machine) every day during compressed backup, and sometimes during normal usage of VM.
>
>Log with kernel 6.8.4:
>*Jul 15 19:04:45 r350ve kernel: megaraid_sas 0000:01:00.0: Adapter is OPERATIONAL for scsi:0
>Jul 15 19:04:45 r350ve kernel: megaraid_sas 0000:01:00.0: Snap dump wait time    : 15
>Jul 15 19:04:45 r350ve kernel: megaraid_sas 0000:01:00.0: Reset successful for scsi0.
>Jul 15 19:04:45 r350ve kernel: megaraid_sas 0000:01:00.0: 3296 (774378251s/0x0020/DEAD) - Fatal firmware error: Line 188 in fw\raid\utils.c
>Jul 15 19:04:45 r350ve kernel: megaraid_sas 0000:01:00.0: 3300 (boot + 5s/0x0020/CRIT) - Controller encountered an error and was reset*
>
>Errors on console with kernel 6.5.13:
>*kvm_intel: kvm [2225]: vcpu0, guest rIP: 0xfffff80277d68f93 Unhandled WRMSR(0x1d9) = 0x1*
>*megaraid_sas 0000:01:00.0: FW in FAULT state Fault code:0x10000 subcode:0x0 func:megasas_wait_for_outstanding_fusion*
>
>
>iDRAC reports no errors - Dell support reports no problems.
>
>Has anyone seen something like this before?

I've seen similar issues with other controllers when a faulty disk was present.
And do you have the latest firmware on the controller?

Cheers,
Alwin

--===============1078843701354809579==
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
pve-user mailing list
pve-user@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user

--===============1078843701354809579==--