* [PVE-User] Proxmox Fencing @ 2021-07-05 11:18 Alex K [not found] ` <mailman.217.1625484651.464.pve-user@lists.proxmox.com> 0 siblings, 1 reply; 3+ messages in thread From: Alex K @ 2021-07-05 11:18 UTC (permalink / raw) To: pve-user Hi all, I'm new to proxmox and trying to setup a 2 + 1 node active/active HA cluster on top glusterfs using latest community pve-manager/6.4-4/337d6701 (running kernel: 5.4.106-1-pve). The third node is used for gluster arbitration and perhaps I have to configure in it a quorum disk also to keep quorum in case of a node failure (not clear yet at my mind, still reading the docs). I am stuck at the moment at the fencing part of the setup. Reading through the docs it seems that I have only the option to setup hardware watchdog fencing. I would expect to be able to use external media such as IPMI, iDrac, HP iLO or UPS based power management (APC) though I can't find any info how these are configured at current version of Proxmox. In case of a network partition and not a node hardware issue, how is the watchdog going to behave? Is a healthy but disconnected node going to be power cycled? I will soon proceed with testing as soon as I manage to setup fencing though I wanted to better understand this part of fencing. Appreciate any feedback from your experience and use cases, Alex ^ permalink raw reply [flat|nested] 3+ messages in thread
[parent not found: <mailman.217.1625484651.464.pve-user@lists.proxmox.com>]
* Re: [PVE-User] Proxmox Fencing [not found] ` <mailman.217.1625484651.464.pve-user@lists.proxmox.com> @ 2021-07-05 11:49 ` Alex K [not found] ` <mailman.220.1625486801.464.pve-user@lists.proxmox.com> 0 siblings, 1 reply; 3+ messages in thread From: Alex K @ 2021-07-05 11:49 UTC (permalink / raw) To: Proxmox VE user list Hi Eneko, On Mon, Jul 5, 2021 at 2:30 PM Eneko Lacunza via pve-user < pve-user@lists.proxmox.com> wrote: > > > > ---------- Forwarded message ---------- > From: Eneko Lacunza <elacunza@binovo.es> > To: pve-user@lists.proxmox.com > Cc: > Bcc: > Date: Mon, 5 Jul 2021 13:30:41 +0200 > Subject: Re: [PVE-User] Proxmox Fencing > Hi Alex, > > El 5/7/21 a las 13:18, Alex K escribió: > > Hi all, > > > > I'm new to proxmox and trying to setup a 2 + 1 node active/active HA > > cluster on top glusterfs using latest community > pve-manager/6.4-4/337d6701 > > (running kernel: 5.4.106-1-pve). The third node is used for gluster > > arbitration and perhaps I have to configure in it a quorum disk also to > > keep quorum in case of a node failure (not clear yet at my mind, still > > reading the docs). > If you have 3 nodes, you want all them in Proxmox cluster for proper > quorum majority. No need for quorum disk that way. (note that I don't > know how gluster works). > Gluser has a similar concept for quorum so as to keep writes on the storage. Hence I am placing a third node in the setup. Due to cost limitations, the third node has minimal specs and is not meant to host VMs. It is a mini-PC thats why I did not add it as a proxmox host. I am wondering if it is possible to add it as a proxmox host and put a constraint to avoid VMs migrating into it. In this way I will achieve the required quorum levels without adding a full spec host. > I am stuck at the moment at the fencing part of the setup. Reading through > > the docs it seems that I have only the option to setup hardware watchdog > > fencing. I would expect to be able to use external media such as IPMI, > > iDrac, HP iLO or UPS based power management (APC) though I can't find any > > info how these are configured at current version of Proxmox. > Currently by default Proxmox uses a software watchdog. I'm not sure if > hardware watchdog support was introduced, others may help with this. > According to the docs it seems there is hardware watchdog option: https://pve.proxmox.com/pve-docs/chapter-ha-manager.html Q+++ hardware watchdog - if not available we fall back to the linux kernel software watchdog (softdog) +++Q > > In case of a network partition and not a node hardware issue, how is the > > watchdog going to behave? Is a healthy but disconnected node going to be > > power cycled? I will soon proceed with testing as soon as I manage to > setup > > fencing though I wanted to better understand this part of fencing. > > The node that drops out of quorum will be rebooted. If there where > CM/CTs configured for HA in that node, Proxmox will attempt to restart > them in another node. > So soft-fencing is done from ha-manager? How are the other nodes notified that the rebooted host is indeed rebooted so as to start the HA VMs? Thanx for the feedback > Cheers > > Eneko Lacunza > Zuzendari teknikoa | Director técnico > Binovo IT Human Project > > Tel. +34 943 569 206 | https://www.binovo.es > Astigarragako Bidea, 2 - 2º izda. Oficina 10-11, 20180 Oiartzun > > https://www.youtube.com/user/CANALBINOVO > https://www.linkedin.com/company/37269706/ > > > > > > ---------- Forwarded message ---------- > From: Eneko Lacunza via pve-user <pve-user@lists.proxmox.com> > To: pve-user@lists.proxmox.com > Cc: Eneko Lacunza <elacunza@binovo.es> > Bcc: > Date: Mon, 5 Jul 2021 13:30:41 +0200 > Subject: Re: [PVE-User] Proxmox Fencing > _______________________________________________ > pve-user mailing list > pve-user@lists.proxmox.com > https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user > ^ permalink raw reply [flat|nested] 3+ messages in thread
[parent not found: <mailman.220.1625486801.464.pve-user@lists.proxmox.com>]
* Re: [PVE-User] Proxmox Fencing [not found] ` <mailman.220.1625486801.464.pve-user@lists.proxmox.com> @ 2021-07-05 12:32 ` Alex K 0 siblings, 0 replies; 3+ messages in thread From: Alex K @ 2021-07-05 12:32 UTC (permalink / raw) To: Proxmox VE user list On Mon, Jul 5, 2021 at 3:06 PM Eneko Lacunza via pve-user < pve-user@lists.proxmox.com> wrote: > > > > ---------- Forwarded message ---------- > From: Eneko Lacunza <elacunza@binovo.es> > To: pve-user@lists.proxmox.com > Cc: > Bcc: > Date: Mon, 5 Jul 2021 14:06:31 +0200 > Subject: Re: [PVE-User] Proxmox Fencing > Hi Alex, > > >> El 5/7/21 a las 13:18, Alex K escribió: > >>> Hi all, > >>> > >>> I'm new to proxmox and trying to setup a 2 + 1 node active/active HA > >>> cluster on top glusterfs using latest community > >> pve-manager/6.4-4/337d6701 > >>> (running kernel: 5.4.106-1-pve). The third node is used for gluster > >>> arbitration and perhaps I have to configure in it a quorum disk also to > >>> keep quorum in case of a node failure (not clear yet at my mind, still > >>> reading the docs). > >> If you have 3 nodes, you want all them in Proxmox cluster for proper > >> quorum majority. No need for quorum disk that way. (note that I don't > >> know how gluster works). > >> > > Gluser has a similar concept for quorum so as to keep writes on the > > storage. Hence I am placing a third node in the setup. Due to cost > > limitations, the third node has minimal specs and is not meant to host > VMs. > > It is a mini-PC thats why I did not add it as a proxmox host. I am > > wondering if it is possible to add it as a proxmox host and put a > > constraint to avoid VMs migrating into it. In this way I will achieve the > > required quorum levels without adding a full spec host. > > Yes, you can create node-groups in HA groups, and add the desired nodes > to the group. Then when adding a VM/CT to HA, configure the group there > too. > I see. Thanx for the pointer. >> I am stuck at the moment at the fencing part of the setup. Reading > through > >>> the docs it seems that I have only the option to setup hardware > watchdog > >>> fencing. I would expect to be able to use external media such as IPMI, > >>> iDrac, HP iLO or UPS based power management (APC) though I can't find > any > >>> info how these are configured at current version of Proxmox. > >> Currently by default Proxmox uses a software watchdog. I'm not sure if > >> hardware watchdog support was introduced, others may help with this. > >> > > According to the docs it seems there is hardware watchdog option: > > https://pve.proxmox.com/pve-docs/chapter-ha-manager.html > > Q+++ > > hardware watchdog - if not available we fall back to the linux kernel > > software watchdog (softdog) > > +++Q > Never used that, sorry. > >>> In case of a network partition and not a node hardware issue, how is > the > >>> watchdog going to behave? Is a healthy but disconnected node going to > be > >>> power cycled? I will soon proceed with testing as soon as I manage to > >> setup > >>> fencing though I wanted to better understand this part of fencing. > >> The node that drops out of quorum will be rebooted. If there where > >> CM/CTs configured for HA in that node, Proxmox will attempt to restart > >> them in another node. > >> > > So soft-fencing is done from ha-manager? How are the other nodes notified > > that the rebooted host is indeed rebooted so as to start the HA VMs? > There is a time delay that allows the fended node time to reboot before > other nodes take over the HA VMs. It's like 1-2 minutes. The fenced node > (the one out of the quorum) will reboot in max 60s. > OK. I understand that there is a locking mechanism which takes place and determines the node states. > Cheers > > Eneko Lacunza > Zuzendari teknikoa | Director técnico > Binovo IT Human Project > > Tel. +34 943 569 206 | https://www.binovo.es > Astigarragako Bidea, 2 - 2º izda. Oficina 10-11, 20180 Oiartzun > > https://www.youtube.com/user/CANALBINOVO > https://www.linkedin.com/company/37269706/ > > > > > > ---------- Forwarded message ---------- > From: Eneko Lacunza via pve-user <pve-user@lists.proxmox.com> > To: pve-user@lists.proxmox.com > Cc: Eneko Lacunza <elacunza@binovo.es> > Bcc: > Date: Mon, 5 Jul 2021 14:06:31 +0200 > Subject: Re: [PVE-User] Proxmox Fencing > _______________________________________________ > pve-user mailing list > pve-user@lists.proxmox.com > https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user > ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2021-07-05 12:33 UTC | newest] Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2021-07-05 11:18 [PVE-User] Proxmox Fencing Alex K [not found] ` <mailman.217.1625484651.464.pve-user@lists.proxmox.com> 2021-07-05 11:49 ` Alex K [not found] ` <mailman.220.1625486801.464.pve-user@lists.proxmox.com> 2021-07-05 12:32 ` Alex K
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox