* [PVE-User] HA migrate failing without error
From: Ralf Storm @ 2021-05-07 11:34 UTC
To: pve-user
Hello list,
we have a cluster of 7 nodes with Ceph, all updated, with a subscription.
Since the update to 6.4 we can no longer HA-migrate to one particular node.
There is no error; the task log just says "start ha migrate ok" and "End HA
Migrate OK".
We rebooted everything, without success.
Migrating non-HA VMs to this node works like a charm.
Has anybody had a similar issue already?
Best regards,
Ralf
* Re: [PVE-User] HA migrate failing without error
From: Aaron Lauterer @ 2021-05-07 11:47 UTC
To: Proxmox VE user list, Ralf Storm
On 5/7/21 1:34 PM, Ralf Storm wrote:
> Hello List,
>
> we have a cluster of 7 Nodes with ceph, all updated with subscription.
>
> since update to 6.4 we cannot ha migrate to one single node anymore, without any error, it just says " start ha migrate ok" and "End HA Migrate OK"
That is normal. What should happen a few seconds later is that the actual migration task starts. I tested it quickly in one of my test clusters and it worked.
First there is the 'HA <VMID> - Migrate' task with the log lines you described. Then, a few seconds later, the 'VM <VMID> - Migrate' task starts, which does the actual migration.
Do you see that second task happening at all?
> _______________________________________________
> pve-user mailing list
> pve-user@lists.proxmox.com
> https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user
* Re: [PVE-User] HA migrate failing without error
From: Ralf Storm @ 2021-05-07 12:06 UTC
To: Aaron Lauterer, Proxmox VE user list
I know this behaviour, but the second task does not start.
Actually, it seems that no host can migrate HA VMs anymore, only VMs that are not under HA.
* Re: [PVE-User] HA migrate failing without error
From: Ralf Storm @ 2021-05-07 13:17 UTC
To: pve-user
I looked into the error further and tried restarting some nodes, because
under "HA" the master is shown as: old timestamp - dead?
After the reboot, quorum is acquired as usual, but the entry for the master
remains and no other master is elected.
I migrated some VMs for the reboot, which I first had to remove from HA to
be able to migrate them; they are now stuck on "deleting" in HA.
Investigating further, I found these VMs to be locked under several
nodes, and I can see this strange entry on all nodes: lock--1.conf
Can I force the master to change?
Any other suggestions? I have run out of ideas...
* Re: [PVE-User] HA migrate failing without error
From: [LFWD] Virgil O @ 2021-05-07 14:46 UTC
To: Proxmox VE user list
Hi Ralf,
Have you tried checking the status of the CRM service?
cf. https://forum.proxmox.com/threads/ha-mgr-old-timestamp-dead.47097/
If you see this kind of message, just remove the configuration file or
move it to the node.
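
A minimal sketch of how the "old timestamp - dead?" condition could be
spotted automatically, assuming the text comes from `ha-manager status` and
that each line has the shape "<role> <node> (<state>, <timestamp>)". That
line shape, the function name and the node names in the sample are
assumptions for illustration only; nothing in this thread confirms them.

```python
import re

# Assumed line shape: "<role> <node> (<state>, <timestamp>)".
# This layout is an assumption, not something confirmed in this thread.
STATUS_LINE = re.compile(r"^(master|lrm)\s+(\S+)\s+\((.*)\)\s*$")


def find_stale_entries(status_text):
    """Return (role, node) pairs whose state mentions 'old timestamp'."""
    stale = []
    for line in status_text.splitlines():
        match = STATUS_LINE.match(line.strip())
        if match and "old timestamp" in match.group(3):
            stale.append((match.group(1), match.group(2)))
    return stale


# Hypothetical sample text; node names and timestamps are invented.
SAMPLE = """quorum OK
master node1 (old timestamp - dead?, Fri May  7 10:00:00 2021)
lrm node1 (active, Fri May  7 15:00:00 2021)
lrm node2 (active, Fri May  7 15:00:01 2021)
"""

if __name__ == "__main__":
    for role, node in find_stale_entries(SAMPLE):
        print(f"{role} entry on {node} looks stale")
```

The function only reports which entries look stale. It does not change
anything on the cluster, so the decision about removing or moving a
configuration file stays a manual step.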
Virgil