From: Maximiliano Sandoval <m.sandoval@proxmox.com>
To: Jakob Klocker <j.klocker@proxmox.com>
Cc: pve-devel@lists.proxmox.com
Subject: Re: [PATCH qemu-server v3 3/3] fix #5032: agent: sync guest time via QGA when the clock falls behind
Date: Thu, 02 Jul 2026 12:56:50 +0200 [thread overview]
Message-ID: <s8o7bnd7j7x.fsf@toolbox> (raw)
In-Reply-To: <20260702103201.164567-4-j.klocker@proxmox.com> (Jakob Klocker's message of "Thu, 2 Jul 2026 12:32:01 +0200")
Jakob Klocker <j.klocker@proxmox.com> writes:
Some comments below.
One design comment: I am not sure if the functions should have an
$override parameter, it could be argued than it is the job of the
call-site to determine whether how to proceeded based on API+config
parameters.
disclaimer: I did not check v1 and v2's discussions.
> Add a 'sync-time-on-resume' agent config property (default off) that,
> when enabled, issues the guest-set-time QGA command to set the guest
> clock to the host's current time after operations that leave the clock
> behind:
> - resuming from hibernation (suspend-to-disk)
> - resuming from pause/suspend
> - rolling back to a snapshot that includes RAM
> - live migration
> - taking a snapshot, during which the guest is briefly frozen
>
> The start, resume and rollback API endpoints gain a
> 'sync-time-on-resume' parameter (e.g. 'qm start <vmid>
> --sync-time-on-resume 0') that overrides the configured agent value for
> a single operation, forcing the sync on or off regardless of config.
>
> guest_set_time prints a confirmation line to stdout on success. During
> live migration the resume step runs over the migration tunnel, which
> also uses stdout, so an unconditional print corrupts the tunnel
> protocol. vm_resume therefore takes a 'quiet' parameter to suppress the
> print on the migration path; the confirmation is logged by the migration
> code instead.
>
> Signed-off-by: Jakob Klocker <j.klocker@proxmox.com>
> ---
> changes since v1:
> - adapt warning message in the resume-from-saved-state path
> changes since v2:
> - add sync on pause/resume
> - add override options
> - add print task
> - adapt formating and naming
>
> src/PVE/API2/Qemu.pm | 46 ++++++++++++++++++++++++++---
> src/PVE/CLI/qm.pm | 2 +-
> src/PVE/QemuConfig.pm | 12 ++++++--
> src/PVE/QemuMigrate.pm | 2 ++
> src/PVE/QemuServer.pm | 18 +++++++++++-
> src/PVE/QemuServer/Agent.pm | 54 ++++++++++++++++++++++++++++++++++
> src/PVE/QemuServer/RunState.pm | 11 ++++++-
> 7 files changed, 136 insertions(+), 9 deletions(-)
>
> diff --git a/src/PVE/API2/Qemu.pm b/src/PVE/API2/Qemu.pm
> index 9023c344..f58151e8 100644
> --- a/src/PVE/API2/Qemu.pm
> +++ b/src/PVE/API2/Qemu.pm
> @@ -3510,6 +3510,14 @@ __PACKAGE__->register_method({
> . ' the migration. A value of 0 means that the host_mtu parameter is to be'
> . ' avoided for the corresponding device.',
> },
> + 'sync-time-on-resume' => {
> + type => 'boolean',
> + description =>
Please have both a `description` and a `verbose_description`. The
verbose description should mention all cases where this can happen and
if possible mention the exact QGA command used (guest-set-time).
There is a more complete verbose_description at Agent.pm below, but imho
all doc strings should be complete enough to be read on their own,
unless they reference another doc.
> + "Override the VM's configured 'sync-time-on-resume' agent setting for this"
This is an agent config so: "The VM QGA's configured..." or similar.
> + . " start only. Controls whether the guest clock is synchronized after the"
> + . " RAM state is restored. The guest agent must be enabled.",
> + optional => 1,
> + },
> },
> },
> returns => {
> @@ -3525,6 +3533,7 @@ __PACKAGE__->register_method({
> my $vmid = extract_param($param, 'vmid');
> my $timeout = extract_param($param, 'timeout');
> my $machine = extract_param($param, 'machine');
> + my $sync_time = extract_param($param, 'sync-time-on-resume');
>
> my $get_root_param = sub {
> my $value = extract_param($param, $_[0]);
> @@ -3631,6 +3640,7 @@ __PACKAGE__->register_method({
> timeout => $timeout,
> forcecpu => $force_cpu,
> 'nets-host-mtu' => $nets_host_mtu,
> + 'sync-time-on-resume' => $sync_time,
> };
>
> PVE::QemuServer::vm_start($storecfg, $vmid, $params, $migrate_opts);
> @@ -4115,6 +4125,14 @@ __PACKAGE__->register_method({
> description => "Do not check whether the VM is running, used"
> . " internally during migration. Only root may use this option.",
> },
> + 'sync-time-on-resume' => {
> + type => 'boolean',
> + description =>
> + "Override the VM's configured 'sync-time-on-resume' agent setting for this"
> + . " resume only. Controls whether the guest clock is synchronized after the"
> + . " RAM state is restored. The guest agent must be enabled.",
Same considerations here.
> + optional => 1,
> + },
> },
> },
> returns => {
> @@ -4131,6 +4149,8 @@ __PACKAGE__->register_method({
>
> my $vmid = extract_param($param, 'vmid');
>
> + my $sync_time = extract_param($param, 'sync-time-on-resume');
> +
> my $skiplock = extract_param($param, 'skiplock');
> raise_param_exc({ skiplock => "Only root may use this option." })
> if $skiplock && $authuser ne 'root@pam';
> @@ -4161,10 +4181,14 @@ __PACKAGE__->register_method({
> syslog('info', "resume VM $vmid: $upid\n");
>
> if (!$to_disk_suspended) {
> - PVE::QemuServer::RunState::vm_resume($vmid, $skiplock, $nocheck);
> + PVE::QemuServer::RunState::vm_resume($vmid, $skiplock, $nocheck, $sync_time);
> } else {
> my $storecfg = PVE::Storage::config();
> - PVE::QemuServer::vm_start($storecfg, $vmid, { skiplock => $skiplock });
> + PVE::QemuServer::vm_start(
> + $storecfg,
> + $vmid,
> + { skiplock => $skiplock, 'sync-time-on-resume' => $sync_time },
> + );
> }
>
> return;
> @@ -6323,6 +6347,14 @@ __PACKAGE__->register_method({
> optional => 1,
> default => 0,
> },
> + 'sync-time-on-resume' => {
> + type => 'boolean',
> + description =>
> + "Override the VM's configured sync-time-on-resume agent setting for this"
> + . " rollback only. Controls whether the guest clock is synchronized"
> + . " after the RAM state is restored. The guest agent must be enabled.",
> + optional => 1,
Ditto.
> + },
> },
> },
> returns => {
> @@ -6342,6 +6374,8 @@ __PACKAGE__->register_method({
>
> my $snapname = extract_param($param, 'snapname');
>
> + my $sync_time = extract_param($param, 'sync-time-on-resume');
> +
> # vm_start is invoked directly from the worker, so its own permissions
> # predicate doesn't fire here - check VM.PowerMgmt up front so a user
> # with only VM.Snapshot.Rollback can't power the VM on as a side effect.
> @@ -6350,7 +6384,11 @@ __PACKAGE__->register_method({
>
> my $realcmd = sub {
> PVE::Cluster::log_msg('info', $authuser, "rollback snapshot VM $vmid: $snapname");
> - PVE::QemuConfig->snapshot_rollback($vmid, $snapname);
> + PVE::QemuConfig->snapshot_rollback(
> + $vmid,
> + $snapname,
> + { 'sync-time-on-resume' => $sync_time },
> + );
>
> if ($param->{start} && !PVE::QemuServer::Helpers::vm_running_locally($vmid)) {
> PVE::API2::Qemu->vm_start({ vmid => $vmid, node => $node });
> @@ -6915,7 +6953,7 @@ __PACKAGE__->register_method({
> },
> 'resume' => sub {
> if (PVE::QemuServer::Helpers::vm_running_locally($state->{vmid})) {
> - PVE::QemuServer::RunState::vm_resume($state->{vmid}, 1, 1);
> + PVE::QemuServer::RunState::vm_resume($state->{vmid}, 1, 1, undef, 1);
> } else {
> die "VM $state->{vmid} not running\n";
> }
> diff --git a/src/PVE/CLI/qm.pm b/src/PVE/CLI/qm.pm
> index b903c1f1..25351f64 100755
> --- a/src/PVE/CLI/qm.pm
> +++ b/src/PVE/CLI/qm.pm
> @@ -473,7 +473,7 @@ __PACKAGE__->register_method({
> if (PVE::QemuServer::Helpers::vm_running_locally($vmid)) {
> # vm_resume with nocheck, since local node might not have processed config
> # move/rename yet
> - eval { PVE::QemuServer::RunState::vm_resume($vmid, 1, 1); };
> + eval { PVE::QemuServer::RunState::vm_resume($vmid, 1, 1, undef, 1); };
> if ($@) {
> $tunnel_write->("ERR: resume failed - $@");
> } else {
> diff --git a/src/PVE/QemuConfig.pm b/src/PVE/QemuConfig.pm
> index 26f0fda2..e95a8c61 100644
> --- a/src/PVE/QemuConfig.pm
> +++ b/src/PVE/QemuConfig.pm
> @@ -383,6 +383,11 @@ sub __snapshot_create_vol_snapshots_hook {
> next;
> }
> }
> + my $conf = $class->load_config($vmid);
> + if (PVE::QemuServer::Agent::should_sync_time_on_resume($conf->{agent})) {
> + eval { PVE::QemuServer::Agent::guest_set_time($vmid); };
> + warn "could not sync guest time after snapshot - $@" if $@;
> + }
> }
> }
> }
> @@ -444,8 +449,9 @@ sub __snapshot_rollback_hook {
> my ($class, $vmid, $conf, $snap, $prepare, $data) = @_;
>
> if ($prepare) {
> - # we save the machine of the current config
> + # we save the machine and agent of the current config
> $data->{oldmachine} = $conf->{machine};
> + $data->{agent} = $conf->{agent};
> } else {
> # if we have a 'runningmachine' entry in the snapshot we use that
> # for the forcemachine parameter, else we use the old logic
> @@ -509,7 +515,7 @@ sub __snapshot_rollback_vm_stop {
> }
>
> sub __snapshot_rollback_vm_start {
> - my ($class, $vmid, $vmstate, $data) = @_;
> + my ($class, $vmid, $vmstate, $data, $opts) = @_;
>
> my $storecfg = PVE::Storage::config();
> my $params = {
> @@ -517,6 +523,8 @@ sub __snapshot_rollback_vm_start {
> forcemachine => $data->{forcemachine},
> forcecpu => $data->{forcecpu},
> 'nets-host-mtu' => $data->{'nets-host-mtu'},
> + agent => $data->{agent},
> + 'sync-time-on-resume' => $opts->{'sync-time-on-resume'},
> };
> PVE::QemuServer::vm_start($storecfg, $vmid, $params);
> }
> diff --git a/src/PVE/QemuMigrate.pm b/src/PVE/QemuMigrate.pm
> index 8da6f15d..5846578d 100644
> --- a/src/PVE/QemuMigrate.pm
> +++ b/src/PVE/QemuMigrate.pm
> @@ -1770,6 +1770,8 @@ sub phase3_cleanup {
> $self->{errors} = 1;
> }
> }
> + $self->log('info', "synced guest clock via guest agent")
> + if PVE::QemuServer::Agent::should_sync_time_on_resume($conf->{agent});
> }
>
> if (
> diff --git a/src/PVE/QemuServer.pm b/src/PVE/QemuServer.pm
> index cdf66e89..e30e89e7 100644
> --- a/src/PVE/QemuServer.pm
> +++ b/src/PVE/QemuServer.pm
> @@ -5179,7 +5179,10 @@ sub vmconfig_update_agent {
>
> die "skip\n" if !$conf->{$opt};
>
> - my $hotplug_options = { fstrim_cloned_disks => 1 };
> + my $hotplug_options = {
> + fstrim_cloned_disks => 1,
> + 'sync-time-on-resume' => 1,
> + };
>
> my $old_agent = parse_guest_agent($conf->{agent});
> my $agent = parse_guest_agent($value);
> @@ -6006,6 +6009,19 @@ sub vm_start_nolock {
> );
> }
>
> + # $resume = suspend-to-disk, $statefile = rollback with RAM. Rollback carries the snapshot's
> + # agent setting in params; hibernate resume falls back to the live config
> + if ($resume || ($statefile && !$migratedfrom)) {
> + my $agent = $params->{agent} // $conf->{agent};
> + if (PVE::QemuServer::Agent::should_sync_time_on_resume(
> + $agent,
> + $params->{'sync-time-on-resume'},
> + )) {
> + eval { PVE::QemuServer::Agent::guest_set_time($vmid); 1 };
> + warn "could not sync guest time after resume from saved state - $@" if $@;
> + }
> + }
> +
> return $res;
> }
>
> diff --git a/src/PVE/QemuServer/Agent.pm b/src/PVE/QemuServer/Agent.pm
> index be6df443..77cd2bd6 100644
> --- a/src/PVE/QemuServer/Agent.pm
> +++ b/src/PVE/QemuServer/Agent.pm
> @@ -4,6 +4,7 @@ use v5.36;
>
> use JSON;
> use MIME::Base64 qw(decode_base64 encode_base64);
> +use Time::HiRes qw(time);
>
> use PVE::JSONSchema;
>
> @@ -52,6 +53,24 @@ our $agent_fmt = {
> optional => 1,
> default => 1,
> },
> + # FIXME: MAJOR VERSION: 10.0.0 - reconsider enabling this option as default
Unless it was already discussed that this should be done, I would say
this is a "TODO".
> + 'sync-time-on-resume' => {
> + description => "Synchronize the guest clock through QGA after operations that can leave"
I think this should be "the guest's clock". Same in other places.
> + . " the guest clock behind.",
> + verbose_description =>
> + "Whether to issue the guest-set-time QEMU guest agent command to synchronize the"
s/guest-set-time/'guest-set-time'/.
> + . " guest clock to the host's current time after operations that can leave the guest"
> + . " clock behind. This happens when resuming from hibernation, resuming from a paused"
> + . " state, taking a snapshot, after a migration, and after rolling back to a snapshot"
> + . " that includes RAM. In these cases the guest's clock may still reflect an earlier"
> + . " time. The time is only synchronized when the QEMU Guest Agent option is enabled"
> + . " in the guest's configuration and the agent is running inside of the guest. For"
> + . " resume, hibernation resume and snapshot rollback, the synchronization can be"
> + . " overridden per operation.",
> + type => 'boolean',
> + optional => 1,
> + default => 0,
> + },
> type => {
> description => "Select the agent type",
> type => 'string',
> @@ -332,4 +351,39 @@ sub guest_fs_freeze_applicable($agent_str, $vmid, $logfunc = undef) {
> return 1;
> }
>
> +=head3 should_sync_time_on_resume
> +
> +Returns whether the guest's clock should be synchronized to the host's via the QEMU Guest Agent
> +when the VM is resumed from saved state. Requires the guest agent to be enabled. An explicit
> +C<$override> (from an API parameter) takes precedence over the C<sync-time-on-resume> agent
> +config property when defined; otherwise the config value is used (defaulting to off). Does B<not>
> +check whether the agent is actually running.
> +
> +=cut
> +
> +sub should_sync_time_on_resume($agent_str, $override = undef) {
> + my $agent = parse_guest_agent($agent_str);
> + return 0 if !$agent->{enabled};
> + return $override ? 1 : 0 if defined($override);
> + return $agent->{'sync-time-on-resume'} // 0;
> +}
> +
> +=head3 guest_set_time
> +
> +Sets the guest's clock to the current host time via the QEMU Guest Agent's
> +C<guest-set-time> command. The host time is passed explicitly (as nanoseconds
> +since the UNIX epoch) because the argument-less form reads the time from the
> +guest's RTC, which isn't supported on all platforms (e.g. Windows); passing
> +the value directly works regardless of guest OS.
> +
> +=cut
> +
> +sub guest_set_time($vmid, $quiet = undef) {
> + my $time_ns = Time::HiRes::time() * 1_000_000_000;
> + my $res = PVE::QemuServer::Monitor::mon_cmd($vmid, 'guest-set-time', time => int($time_ns));
> + check_agent_error($res, "unable to set guest time");
> + print "synced guest clock via guest agent\n" if !$quiet;
> + return;
Just an observation, this `return` seems unnecessary to me, however the
thaw fn also has one.
> +}
> +
> 1;
> diff --git a/src/PVE/QemuServer/RunState.pm b/src/PVE/QemuServer/RunState.pm
> index bbbcc88e..a72aed4d 100644
> --- a/src/PVE/QemuServer/RunState.pm
> +++ b/src/PVE/QemuServer/RunState.pm
> @@ -128,7 +128,7 @@ sub vm_suspend {
> # location of the config file (source or target node) is not deterministic,
> # since migration cannot wait for pmxcfs to process the rename
> sub vm_resume {
> - my ($vmid, $skiplock, $nocheck) = @_;
> + my ($vmid, $skiplock, $nocheck, $sync_time, $quiet) = @_;
>
> PVE::QemuConfig->lock_config(
> $vmid,
> @@ -180,6 +180,15 @@ sub vm_resume {
> if $resume_cmd eq 'cont';
>
> mon_cmd($vmid, $resume_cmd);
> +
> + if (
> + $resume_cmd eq 'cont'
> + && PVE::QemuServer::Agent::should_sync_time_on_resume($conf->{agent},
> + $sync_time)
> + ) {
> + eval { PVE::QemuServer::Agent::guest_set_time($vmid, $quiet); };
> + warn "could not sync guest time after resume - $@" if $@;
> + }
> },
> );
> }
--
Maximiliano
next prev parent reply other threads:[~2026-07-02 10:57 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-07-02 10:31 [PATCH guest-common/manager/qemu-server v3 0/3] fix #5032: add guest time sync via QGA Jakob Klocker
2026-07-02 10:31 ` [PATCH pve-guest-common v3 1/3] AbstractConfig: allow passing options to snapshot_rollback Jakob Klocker
2026-07-02 10:32 ` [PATCH pve-manager v3 2/3] fix #5032: ui: qemu agent: add sync-time-on-resume option Jakob Klocker
2026-07-02 11:00 ` Maximiliano Sandoval
2026-07-02 10:32 ` [PATCH qemu-server v3 3/3] fix #5032: agent: sync guest time via QGA when the clock falls behind Jakob Klocker
2026-07-02 10:56 ` Maximiliano Sandoval [this message]
2026-07-03 7:04 ` Jakob Klocker
2026-07-03 7:37 ` Jakob Klocker
2026-07-02 15:14 ` [PATCH guest-common/manager/qemu-server v3 0/3] fix #5032: add guest time sync via QGA Maximiliano Sandoval
2026-07-03 6:33 ` Jakob Klocker
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=s8o7bnd7j7x.fsf@toolbox \
--to=m.sandoval@proxmox.com \
--cc=j.klocker@proxmox.com \
--cc=pve-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.