From: Hannes Laimer <h.laimer@proxmox.com>
To: Proxmox Backup Server development discussion
<pbs-devel@lists.proxmox.com>,
"Max R. Carrara" <m.carrara@proxmox.com>
Subject: Re: [pbs-devel] [PATCH proxmox master v1] fix #6738: rest-server: improve TLS handshake error logging
Date: Fri, 7 Nov 2025 15:24:42 +0100 [thread overview]
Message-ID: <b7be5976-bf7c-49ea-808a-5816f9306603@proxmox.com> (raw)
In-Reply-To: <20250910154041.1032251-1-m.carrara@proxmox.com>
comment + small nit inline
On 9/10/25 17:40, Max R. Carrara wrote:
> Right now, when a client connects to the server, we continuously peek
> into the TCP stream's buffer until we can perform the TLS handshake
> check. If the client closes the connection after sending < 5 bytes, we
> exit the peek loop. If the client instead keeps the connection open,
> the future times out if the handshake check isn't made.
>
> Instead of exiting with `ErrorKind::ConnectionAborted`, exit with
> `ErrorKind::Other` instead, as that variant isn't ever returned by
> stdlib functions. That way we can be sure we don't accidentally ignore
> an error.
>
I don't think this is accurate, stdlib maps things it doesn't have an
ErrorKind for to ::Other [1]
Could we maybe have something like
```
#[derive(Debug, Error)]
#[error("client stopped sending data while waiting for TLS handshake
bytes")]
struct HandshakeNoProgress;
...
if peek_len == previous_peek_len {
Err(HandshakeNoProgress.into())
} else {
...
if err.downcast_ref::<HandshakeNoProgress>().is_some() {
return;
}
log::error!("[{peer}] TLS handshake error: {err:?}");
```
note: did not test this. Like that we could avoid unnecessary logging,
but don't lose out on potentially actual errors.
[1]
https://github.com/rust-lang/rust/blob/96064126a086a8428d66e07fb3b91421bb86a512/library/std/src/sys/pal/unix/mod.rs#L235
> Reduce the amount of log noise by ignoring our `ErrorKind::Other`
> errors, as those are usually only returned when some monitoring
> solution opens and closes a TCP connection for its healthchecks [0].
>
> Log timeouts without their error chain (like before), as
> there isn't any additional context for them.
>
> Because any other kinds of errors should be sufficiently rare, log
> them with their error chain. The error chain is formatted to stay in
> one line, as the default formatting looks quite awkward otherwise.
>
> Make the context added to the `async_io()` coroutine call more
> general and informative, as not only the stream buffer peeking can
> fail. Shorten the general log message as well.
>
> [0]: https://forum.proxmox.com/threads/subject-intermittent-backup-errors-and-repeated-tls-handshake-failures.170089/post-795100
>
> Fixes: #6738
> Signed-off-by: Max R. Carrara <m.carrara@proxmox.com>
> ---
>
> Tested this interactively in my Python console by opening a raw TCP
> stream using the `socket` module.
>
> proxmox-rest-server/src/connection.rs | 29 ++++++++++++++++++++++++---
> 1 file changed, 26 insertions(+), 3 deletions(-)
>
> diff --git a/proxmox-rest-server/src/connection.rs b/proxmox-rest-server/src/connection.rs
> index 9511b7cb..81c0e05e 100644
> --- a/proxmox-rest-server/src/connection.rs
> +++ b/proxmox-rest-server/src/connection.rs
> @@ -464,7 +464,28 @@ impl AcceptBuilder {
> }
> }
> Err(err) => {
> - log::error!("[{peer}] failed to check for TLS handshake: {err}");
> + // Ignore io::Errors of kind "Other", as that means the client didn't send any
> + // additional data between stream buffer peeks in our case
> + if let Some(io::ErrorKind::Other) =
> + err.downcast_ref::<io::Error>().map(io::Error::kind)
> + {
> + return;
> + }
> +
> + // Timeouts are printed without their cause
> + if err.downcast_ref::<tokio::time::error::Elapsed>().is_some() {
> + log::error!("[{peer}] TLS handshake error: {err}");
> + return;
> + }
> +
> + // As other kinds of errors are relatively rare, print them with their error chain
> + let mut err_chain = err.chain().map(ToString::to_string);
> + let err_text = err_chain.by_ref().take(1).collect::<Vec<_>>().join("");
> + let cause = err_chain.collect::<Vec<_>>().join(" - ");
> +
> + log::error!(
> + "[{peer}] TLS handshake error: {err_text} - Caused by: {cause}"
`cause` could be empty, no? we should probably skip the ' - Caused by: '
in that case
> + );
> }
> }
> }
> @@ -505,7 +526,7 @@ impl AcceptBuilder {
> // if we detect the same peek len again but still got a readable stream,
> // the connection was probably closed, so abort here
> if peek_len == previous_peek_len {
> - Err(io::ErrorKind::ConnectionAborted.into())
> + Err(io::ErrorKind::Other.into())
> } else {
> previous_peek_len = peek_len;
> Err(io::ErrorKind::WouldBlock.into())
> @@ -516,7 +537,9 @@ impl AcceptBuilder {
> }
> })
> .await
> - .context("couldn't peek into incoming TCP stream")
> + .context(
> + "failed to determine whether the peer is initiating a TLS handshake or not",
> + )
> };
>
> tokio::time::timeout(timeout, future)
_______________________________________________
pbs-devel mailing list
pbs-devel@lists.proxmox.com
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pbs-devel
prev parent reply other threads:[~2025-11-07 14:24 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-09-10 15:40 Max R. Carrara
2025-11-07 13:03 ` Max R. Carrara
2025-11-07 14:24 ` Hannes Laimer [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b7be5976-bf7c-49ea-808a-5816f9306603@proxmox.com \
--to=h.laimer@proxmox.com \
--cc=m.carrara@proxmox.com \
--cc=pbs-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.