From: Wolfgang Bumiller <w.bumiller@proxmox.com>
To: Lukas Wagner <l.wagner@proxmox.com>
Cc: pdm-devel@lists.proxmox.com
Subject: Re: [PATCH datacenter-manager 1/4] add persistent, generic, namespaced key-value cache implementation
Date: Tue, 12 May 2026 14:29:15 +0200 [thread overview]
Message-ID: <3b7fegzovd2vvukjsctqba73qtdhize7s2xchoko7dq5xngzp6@2x7dunsotkik> (raw)
In-Reply-To: <20260508150330.363622-2-l.wagner@proxmox.com>
On Fri, May 08, 2026 at 05:03:27PM +0200, Lukas Wagner wrote:
> Namespaces map to directories inside a base directory. Cache contents
> are stored as a single JSON-file per key inside the namespace directory.
>
> Value expiry is implemented via a max_age parameter when retrieving
> values. Callers might have different requirements to the freshness of
> entries, so this seemed a better fit than statically setting an expiry
> time when *setting* the value.
>
> The current implementation is pretty naive about namespace names and key
> names; these must come from verified, path-safe identifiers, as they are
> used directly as components of the path of the resulting directories and
> JSON files. This should be fixed before using this implementation
> elsewhere.
>
> Signed-off-by: Lukas Wagner <l.wagner@proxmox.com>
> ---
>
> Notes:
> This module might be well suited to be moved to proxmox.git when it has
> stabilized enough. Kept here for now to ease development.
>
> Cargo.toml | 1 +
> server/Cargo.toml | 2 +
> server/src/lib.rs | 1 +
> server/src/namespaced_cache.rs | 324 +++++++++++++++++++++++++++++++++
> 4 files changed, 328 insertions(+)
> create mode 100644 server/src/namespaced_cache.rs
>
> diff --git a/Cargo.toml b/Cargo.toml
> index 9806a4f0..5cf05b41 100644
> --- a/Cargo.toml
> +++ b/Cargo.toml
> @@ -125,6 +125,7 @@ serde = { version = "1.0", features = ["derive"] }
> serde_cbor = "0.11.1"
> serde_json = "1.0"
> serde_plain = "1"
> +tempfile = "3.15"
> thiserror = "1.0"
> tokio = "1.6"
> tracing = "0.1"
> diff --git a/server/Cargo.toml b/server/Cargo.toml
> index 3f185bbc..6d6dd583 100644
> --- a/server/Cargo.toml
> +++ b/server/Cargo.toml
> @@ -29,6 +29,8 @@ openssl.workspace = true
> percent-encoding.workspace = true
> serde.workspace = true
> serde_json.workspace = true
> +tempfile.workspace = true
> +thiserror.workspace = true
> tokio = { workspace = true, features = [ "fs", "io-util", "io-std", "macros", "net", "parking_lot", "process", "rt", "rt-multi-thread", "signal", "time" ] }
> tracing.workspace = true
> url.workspace = true
> diff --git a/server/src/lib.rs b/server/src/lib.rs
> index 5ed10d69..0b7642ab 100644
> --- a/server/src/lib.rs
> +++ b/server/src/lib.rs
> @@ -7,6 +7,7 @@ pub mod context;
> pub mod env;
> pub mod jobstate;
> pub mod metric_collection;
> +pub mod namespaced_cache;
> pub mod parallel_fetcher;
> pub mod remote_cache;
> pub mod remote_tasks;
> diff --git a/server/src/namespaced_cache.rs b/server/src/namespaced_cache.rs
> new file mode 100644
> index 00000000..89ed5b7f
> --- /dev/null
> +++ b/server/src/namespaced_cache.rs
> @@ -0,0 +1,324 @@
> +//! Generic namespaced cache implementation with optional value expiry.
> +//!
> +//! NOTE:
> +//! The current implementation is pretty naive about namespace names and key
> +//! names; these must come from verified, path-safe identifiers, as they are
> +//! used directly as components of the path of the resulting directories and
> +//! JSON files. This should be fixed before using this implementation
> +//! elsewhere.
> +
> +use std::fs::File;
> +use std::io::ErrorKind;
> +use std::path::{Path, PathBuf};
> +use std::time::Duration;
> +
> +use proxmox_sys::fs::CreateOptions;
> +use serde::{de::DeserializeOwned, Deserialize, Serialize};
> +
> +#[derive(thiserror::Error, Debug)]
> +pub enum CacheError {
> + #[error("IO error: {0}")]
> + Io(#[from] std::io::Error),
> +
> + #[error("serialization error: {0}")]
> + Serde(#[from] serde_json::Error),
> +
> + #[error("error: {0}")]
> + Other(#[from] anyhow::Error),
> +}
> +
> +#[derive(Serialize, Deserialize, Debug)]
> +struct CacheEntry<T> {
> + timestamp: i64,
> + value: T,
> +}
> +
> +impl<T> CacheEntry<T> {
> + fn is_expired(&self, now: i64, max_age: i64) -> bool {
> + if max_age == 0 {
> + return true;
> + }
> +
> + let diff = now - self.timestamp;
> + diff >= max_age || diff < 0
> + }
> +}
> +
> +pub struct NamespacedCache {
> + base_path: PathBuf,
> + file_options: CreateOptions,
> + dir_options: CreateOptions,
> +}
> +
> +impl NamespacedCache {
> + /// Create a new cache instance.
> + pub fn new<P: AsRef<Path>>(
Consider `Into<PathBuf>` - since you force a clone on `PathBuf` now.
> + base_path: P,
> + dir_options: CreateOptions,
> + file_options: CreateOptions,
> + ) -> Self {
> + Self {
> + base_path: base_path.as_ref().into(),
> + dir_options,
> + file_options,
> + }
> + }
> +
> + /// Lock a namespace for writing.
> + pub fn write(
> + &self,
> + namespace: &str,
> + timeout: Duration,
> + ) -> Result<WritableCacheNamespace, CacheError> {
> + let path = get_lockfile(&self.base_path, namespace);
> +
> + let lock = proxmox_sys::fs::open_file_locked(&path, timeout, true, self.file_options)?;
> +
> + Ok(WritableCacheNamespace {
> + _lock: lock,
> + namespace: namespace.to_string(),
> + base_path: self.base_path.clone(),
> + dir_options: self.dir_options,
> + file_options: self.file_options,
> + })
> + }
> +
> + /// Lock a namespace for reading.
> + pub fn read(
> + &self,
> + namespace: &str,
> + timeout: Duration,
> + ) -> Result<ReadableCacheNamespace, CacheError> {
> + let path = get_lockfile(&self.base_path, namespace);
> +
> + let lock = proxmox_sys::fs::open_file_locked(&path, timeout, false, self.file_options)?;
> +
> + Ok(ReadableCacheNamespace {
> + _lock: lock,
> + namespace: namespace.to_string(),
> + base_path: self.base_path.clone(),
> + })
> + }
> +}
> +
> +/// A readable cache namespace.
> +pub struct ReadableCacheNamespace {
> + _lock: File,
> + namespace: String,
> + base_path: PathBuf,
> +}
> +
> +impl ReadableCacheNamespace {
> + /// Read a value from the cache.
> + ///
> + /// # Errors:
> + /// - The file associated with this key could not be read
> + /// - The file could not be deserialized (e.g. invalid format)
> + pub fn get<T: Serialize + DeserializeOwned>(&self, key: &str) -> Result<Option<T>, CacheError> {
> + get_impl(&self.base_path, &self.namespace, key, None)
> + }
> +
> + /// Read a value from the cache, given a maximum age of the cache entry.
> + ///
> + /// # Errors:
> + /// - The file associated with this key could not be read
> + /// - The file could not be deserialized (e.g. invalid format)
> + pub fn get_with_max_age<T: Serialize + DeserializeOwned>(
> + &self,
> + key: &str,
> + max_age: i64,
> + ) -> Result<Option<T>, CacheError> {
> + get_impl(&self.base_path, &self.namespace, key, Some(max_age))
> + }
> +}
> +
> +/// A writable cache namespace.
> +pub struct WritableCacheNamespace {
> + _lock: File,
> + namespace: String,
> + base_path: PathBuf,
> + dir_options: CreateOptions,
> + file_options: CreateOptions,
> +}
> +
> +impl WritableCacheNamespace {
> + /// Remote a cache entry.
> + ///
> + /// This returns `Ok(())` if the key does not exist.
> + ///
> + /// # Errors:
> + /// - The file could not be deleted due to insufficient privileges.
> + pub fn remove(&self, key: &str) -> Result<(), CacheError> {
> + let path = get_path(&self.base_path, &self.namespace, key);
> +
> + if let Err(err) = std::fs::remove_file(path) {
> + if err.kind() == ErrorKind::NotFound {
> + return Ok(());
> + }
> +
> + return Err(err.into());
> + }
> +
> + Ok(())
> + }
> +
> + /// Set a cache entry.
> + ///
> + /// # Errors
> + /// - `value` could not be serialized
> + /// - The namespace directory could not be created
> + /// - The cache file could not be written to or atomically replaced
> + pub fn set<T: Serialize + DeserializeOwned>(
> + &self,
> + key: &str,
> + entry: &T,
> + ) -> Result<(), CacheError> {
> + self.set_with_timestamp(key, entry, proxmox_time::epoch_i64())
> + }
> +
> + /// Set a cache entry with an explicitly provided timestamp.
> + ///
> + /// # Errors
> + /// - `value` could not be serialized
> + /// - The namespace directory could not be created
> + /// - The cache file could not be written to or atomically replaced
> + pub fn set_with_timestamp<T: Serialize + DeserializeOwned>(
> + &self,
> + key: &str,
> + value: &T,
> + timestamp: i64,
> + ) -> Result<(), CacheError> {
> + let path = get_path(&self.base_path, &self.namespace, key);
> +
> + proxmox_sys::fs::create_path(
> + path.parent().unwrap(),
> + Some(self.dir_options),
> + Some(self.dir_options),
> + )?;
> +
> + let entry = CacheEntry { timestamp, value };
> +
> + let data = serde_json::to_vec(&entry)?;
> + proxmox_sys::fs::replace_file(path, &data, self.file_options, true)?;
> +
> + Ok(())
> + }
> +
> + /// Read a value from the cache.
> + ///
> + /// # Errors:
> + /// - The file associated with this key could not be read
> + /// - The file could not be deserialized (e.g. invalid format)
> + pub fn get<T: Serialize + DeserializeOwned>(&self, key: &str) -> Result<Option<T>, CacheError> {
> + get_impl(&self.base_path, &self.namespace, key, None)
> + }
> +
> + /// Read a value from the cache, given a maximum age of the cache entry.
> + ///
> + /// # Errors:
> + /// - The file associated with this key could not be read
> + /// - The file could not be deserialized (e.g. invalid format)
> + pub fn get_with_max_age<T: Serialize + DeserializeOwned>(
> + &self,
> + key: &str,
> + max_age: i64,
> + ) -> Result<Option<T>, CacheError> {
> + get_impl(&self.base_path, &self.namespace, key, Some(max_age))
> + }
> +}
> +
> +fn get_impl<T: Serialize + DeserializeOwned>(
> + base: &Path,
> + namespace: &str,
> + key: &str,
> + max_age: Option<i64>,
> +) -> Result<Option<T>, CacheError> {
> + let path = get_path(base, namespace, key);
> + let content = proxmox_sys::fs::file_read_optional_string(path)?;
> +
> + if let Some(content) = content {
> + let val = serde_json::from_str::<CacheEntry<T>>(&content)?;
> +
> + if let Some(max_age) = max_age {
> + if val.is_expired(proxmox_time::epoch_i64(), max_age) {
> + return Ok(None);
> + }
> + }
> + return Ok(Some(val.value));
> + }
> +
> + return Ok(None);
> +}
> +
> +fn get_path(base: &Path, namespace: &str, key: &str) -> PathBuf {
> + let mut path = base.join(namespace).join(key);
> + path.set_extension("json");
^ I'd rather join base with `format!("{key}.json")`, otherwise if a
`key` contains dots, it'll lose its final component.
Alternatively we could just skip the extension altogether.
Ah and I think we might want to `panic` (or fail) if
`namespace.starts_with('/')`, that would cause discard `base` to be
discarded! (I'm leaning towards panicking since this isn't something
that should ever come from the outside (I hope…), at least currently
it's all `const`s…)
> + path
> +}
> +
> +fn get_lockfile(base: &Path, namespace: &str) -> PathBuf {
> + let path = base.join(format!(".{namespace}.lock"));
> + path
> +}
> +
> +#[cfg(test)]
> +mod tests {
> + use tempfile::TempDir;
> +
> + use super::*;
> +
> + fn make_cache() -> (TempDir, NamespacedCache) {
> + let dir = tempfile::tempdir().unwrap();
(I wonder if we should use `tempdir_in` and pass `$OUT_DIR` or some
other dir cargo provides via env vars for these things?)
> +
> + let cache = NamespacedCache::new(dir.as_ref(), CreateOptions::new(), CreateOptions::new());
> +
> + (dir, cache)
> + }
> +
> + #[test]
> + fn test_cache() {
> + let (_dir, cache) = make_cache();
> +
> + let write_guard = cache.write("remote-a", Duration::from_secs(1)).unwrap();
> + write_guard.set("val1", &1).unwrap();
> + write_guard.set("val2", &1).unwrap();
> +
> + assert_eq!(write_guard.get::<i32>("val1").unwrap().unwrap(), 1);
> +
> + write_guard.remove("val1").unwrap();
> + assert!(write_guard.get::<String>("val1").unwrap().is_none());
> +
> + drop(write_guard);
> +
> + let read_guard = cache.read("remote-a", Duration::from_secs(1)).unwrap();
> +
> + assert_eq!(read_guard.get::<i32>("val2").unwrap().unwrap(), 1);
> + }
> +
> + #[test]
> + fn test_delete_nonexisting() {
> + let (_dir, cache) = make_cache();
> +
> + let a = cache.write("remote-a", Duration::from_secs(1)).unwrap();
> + a.set("val", &1).unwrap();
> +
> + // Deleting a key that does not exist is okay and should not error.
> + assert!(a.remove("val").is_ok());
> + }
> +
> + #[test]
> + fn test_expiration() {
> + let entry = CacheEntry {
> + value: (),
> + timestamp: 1000,
> + };
> +
> + assert!(!entry.is_expired(1000, 100));
> + assert!(!entry.is_expired(1099, 100));
> + assert!(entry.is_expired(1100, 100));
> + assert!(entry.is_expired(1101, 100));
> +
> + // if max-age is 0, the entry is never fresh
> + assert!(entry.is_expired(1000, 0));
> + }
> +}
> --
> 2.47.3
next prev parent reply other threads:[~2026-05-12 12:29 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-08 15:03 [RFC datacenter-manager 0/4] add generic, per-remote (and global) cache for remote API responses Lukas Wagner
2026-05-08 15:03 ` [PATCH datacenter-manager 1/4] add persistent, generic, namespaced key-value cache implementation Lukas Wagner
2026-05-12 12:29 ` Wolfgang Bumiller [this message]
2026-05-13 8:45 ` Lukas Wagner
2026-05-08 15:03 ` [PATCH datacenter-manager 2/4] add pdm_cache cache as a specialized wrapper around the namespaced cache Lukas Wagner
2026-05-08 15:03 ` [PATCH datacenter-manager 3/4] api: resources: subscriptions: switch over to pdm_cache Lukas Wagner
2026-05-08 15:03 ` [PATCH datacenter-manager 4/4] remote-updates: " Lukas Wagner
2026-05-11 12:37 ` [RFC datacenter-manager 0/4] add generic, per-remote (and global) cache for remote API responses Dominik Csapak
2026-05-12 8:39 ` Lukas Wagner
2026-05-13 13:56 ` superseded: " Lukas Wagner
-- strict thread matches above, loose matches on Subject: below --
2026-05-13 13:54 [PATCH " Lukas Wagner
2026-05-13 13:54 ` [PATCH datacenter-manager 1/4] add persistent, generic, namespaced key-value cache implementation Lukas Wagner
2026-05-15 9:06 ` Thomas Lamprecht
2026-05-15 9:19 ` Lukas Wagner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3b7fegzovd2vvukjsctqba73qtdhize7s2xchoko7dq5xngzp6@2x7dunsotkik \
--to=w.bumiller@proxmox.com \
--cc=l.wagner@proxmox.com \
--cc=pdm-devel@lists.proxmox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox