Configuration

Recached is configured entirely through environment variables. There is no config file.

Environment variable reference

Variable	Default	Description
`RECACHED_PASSWORD`	(none)	Require clients to authenticate with `AUTH <password>`. If unset, the server accepts connections without authentication. After 5 consecutive failed `AUTH` attempts, the connection is closed.
`RECACHED_ALLOW_IPS`	(allow all)	Comma-separated list of IP addresses allowed to connect. Any connection from an IP not in the list is immediately closed. Invalid entries are logged and skipped.
`RECACHED_MAX_KEYS`	(unlimited)	Maximum number of keys in the store. When this limit is reached, behavior depends on `RECACHED_EVICTION`. If set to `noeviction` (the default), write commands that would exceed the cap return an error.
`RECACHED_EVICTION`	`noeviction`	Eviction policy when `RECACHED_MAX_KEYS` is reached. See eviction policies below.
`RECACHED_METRICS_PORT`	`9091`	Port for the Prometheus metrics HTTP server. Metrics are available at `/metrics`. Set to `0` to disable.
`RECACHED_SAVE_PATH`	`recached.rdb`	Path to the snapshot file. The server loads this file on startup and writes to it on `SAVE`, `BGSAVE`, autosave, and clean shutdown.
`RECACHED_SAVE`	(none)	Multi-condition autosave policy as comma-separated `seconds:changes` pairs. A snapshot is triggered when any condition is satisfied: `elapsed_since_last_save >= seconds` and `dirty_writes >= changes`. Example: `"900:1,300:10,60:10000"` — save after 1 write in 15 min, 10 writes in 5 min, or 10 000 writes in 1 min. When set, `RECACHED_SAVE_INTERVAL` is ignored. Skips saves when no writes have occurred since the last snapshot.
`RECACHED_SAVE_INTERVAL`	`900`	Autosave interval in seconds (single-condition fallback when `RECACHED_SAVE` is not set). The server saves automatically at this interval if at least one write has occurred since the last save. Set to `0` to disable autosave entirely (manual `SAVE`/`BGSAVE` still work).
`RECACHED_AOF_PATH`	(disabled)	Path to the append-only file. When set, every write command is appended to this file in addition to snapshot saves. On startup the snapshot is loaded first, then AOF commands are replayed for the delta. The AOF is truncated after each successful snapshot save.
`RECACHED_AOF_SYNC`	`everysec`	AOF fsync policy. `always`: fsync after every write (safest, slowest). `everysec`: fsync once per second (default, good balance). `no`: let the OS decide (fastest, least safe).
`RECACHED_MAX_CONNECTIONS`	`1024`	Maximum number of concurrent client connections (TCP + WebSocket combined). New connections are dropped when the limit is reached.
`RECACHED_REPL_PORT`	`6381`	TCP port the primary listens on for incoming replica connections. Only active when `RECACHED_REPLICAOF` is not set (i.e. this server is a primary).
`RECACHED_REPLICAOF`	(none)	Set to `host:port` to run this server as a read-only replica. On startup it connects to the primary, receives a full snapshot, and then streams all subsequent writes. Reconnects automatically with exponential backoff on disconnect.
`RECACHED_REPL_PASSWORD`	(none)	Shared secret for the replication channel. When set, replicas must send this password during the handshake before receiving any data. Must match on both primary and replica. Strongly recommended for any network-exposed replication port.
`RECACHED_FAILOVER_TIMEOUT`	(disabled)	Seconds a replica waits with the primary unreachable before automatically promoting itself to primary. Set on replicas only. `0` or unset disables auto-failover — the replica reconnects indefinitely. See auto-failover below.
`RECACHED_REPL_BUFFER`	`4096`	Per-replica channel capacity (number of pending write frames buffered on the primary before a lagging replica is disconnected). A replica that falls this many writes behind is dropped and must reconnect from a fresh snapshot. Increase if replicas are on a consistently slow or high-latency link; decrease to reduce memory use per replica.
`RECACHED_MAX_MEMORY`	(unlimited)	Maximum approximate heap usage for the key-value store. Accepts a byte count or a human-readable suffix: `512mb`, `2gb`, `1073741824`. When the limit is exceeded, the background eviction loop runs the configured eviction policy. Has no effect when `RECACHED_EVICTION` is `noeviction`.
`RECACHED_TLS_CERT`	(none)	Path to a PEM-encoded TLS certificate file. TLS is enabled on both ports when this and `RECACHED_TLS_KEY` are set.
`RECACHED_TLS_KEY`	(none)	Path to a PEM-encoded TLS private key file. If either `RECACHED_TLS_CERT` or `RECACHED_TLS_KEY` is missing, the server falls back to plain TCP/WS.
`RUST_LOG`	`info`	Log level. Accepts `error`, `warn`, `info`, `debug`, `trace`. Module-specific: `RUST_LOG=recached=debug,tokio=warn`.

Eviction policies

Eviction runs when RECACHED_MAX_KEYS is reached and a write command would add a new key.

Policy	Behavior
`noeviction`	Write commands return an error when the key cap is reached. Existing keys are never evicted. Default behavior.
`lru`	Evicts the least-recently-used key. Applies to all keys regardless of TTL.
`allkeys-random`	Evicts a randomly selected key. Lower overhead than LRU.
`volatile-lru`	Evicts the least-recently-used key that has a TTL set. If no keys have a TTL, falls back to `noeviction`.
`volatile-ttl`	Evicts the key with the shortest remaining TTL. Prioritizes keys that are closest to expiring. Falls back to `noeviction` if no keys have a TTL.

For most applications, lru is the right default when a key cap is configured.

Common configurations

Development (no auth, verbose logging)

bash

RUST_LOG=debug recached-server

Production with auth and key cap

bash

RECACHED_PASSWORD="a-strong-random-secret" \
RECACHED_MAX_KEYS="1000000" \
RECACHED_EVICTION="lru" \
RECACHED_METRICS_PORT="9091" \
RUST_LOG="info" \
recached-server

With IP allowlist (restrict to local network)

bash

RECACHED_PASSWORD="secret" \
RECACHED_ALLOW_IPS="127.0.0.1,10.0.0.0/8,192.168.1.100" \
recached-server

Note: RECACHED_ALLOW_IPS accepts individual IP addresses. CIDR notation support depends on the version — check the release notes. If only specific hosts should connect, prefer TLS + auth over IP allowlisting in cloud environments where IPs rotate.

With TLS (secure RESP and WSS)

Generate a self-signed certificate for local testing:

bash

openssl req -x509 -newkey rsa:4096 -keyout key.pem -out cert.pem \
  -days 365 -nodes -subj "/CN=localhost"

Start with TLS:

bash

RECACHED_TLS_CERT="./cert.pem" \
RECACHED_TLS_KEY="./key.pem" \
RECACHED_PASSWORD="secret" \
recached-server

Clients connecting over RESP must now use rediss:// (RESP over TLS). Browser clients use wss:// instead of ws://.

typescript

// Backend
const cache = new Redis('rediss://127.0.0.1:6379')

typescript

// Browser
import { createCache } from 'recached-edge'
const cache = await createCache({
  connect: { url: 'wss://your.domain:6380' },
})

For a production certificate, use Let's Encrypt via certbot or provide the cert/key from your certificate authority.

Prometheus metrics scrape

bash

RECACHED_METRICS_PORT="9091" recached-server

Configure your Prometheus instance to scrape:

yaml

scrape_configs:
  - job_name: recached
    static_configs:
      - targets: ['localhost:9091']
    metrics_path: /metrics
    scrape_interval: 15s

Available metrics include key count, command latency histograms, active connections, and WebSocket client count.

With AOF + snapshot (strong durability)

Combining snapshots with an append-only file means you lose at most a few writes on crash, rather than up to 15 minutes of writes with snapshots alone.

bash

RECACHED_SAVE_PATH="/data/recached.rdb" \
RECACHED_AOF_PATH="/data/recached.aof" \
RECACHED_AOF_SYNC="everysec" \
recached-server

On startup: snapshot is loaded first, then any AOF commands written after the snapshot are replayed. The AOF is automatically truncated after each successful snapshot save.

With leader-follower replication

Run a primary and one or more read-only replicas. Replicas receive a full snapshot on connect, then stream every subsequent write in real time.

bash

# Primary (default replication port 6381, auth required)
RECACHED_SAVE_PATH="/data/recached.rdb" \
RECACHED_REPL_PORT="6381" \
RECACHED_REPL_PASSWORD="repl-secret" \
recached-server

# Replica (connects to primary, rejects writes)
RECACHED_REPLICAOF="primary-host:6381" \
RECACHED_REPL_PASSWORD="repl-secret" \
recached-server

Replicas reconnect automatically with exponential backoff (2s → 4s → … → 30s cap) if the primary is temporarily unavailable. Write commands sent to a replica return -READONLY.

Each replica has an internal write buffer (default 4096 frames, set by RECACHED_REPL_BUFFER). If a replica falls that many writes behind — due to a slow network or an overloaded replica host — it is disconnected and must reconnect from a fresh snapshot. This prevents a lagging replica from consuming unbounded memory on the primary or blocking the primary's write path.

To promote a replica to primary at runtime (manual failover), send REPLICAOF NO ONE over any RESP connection to the replica. It immediately starts accepting writes.

With auto-failover

Set RECACHED_FAILOVER_TIMEOUT on the replica. If the primary is unreachable for that many seconds, the replica promotes itself automatically without any manual intervention.

bash

# Primary
RECACHED_SAVE_PATH="/data/recached.rdb" \
RECACHED_REPL_PORT="6381" \
RECACHED_REPL_PASSWORD="repl-secret" \
recached-server

# Replica — promotes itself after 30s of primary being unreachable
RECACHED_REPLICAOF="primary-host:6381" \
RECACHED_REPL_PASSWORD="repl-secret" \
RECACHED_FAILOVER_TIMEOUT="30" \
recached-server

How the timer works. The RECACHED_FAILOVER_TIMEOUT clock starts the first time a connect attempt fails or the live sync stream drops. It resets to zero as soon as the replica successfully reconnects to the primary. This means a brief primary restart (e.g. a rolling deploy) that completes before the timeout elapses does not trigger promotion.

Split-brain risk. Auto-failover is safe in a single-replica setup. With multiple replicas, two replicas could both time out simultaneously and both promote — creating two independent primaries accepting diverging writes. To avoid this:

Use RECACHED_FAILOVER_TIMEOUT only on a designated standby replica, not all replicas.
Keep the timeout long enough (≥ 2× your typical primary restart time) to avoid spurious promotion on routine restarts.
After a failover event, update clients and other replicas to point at the new primary before bringing the old primary back online.

With multi-condition autosave (RECACHED_SAVE)

Fine-grained save policy that triggers on the first matching condition:

bash

RECACHED_SAVE_PATH="/data/recached.rdb" \
RECACHED_SAVE="900:1,300:10,60:10000" \
recached-server

This matches Redis's default save policy: save after 1 write in 15 min, 10 writes in 5 min, or 10 000 writes in 1 min. Saves are skipped automatically when no writes have occurred since the last snapshot, so idle servers incur zero I/O.

With snapshot persistence

By default the server saves a snapshot every 15 minutes to recached.rdb in the working directory. On startup it restores from that file automatically.

bash

# Custom path and 5-minute autosave
RECACHED_SAVE_PATH="/var/lib/recached/dump.rdb" \
RECACHED_SAVE_INTERVAL="300" \
recached-server

bash

# Disable autosave — trigger saves manually with BGSAVE
RECACHED_SAVE_PATH="/data/recached.rdb" \
RECACHED_SAVE_INTERVAL="0" \
recached-server

The snapshot file is written atomically: the server writes to a .tmp file and renames it into place, so a crash mid-save cannot corrupt the previous snapshot. Expired keys are silently skipped on restore. On a clean shutdown (SIGTERM or Ctrl-C), a final snapshot is saved before the process exits.

High-connection workloads

The server accepts up to 1024 concurrent connections by default (enforced with a connection semaphore), configurable via RECACHED_MAX_CONNECTIONS:

bash

RECACHED_MAX_CONNECTIONS="4096" recached-server

For most web applications, 1024 concurrent connections to the cache server is more than enough. Browser clients each hold one WebSocket connection; backend services typically hold a small connection pool (2–10 connections).

Notes on sensitive configuration

Never commit RECACHED_PASSWORD to source control. Use an environment file (see Installation — systemd service) or a secrets manager (Vault, AWS Secrets Manager, Doppler).
The password is compared in constant time to prevent timing attacks, but the brute-force lockout (5 failed attempts → disconnect) is the primary protection. Use a long random password.
TLS is strongly recommended for any deployment where the cache server is reachable over a network that you do not fully control. Without TLS, RECACHED_PASSWORD is sent in plaintext on initial AUTH.

Configuration ​

Environment variable reference ​

Eviction policies ​

Common configurations ​

Development (no auth, verbose logging) ​

Production with auth and key cap ​

With IP allowlist (restrict to local network) ​

With TLS (secure RESP and WSS) ​

Prometheus metrics scrape ​

With AOF + snapshot (strong durability) ​

With leader-follower replication ​

With auto-failover ​

With multi-condition autosave (RECACHED_SAVE) ​

With snapshot persistence ​

High-connection workloads ​

Notes on sensitive configuration ​

Configuration

Environment variable reference

Eviction policies

Common configurations

Development (no auth, verbose logging)

Production with auth and key cap

With IP allowlist (restrict to local network)

With TLS (secure RESP and WSS)

Prometheus metrics scrape

With AOF + snapshot (strong durability)

With leader-follower replication

With auto-failover

With multi-condition autosave (RECACHED_SAVE)

With snapshot persistence

High-connection workloads

Notes on sensitive configuration