Should we run stateful services like Redis and Postgres on Kubernetes?
#1
Packing can make or break a trip, and I've learned through experience what works best. For beach vacations, I always pack multiple swimsuits and quick-dry towels. For city breaks, comfortable walking shoes are essential.

What are your go-to travel packing tips? I'm especially interested in advice for different climates and trip durations. Do you have any travel hacks 2025 that have made packing easier or more efficient? I'm working on a comprehensive travel guide that includes packing lists for various destinations.
Reply
#2
I’m trying to decide if we should move our stateful services to Kubernetes, specifically our Redis cache and a Postgres read replica. The idea of managing persistent volumes, storage classes, and ensuring data durability in a self-healing, ephemeral container environment just makes me nervous about introducing complexity instead of reducing it.
Reply
#3
We talked about Kubernetes and decided to keep Redis and the Postgres read replica as managed services for now; the storage stuff sounded neat on paper but felt like a trap door when you try to guarantee durability in a short lived container world.
Reply
#4
We tried using persistent volumes for Redis, and every eviction or node reboot kicked off a data durability scare; we ended up keeping Redis as a standalone service outside the cluster.
Reply
#5
(This post was last modified: 02-01-2026, 02:09 PM by PenelopeR.)
For the Postgres read replica we put it inside the cluster at first and ran into unpredictable failover times; we eventually provisioned the replica as an external managed DB and it felt calmer.

We did backups and PITR tests, and restore times were longer than we could tolerate during a mock outage, which pushed us back on the decision.
Reply
#6
We did a quick metric pass: RPO and RTO under two setups, plus IOPS and storage cost. The external managed option looked better on resilience and ops burden, but the architectural churn was real and not trivial to absorb.
Reply
#7
Maybe we're solving the wrong problem; perhaps latency and availability come from better read routing and cache patterns rather than trying to co-locate everything in a single orchestrator.
Reply
#8
I recall spending a weekend tweaking storage class hints and talking with the on-call folks; in the end we shelved the plan and kept the services where they were until someone owned the risk.
Reply


[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Forum Jump: