About

Jorge Andreu Calatayud #

Staff Site Reliability Engineer · Shropshire, UK

13 years of experience designing and operating scalable infrastructure. I work at the intersection of platform engineering, reliability, and developer experience — building the systems that let product teams move fast without breaking things.

What I work on #

Kubernetes & Platform Engineering — multi-tenant clusters, GitOps workflows, admission controllers, operator design
Observability — OpenTelemetry pipelines, distributed tracing, cost-aware metrics collection, log aggregation
Security & Secrets Management — SOPS, Vault, secrets rotation at scale, RBAC design, IP allowlisting
Infrastructure & Networking — Traefik, MetalLB, Cilium, AWS, service mesh

Stack #

Kubernetes Helm Helmfile SOPS Traefik KEDA OpenTelemetry Prometheus Grafana AWS Cilium k3s ArgoCD Terraform

Links #

This notebook documents real-world engineering problems and how I solve them — not step-by-step tutorials, but the reasoning, trade-offs, and lessons from operating systems in production.