Jorge Andreu Calatayud #
Staff Site Reliability Engineer · Shropshire, UK
13 years of experience designing and operating scalable infrastructure. I work at the intersection of platform engineering, reliability, and developer experience — building the systems that let product teams move fast without breaking things.
What I work on #
- Kubernetes & Platform Engineering — multi-tenant clusters, GitOps workflows, admission controllers, operator design
- Observability — OpenTelemetry pipelines, distributed tracing, cost-aware metrics collection, log aggregation
- Security & Secrets Management — SOPS, Vault, secrets rotation at scale, RBAC design, IP allowlisting
- Infrastructure & Networking — Traefik, MetalLB, Cilium, AWS, service mesh
Stack #
Kubernetes Helm Helmfile SOPS Traefik KEDA OpenTelemetry Prometheus Grafana AWS Cilium k3s ArgoCD Terraform
Links #
This notebook documents real-world engineering problems and how I solve them — not step-by-step tutorials, but the reasoning, trade-offs, and lessons from operating systems in production.