Your role
Overview:
- Build and run a reliable platform for services and data workflows across Kubernetes and Prefect.
- Own CI/CD, observability, security, and developer experience for Python/Go/Rust services.
- Design, provision, and operate Kubernetes workloads (deployments, networking, autoscaling, storage).
- Build and maintain GitLab CI/CD pipelines for Python, Go, and Rust services (build, test, scan, release).
- Operate Prefect (agents, work queues, deployments, concurrency limits, task execution environments).
- Implement environment strategy and promotion flow (dev/staging/prod) with clear release gates.
- Create golden paths and templates for FastAPI microservices and Prefect flows.
- Manage secrets, configuration, and access (e.g., GitLab variables, K8s secrets).
- Establish observability: logging, metrics, traces, alerting, runbooks, and SLOs.
- Operate data stores (MySQL, PostgreSQL, Redis): provisioning, backups, migration execution, monitoring, and capacity planning.
- Optimise build and runtime costs (container images, caching, autoscaling, resource requests/limits).
- Lead incident response, postmortems, and reliability improvements.