General Summary
Own & drive the deployment of platform solutions with a strong emphasis on AWS, Infrastructure as Code, and GitOps driven workflows.
Responsibilities
- Provision, upgrade, and operate production EKS clusters and core platform services in multi‑tenant environments, including autoscaling patterns using Karpenter.
- Build and maintain GitOps workflows using Argo CD and CI/CD pipelines using GitHub Actions, enabling repeatable, audited delivery patterns.
- Continuously evaluate and tune platform capabilities and services to improve reliability, performance, and cost efficiency for development teams.
- Build and maintain robust monitoring/alerting and recovery processes for platform services and components leveraging Datadog and Prometheus/Grafana.
- Implement secure‑by‑default cluster and workload patterns, including network and policy controls (e.g., Cilium and Kyverno), RBAC, and least‑privilege access....