We are looking for a senior, hands-on AgentOps Platform Engineer to design, build, and operate the cloud-native infrastructure that powers our AI agents at scale. This is a lead-by-example role: You write the Terraform You build the pipelines You own the platform in production GCP is your primary environment, but you will design with multi-cloud in mind (AWS, Azure), ensuring portability, resilience, and long-term flexibility. This role sits at the intersection of DevOps, MLOps, and AgentOps, with deep responsibility for reliability, security, observability, and cost. KEY RESPONSIBILITIES Platform & Infrastructure Ownership Design, build, and operate production-grade infrastructure for AI agents and LLM services Own Terraform-based Infrastructure as Code for all environments (dev, uat, prod) Lead infrastructure decisions through hands-on implementation, not diagrams Build scalable foundations for: Agent orchestration Inference services RAG pipelines Vector stores Optimise cloud resourc...