About the Role
Xsolla is seeking a Staff Infrastructure AI/ML Engineer to design, implement, and maintain AI/ML-powered solutions for our infrastructure. The role spans Google Cloud Platform (GCP) and multi-cloud environments, with the goal of improving reliability, security, and operational efficiency through intelligent automation.
Responsibilities
- Design and implement AI/ML-powered solutions for infrastructure use cases, including predictive autoscaling, anomaly detection, intelligent cost optimization, and automated remediation across GCP and multi-cloud environments.
- Build and maintain AI-driven monitoring and observability systems that correlate logs, metrics, and traces to surface root causes, predict bottlenecks, and reduce mean time to resolution (MTTR).
- Develop and operate automated incident response workflows using AI-powered playbooks that diagnose, contain, and resolve infrastructure issues with minimal manual intervention.
- I...