Overview Marathon TS is seeking a Cloud Reliability Engineer in Chantilly, VA to support our Department of Defense / Intelligence Community customer as part of a highly talented, highly motivated and high-performing team. As part of the Infrastructure Operations and Maintenance Support team you will be responsible for the availability, performance, monitoring, and incident response, among other things, of the Cloud Infrastructure that we support in a 24x7 environment.
Responsibilities
- Ensure the uptime of our multi-tenant infrastructure
- Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes
- Configure and use state-of-the-art monitoring tools to gather insights and then act upon the results
- Conduct incident response and in-depth root cause analysis.
- This position is han...