We are looking for a skilled and reliability-driven Site Reliability Engineer (SRE) to strengthen our engineering team. In this hybrid role, you will combine hands-on 2nd level support responsibilities with monitoring, automation, and reliability engineering. You will play a key role in ensuring the stability, observability, and continuous improvement of our production systems supporting real-time financial data processing. What You Will Do
Operational Support & Incident Management Provide 2nd level support for production systems and critical business applications. Investigate, troubleshoot, and resolve incidents and performance issues. Perform root cause analysis (RCA) and document findings in a structured manner. Collaborate closely with development teams to ensure sustainable issue resolution. Contribute to post-incident reviews and continuous improvement initiatives.
Monitoring, Observability & Automation Design, implement, and maintain monitoring dashboards. Improve alert qu...