Overview
Senior Engineer for Network Observability — join CoreWeave’s Network Observability team. You will design, develop, and maintain the monitoring, telemetry, and observability systems that keep CoreWeave’s GPU cloud network operating reliably and at scale. You will build solutions that provide real-time insights into network performance and enable proactive issue detection and rapid resolution.
What You’ll Do
- Develop, optimize, and maintain network observability platforms. Use Python and Go to create and automate collectors, exporters, and dashboards that provide deep visibility into network health and performance.
- Collaborate with Network Engineering and Platform teams to ingest and unify logs, metrics, and events from Arista EOS, NVIDIA Cumulus Linux, Nokia SR OS, SR Linux, and other platforms into a single observability pipeline.
- Design and implement scalable telemetry solutions using protocols like gNMI, SNMP, and streamin...