Hiring Pytorch Automation specialist, with expertise in Pytorch workloads and triton.Develop PyTorch workloads that stress model‑level execution (e.G., large GEMMs, attention patterns, MoE‑like behavior, mixed precision, long‑running loops)Author custom Triton kernels to directly stress hardware execution units, memory hierarchies, and synchronization pathsBuild parameterized stress harnesses that can scale with problem size, number of devices, and runtime durationIntegrate workloads with existing tooling for profiling, monitoring, and failure triageCollaborate with platform, firmware, and SDK teams to ensure workloads target known risk areas and emerging issuesDocument usage patterns and provide reproducible scripts for lab and CI usageExpected DeliverablesA library of reusable PyTorch stress workloadsA set of Triton‑based micro‑ and macro‑kernels designed specifically for stress and saturation testingTest harnesses/scripts supporting single‑device and multi‑device executionDocumentat...