ML Systems Engineer, ML Acceleration

Company

Motional

Location

singapore, singapore

Type

Full-time

Mission Summary We are looking for a Machine Learning Systems Engineer to join our ML Acceleration team. In this role, you will be responsible for the core systems that enable our researchers to train frontier models at scale, focusing obsessively on speed, cost, reliability, and throughput. Your work will directly impact our ability to scale large‑scale distributed model training and reduce the time-to‑convergence for our next generation of models. 
What you'll be doing Performance Profiling & Optimization : Utilize profiling tools (e.g., Nsight, PyTorch Profiler) to identify bottlenecks in data loading, gradient computation, and communication. Implement optimizations like kernel fusion, sharding, and tiling to improve step time. 
Distributed Training : Optimize distributed training pipelines using frameworks such as PyTorch Distributed. 
Kernel Development : Design and maintain high‑performance GPU kernels in Trit...
        

🍁 SearchCanadaJobs.com

ML Systems Engineer, ML Acceleration

🍁 Ready to Apply?