🍁 SearchCanadaJobs.com

Software Development Manager, AWS Neuron SDK - Distributed Training

Company

Amazon

Location

Cupertino, CA

Type

Full-time

Description
AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine
learning accelerators and the Trainium-based servers that use them. As the SDM of Software Development for the Neuron Training team, you will be responsible for leading a strong team of engineers and managers to help design and deploy these new products. A successful candidate will have an established background in developing Machine Learning products with direct customer-facing experience, a strong technical ability and a motivation to achieve results. Experience in Machine Learning and software development is also a must.

Responsible for the full development life cycle of our integrations and extensions for training support in Pytorch, JAX, and distributed training libraries with a focus on performance of latest ML models at scale on Trainium using latest techniques in performance optimization, accuracy, and resilience.

you will lead the way to ensure...

🍁 Ready to Apply?

Take the next step in your Canadian career

Apply Now