Manager for Machine Learning System Software - 89602

  • Salary:
    negotiable / month
  • Posted:
    1 day ago

What you do at AMD changes everything

At AMD, we push the boundaries of what is possible. We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies: the building blocks for gaming, immersive platforms, and the data center. Developing great technology takes more than talent: it takes amazing people who understand collaboration and respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available and come join our team.

Manager for Machine Learning System Software

The Role:
Lead a team developing system software to improve the execution of machine learning software on AMD GPU/CPU compute platforms.

The Person:
The ideal person has strong technical and analytical skills in C++ development in a Linux environment. They must be able to work as a member of a team while also working independently, defining goals and scope, and leading their own development effort.

Key Responsibilities:
· Work on compiler technologies to optimize the transformation of high-level machine learning operations for inference on AMD GPUs.
· Work jointly with library developers to adopt and use these compiler technologies to enable library development.
· Work closely with open-source maintainers to have changes adopted.
· Participate in the co-design of AMD’s ML hardware and software stack.
· Benchmark, analyze, and optimize the performance of key machine learning applications.
· Develop a high-performance run-time ML engine.
· Work closely with ML engineers to discover the hardware and software requirements of current and future ML applications.
· Work in a distributed compute setting to optimize for both scale-up (multi-GPU) and scale-out (multi-node) systems.
· Apply knowledge of standard software engineering methodologies.

Preferred Experience:
· Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
· Knowledge of compiler frameworks such as LLVM is an advantage.
· Experience with and passion for any of the following is a plus: machine learning, parallel programming (HIP, CUDA, OpenCL), high-performance and massively parallel systems, processor and computer architecture.
· Ability to work independently, define project goals and scope, and lead a development effort.
· Knowledge of GPU computing and a basic understanding of deep learning is an advantage.

Academic Credentials:
· Bachelor’s, Master’s, or PhD, or equivalent experience, in Computer Science, Computer Engineering, or a related field.

Requisition Number: 89602
Country: Canada
Province: Ontario
City: Markham
Job Function: Design

AMD is an inclusive employer dedicated to building a diverse workforce. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective provincial human rights codes throughout all stages of the recruitment and selection process. Any applicant who requires accommodation should contact [email protected]

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.