hero

Discover the best
jobs in tech

From design and development to sales,
people, and management, get <matched>
with the best opportunities.
92
companies
11,008
Jobs

AIML - Software Engineer (ML Efficiency), Machine Learning Platform & Infrastructure

Apple

Apple

Software Engineering, Other Engineering, Data Science
Cupertino, CA, USA
Posted on Oct 3, 2024

Summary

Posted:
Weekly Hours: 40
Role Number:200571396
Do you want to shape the platform that enables the next generation of intelligent experiences on Apple products & services? In Apple’s Machine Learning Platform Technology & Infra team we have built the platform that Apple uses for developing machine learning, artificial intelligence, and computer vision applications. As a team, we have a variety of technical backgrounds, from machine learning PhDs to builders of large-scale production systems. Specifically in this role you will be working on optimizing end-to-end system performance of distributed machine learning workloads. This is a highly collaborative role and you will be working with key partners across the company.

Description

We are seeking highly motivated and experienced engineers to join our team. The ideal candidate will have a deep understanding of machine learning systems and cloud computing infrastructure. Key responsibilities in this role are: Engage with ML researchers to optimize end-to-end performance of large scale distributed ML workloads Analyze workload metrics to identify sources of inefficiencies and work with users to understand and optimize ML workloads Conduct workload analysis based on benchmarking key workloads on deployed systems Improve large scale training resiliency by optimizing applications and frameworks for improved recovery from failures and preemptions Influence architecture, design, development, and operations of next generation ML accelerator systems based on workload insights

Minimum Qualifications

  • Experience working with large scale parallel and distributed accelerator-based systems
  • Experience optimizing performance and AI workloads at scale
  • Experience developing code in one or more of training frameworks (such as PyTorch, TensorFlow or JAX)
  • Experience in performance analysis and optimization experience in Cloud accelerators
  • Deep understanding of computer systems and the interactions between HW and SW
  • Strong communicator with ability to analyze complex and ambiguous problems
  • Programming and software design skills (proficiency in C/C++ and/or Python)
  • Experience working in a high-level collaborative environment and promoting a teamwork mentality

Key Qualifications

Preferred Qualifications

  • BS or MS in Computer Science or related field

Education & Experience

Additional Requirements

Pay & Benefits

  • Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.