Description
Lead the manufacturing of AI servers and systems based on Trainium chips across ODMs and CMs in multiple geographies. You will be part of the Manufacturing, Quality and Reliability Team in Annapurna Labs, which focuses on Machine Learning products and designs cutting‑edge AI platforms for the world’s largest Cloud Services provider. The team is building next‑generation cloud server infrastructure that handles massive scale and rapid integration of emerging technologies. Our servers include accelerators such as Trainium and Inferentia, machine learning products designed to deliver high performance at low cost.
Key Responsibilities
- Engage with cross‑disciplinary staff to conceive and design infrastructure technologies.
- Work closely with internal interdisciplinary teams and outside partners to drive key aspects of product definition, execution, and testing in manufacturing.
- Be responsible for test validation of future technologies.
- Drive manufacturing process improvements to address reliability issues and concerns.
- Qualify manufacturing lines and mechanisms for mass production.
- Identify and validate product/component risks, work with design teams to mitigate them, and define test methodology and coverage to assure product quality.
- Provide technical leadership and mentor engineers.
- Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.
- Work with system engineering teams to identify and enforce DFM, DFA, and DFS principles.
- Evaluate, investigate, and introduce new manufacturing technologies and methodologies to enhance product quality and production efficiency at ODMs and CMs.
- Develop or adapt manufacturing processes at ODMs and CMs, including defining fixture requirements, critical assembly requirements, test methodology, and signal integrity, power, and heat management requirements.
- Drive factory‑related operational issues to closure during pre‑production builds, enabling a successful new product introduction cycle and the transition of products into mass production.
- Manage product lifecycle changes, lead product quality and reliability improvement projects, and drive technical root‑cause analysis of supplier defects.
- Improve and optimize manufacturing first‑pass yield and efficiency from prototype through product ramp.
- Work with engineering teams to represent manufacturing process steps in reviews, ensuring smooth new product introductions and product changes.
- Support cost reduction and sustaining activities.
Basic Qualifications
- Bachelor’s degree in Electrical Engineering or a related field.
- Experience identifying bugs in architecture, algorithms, functionality, and performance with strong debugging skills.
- Experience verifying at multiple levels of logic from IP blocks to SoCs to full system testing.
Preferred Qualifications
- Master’s degree in Electrical or Communications Engineering or a related field.
- Experience with formal verification techniques, including abstraction and end‑to‑end checking.
- Experience with ARM and various DSP ISAs.
- Experience with current and upcoming RF standards in cellular (4G/5G NR), WiMAX, 802.11ad, microwave backhaul, or related broadband wireless standards.
- Experience with industry standard tools and scripting languages (Python or Perl) for automation.
Equal Opportunity Employer
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers.
Benefits
Amazon offers comprehensive benefits, including health insurance (medical, dental, vision, prescription, basic life & AD&D insurance and optional supplemental life plans), EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage, 401(k) matching, paid time off, and parental leave. Detailed benefits information is available at https://amazon.jobs/en/benefits.