Hardware Systems Engineer, NPI Lead
Austin, TX • Menlo Park, CA • Full Time
Meta
Infrastructure
Hardware
Meta is seeking a Hardware Systems Engineering Lead to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services. The RTP team is responsible for the end-to-end hardware lifecycle of all Meta servers, including prototyping of experimental hardware, hands-on pre-production system and hardware debugging and stress testing, and enabling production-ready system monitoring, automated provisioning, and automated remediation of issues. Hardware Systems Engineers in RTP work closely with HW/SW co-design teams, hardware designers, networking teams, system manufacturers, component vendors, capacity engineering, production engineering, production services, and data center operations teams to enable new systems that will be deployed in our production data centers. We also work across service and hardware architectures for new types of systems: building prototypes to demonstrate their value, enabling go/no-go decisions, and finally optimizing these systems for at-scale production.
Hardware Systems Engineer, NPI Lead Responsibilities
  • Lead and execute comprehensive end-to-end system validation for the next generations of compute and storage systems.
  • Collaborate with hardware designers, external partners, and internal silicon validation teams to define test strategy at system level.
  • Contribute to new feature/technology development/validation across hardware/software stack.
  • Proactively create experiments and tooling to detect and diagnose hardware/firmware/software health issues.
  • Troubleshoot, diagnose and root cause system failures and isolate the components/failure scenarios while working with internal & external partners.
  • Develop visibility through data visualization and implement systemic solutions to hardware health issues.
  • Leverage production experience to drive external and internal teams to continuously improve product quality.
  • Communicate complex technical findings to diverse stakeholders at all levels.
  • Proactively identify and mitigate potential product risks based on testing insight.
Minimum Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • 7+ years of experience in domains such as compute (x86/ARM), memory (DDR4/5, LPDDR5, HBM), storage (NVMe, TLC, QLC, SATA HDDs), GPU/accelerators, high-speed IO (PCIe Gen5, Gen6, CXL), and/or system management (BMC, system firmware).
  • 2+ years of experience as a tech lead for complex SOCs (ARM, RISC, x86, CPU, GPU, TPU), leading post-silicon validation for the product, large feature development, HSIO qualification, or similar.
  • Direct experience in one or more areas including but not limited to debug for ASIC development (silicon design, bringup, characterization, validation), board level debug, firmware validation, system validation.
  • 3+ years of experience troubleshooting and debugging using lab tools (oscilloscope, logic analyzer, etc.).
  • 3+ years of experience in developing test specifications, procedures, and debug guides for test solutions.
Preferred Qualifications
  • 10+ years of experience in one or more of the following domains: compute systems, storage systems, accelerated compute systems/HPC, kernel/firmware development and/or test, post-silicon bringup.
  • 4+ years of experience as a tech lead through multiple product lifecycles for ARM or x86 SOCs, or for Compute/Storage/HPC systems getting deployed in fleet/datacenter production.
  • 4+ years of direct expertise with Linux systems and server systems management/debug.
  • 4+ years of experience scripting automation in Python.
  • 4+ years of experience creating test plans for complex chipsets leveraging functional, stress and performance testing.
  • Experience with embedded systems’ architecture and components, performance optimization of algorithms, test automation, and instrument communication (oscilloscopes, protocol analyzers, traffic generators, etc.).
  • Experience in debugging tools for systems-on-chip (SOCs), including but not limited to JTAG, GDB, Trace32, or similar.
  • Expertise with common bus protocols such as I2C, SPI, USB, LP/DDR, and/or PCIe.
  • Expertise with the integration of lab tools for automated workflows in large scale deployments.
  • Expertise with continuous integration/continuous delivery tools.
For those who live in or expect to work from California if hired for this position, please click here for additional information.
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

$163,000/year to $225,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.


Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.

Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.