SpaceX
SpaceX1d ago
New
$125,000 – $150,000/yr

Site Reliability Engineer — HPC & Automation (Silicon Engineering)

United StatesUnited States·Redmondmid
EngineeringDevops Engineer
0 views0 saves0 applied

Quick Summary

Key Responsibilities

Deploy, upgrade, operate, maintain, and scale our suite of clusters and services Collaborate with engineers to develop automated,

Requirements Summary

Bachelor’s degree in computer science, information systems, or an engineering discipline; OR 2+ years of professional experience in system administration, high performance computing,

Technical Tools
EngineeringDevops Engineer

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

At SpaceX we’re leveraging our experience in building rockets and spacecraft to deploy Starlink, the world’s most advanced broadband internet system. Starlink is the world’s largest satellite constellation and is providing fast, reliable internet to millions of users worldwide. We design, build, test, and operate all parts of the system – thousands of satellites, consumer receivers that allow users to connect within minutes of unboxing, and the software that brings it all together. We’ve only begun to scratch the surface of Starlink’s potential global impact and are looking for best-in-class engineers to help maximize Starlink’s utility for communities and businesses around the globe. 

We are seeking a motivated, proactive, and intellectually curious engineer who will work alongside world-class cross-disciplinary teams (systems, firmware, architecture, design, validation, product engineering, ASIC implementation). As a Site Reliability Engineer on the Silicon Engineering team you will get the opportunity to design, operate, scale, and automate the high performance computing infrastructure we use to develop the chips powering the world's largest satellite constellation and a global internet service. This position will have a meaningful impact on Starlink silicon by enabling faster design-iterations, simulations, and regression turnaround times that gate how fast our chip teams can ship. 

Responsibilities

~1 min read
  • Deploy, upgrade, operate, maintain, and scale our suite of clusters and services
  • Collaborate with engineers to develop automated, full turnkey solutions for silicon simulation workflows to speed up project timelines
  • Manage our underlying infrastructure as code and use modern observability tools to provide a complete picture of cluster and infrastructure health
  • Operate the continuous integration pipeline, build and release systems, and version control across the environment
  • Identify and eliminate performance bottlenecks using measurement and creative engineering

Requirements

~2 min read
  • Bachelor’s degree in computer science, information systems, or an engineering discipline; OR 2+ years of professional experience in system administration, high performance computing, or site reliability engineering
  • 1+ years of development experience with Bash, Python, and/or other programming languages
  • 1+ years of experience with Linux operating systems
  • Familiarity with containerization technologies (i.e. Docker, Kubernetes)
  • Knowledge in computer system concepts (computer architecture, computer organization, operating systems and concurrency)
  • Experience with databases and data modeling (e.g., MySQL, PostgreSQL, SQLite)
  • Networking knowledge of TCP/IP
  • Experience with high performance computing and workload managers (e.g., Slurm, LSF)
  • Experience with Terraform, Ansible, Puppet, or similar automation frameworks
  • Experience building monitoring and alerting as code (e.g., Grafana, Prometheus, custom exporters)
  • Experience with CI/CD automation at scale (e.g., Jenkins, Bamboo, build systems)
  • Experience with infrastructure as code (IaC) tools for managing fleets of servers
  • Experience with using & building REST API clients/servers
  • Experience with enterprise/networked storage automation (e.g., NetApp ONTAP REST API/CLI, NFS)
  • Experience with ASIC design flows and tools (e.g., Cadence, Synopsys, Ansys, Keysight, Siemens)
  • Strong desire to find performance bottlenecks and performance improvement techniques
  • Excellent communication skills with the ability to communicate with customers, peers, management, etc. in both formal and informal situations
  • Ability to quickly learn new tools and frameworks
  • Interest in or experience with AI/LLM-assisted tooling (e.g., Grok, Claude Code)
  • Ability to work extended hours and weekends as needed to meet critical milestones
  • To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.  

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should reach out to EEOCompliance@spacex.com

Location & Eligibility

Where is the job
Redmond, United States
On-site at the office
Who can apply
Open to applicants worldwide

Listing Details

Posted
July 1, 2026
First seen
July 1, 2026
Last seen
July 2, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
60%
Scored at
July 1, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
SpaceX
SpaceX
greenhouse

SpaceX is a leader in aerospace manufacturing and space transport services, founded to make life multiplanetary.

Employees
3k+
Founded
2002
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

SpaceXSite Reliability Engineer — HPC & Automation (Silicon Engineering)$125k–$150k