Site Reliability Engineer
Quick Summary
Experience with automation and infrastructure as code, DevSecOps, CI/CD pipelines, or automated security scanning (Windows and Linux) Understanding of US federal information system security policies,
Striveworks helps organizations harness the power of artificial intelligence to solve real-world national security and business challenges by serving as the command center between data, models, and business outcomes. Founded by data scientists and engineers, Striveworks set out to make the journey from deployment to ongoing optimization simple and effective.
With Striveworks, organizations aren’t just deploying AI—they’re building systems that remain reliable, adaptable, and ready to scale in an unpredictable world. Mission-critical operations require models that perform where they’re deployed, scale as workloads grow, and adapt rapidly as AI capabilities advance. Striveworks meets these demands, increasing reliability and performance while lowering costs—and enabling confident, data-driven decision-making in dynamic environments.
As a Site Reliability Engineer at Striveworks, you’ll be challenged—and trusted—on day one to implement and manage all corporate systems. You’ll be exposed to, and gain proficiency with, a wide array of systems and infrastructure automation tools, and you will be given the opportunity to build and/or incorporate additional tools. You’ll be called on to develop solutions that prevent problems from reoccurring in the future, instead of simply mitigating the issue for today. You’ll be highly encouraged to automate solutions to reduce or eliminate “toil.”
Your day-to-day will include:
- Maintaining and developing infrastructure (as code) within both private (OpenStack) and commercial (AWS, Azure, GCP) cloud environments
- Maintaining and developing configuration management automation for Windows laptops and Linux servers
- Providing user support for all corporate systems
This position offers a hybrid/on-site environment at our office in northwest Austin, TX.
In addition to the specific skills and expertise detailed below, we are looking for individuals who share our values. Sharing a set of values allows us to move at the speed of trust.
Collectively, we value a high-trust work environment where people respect each other and use candor kindly and constructively. We value work that intersects passion and perseverance, we geek out about the potential of our contributions, and we find joy in working hard on things that matter. Finally, we value taking ownership, having agency, and feeling individual responsibility for collective results.
Here’s what we’re looking for:
- 4+ years of experience in any IT-related field
- Experience deploying infrastructure in a cloud environment such as AWS, Azure, GCP, or OpenStack
- Experience with virtualization and/or containerization solutions (e.g., OpenStack, Kubernetes, Docker, VMware, KVM, or Hyper-V)
- Experience with Ansible or another configuration management solution (e.g., Chef, Puppet, or Salt)
- Programming experience in Python or other programming/scripting languages (e.g., Bash, PowerShell, Go, Java, or JavaScript)
- Due to the nature of this role, candidates must be a US person (a US citizen, a US national, or a Green Card holder)
We’re very interested in candidates who possess the above qualifications, and we appreciate and consider the addition of:
- Experience with automation and infrastructure as code, DevSecOps, CI/CD pipelines, or automated security scanning (Windows and Linux)
- Understanding of US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST SP 800-171, NIST SP 800-53, NIST RMF, and CMMC
- Experience with network technologies (e.g., VLANs, switches, routers, firewalls, and VPN)
- Experience working with GPUs for compute workloads
- Experience maintaining distributed/clustered systems
The anticipated base pay range for this position is $110,000–$128,000/year. Striveworks’ total compensation package includes a competitive base salary, equity grants, and cash bonuses.
What We Offer
~1 min readListing Details
- Posted
- April 17, 2026
- First seen
- March 26, 2026
- Last seen
- April 17, 2026
Posting Health
- Days active
- 22
- Repost count
- 0
- Trust Level
- 83%
- Scored at
- April 17, 2026
Signal breakdown

We make data useful by making MLOps disappear. Only when both process as code and automated remediation are in place can we then talk meaningfully about MLOps becoming a self-managing process.
View company profilePlease let Striveworks know you found this job on Jobera.
4 other jobs at Striveworks
View all →Explore open roles at Striveworks.
Similar Site Reliability Engineer jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.