Site Reliability Engineer - SRE
Quick Summary
Drivetrain is on a mission to empower businesses to make better decisions. Our financial planning & decision-making platform helps companies scale and achieve their targets predictably. Drivetrain is a remote-first company headquartered in the San Francisco Bay Area.
Cloud Infrastructure & Orchestration Multi-Cloud Management: Architect, manage, and continuously optimize highly available cloud infrastructure across both AWS and GCP.
As a Senior Site Reliability Engineer at Drivetrain, you will be a cornerstone of our engineering organization, ensuring our fast-growing SaaS platform remains highly available, performant, and secure. At this stage of our growth, scaling infrastructure efficiently while maintaining the rigorous security and reliability standards required for financial data is paramount. You will take ownership of our multi-cloud infrastructure, drive automation, champion observability, and collaborate closely with development teams to build a culture of reliability from code commit to production.
Responsibilities
~1 min read-
DevOps Culture: Act as an embedded reliability advocate. Collaborate closely with software engineers early in the development lifecycle to ensure applications are designed for deployability, scalability, and resilience.
-
Continuous Improvement: Proactively identify system bottlenecks and architectural weaknesses. Contribute to process improvements, build internal developer tooling, and maintain comprehensive documentation to elevate team productivity and system understanding.
Requirements
~1 min read-
Experience: 5+ years of hands-on experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles, preferably within a fast-paced SaaS environment.
-
Cloud Platforms: Deep, proven proficiency in AWS (EC2, EKS, RDS, VPC, IAM, S3) AND GCP (GKE, Compute Engine, Cloud SQL, IAM, Cloud Storage). Ability to navigate and optimize multi-cloud architectures.
-
Containerization: Expert-level knowledge of Docker and Kubernetes, including advanced deployment strategies and lifecycle management.
-
Automation/IaC: Strong programming skills in Python and extensive experience with Terraform.
-
Observability: Hands-on expertise building dashboards and alerting systems using Prometheus, Grafana, and log aggregation stacks (ELK/EFK).
-
Networking & Security: Solid understanding of cloud networking (VPC peering, load balancing, DNS) and zero-trust security principles in a containerized environment.
Location & Eligibility
Listing Details
- Posted
- September 1, 2025
- First seen
- May 5, 2026
- Last seen
- May 11, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 30%
- Scored at
- May 6, 2026
Signal breakdown
Please let Drivetrain know you found this job on Jobera.
3 other jobs at Drivetrain
View all →Explore open roles at Drivetrain.
Similar Devops Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.
