Hatchit · 2mo ago

Site Reliability Engineer (SRE)

Remote · Full Time · Mid

Engineering · DevOps & Infrastructure · Site Reliability Engineer · DevOps Engineer · Infrastructure & Cloud

Quick Summary

Technical Tools

ansible, aws, azure, bash, datadog, docker, java, kubernetes, postgresql, python, terraform, ci-cd, distributed-systems, linux, microservices, networking, performance-optimization

hatch I.T. is partnering with CardioOne to find a Site Reliability Engineer (SRE) to join their team. See details below:

About the Role:

CardioOne is seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, security, and performance of their production systems and services. The SRE will bridge the gap between software development and operations, implementing automation, monitoring, and best practices to enable rapid, reliable delivery of applications. You will report directly to the Senior Director of Engineering.

About the Company:

CardioOne partners with independent cardiologists to provide innovative solutions that improve patient outcomes and reduce costs. Their platform helps their physician partners thrive in today’s fee-for-service environment and prepare for success in value-based care. In February 2024, they partnered with WindRose Health Investors as well as top physician services and payor executives to grow their team and invest in their next phase of growth.

CardioOne offers a magnificent work environment, good working conditions, and competitive pay. They offer medical, dental, vision, and a 401(k) plan with a match to benefit-eligible employees. They offer PTO (Personal Time Off) and sick time to full-time employees. They take pride in creating a culture of employee engagement that translates into an exemplary patient experience. Join them in their mission to positively impact US cardiology.

Responsibilities:

Ensure high availability, scalability, and performance of production systems.

Implement and maintain SLIs, SLOs, and SLAs for critical services.

Conduct capacity planning and performance tuning.

Automate infrastructure provisioning using IaC tools such as Terraform, Terragrunt, and Ansible.

Develop automation to minimize manual operations and improve deployment workflows.

Build CI/CD pipelines to support rapid and reliable deployments.

Design and maintain monitoring, logging, and alerting systems (Datadog).

Participate in on-call rotations and lead incident response efforts.

Perform root-cause analysis and develop postmortems to prevent recurring issues.

Manage cloud infrastructure (AWS, Azure) and container orchestration platforms (Kubernetes, ECS).

Optimize system architecture for reliability and fault tolerance.

Implement best practices for security, networking, and service resilience.

Work closely with development teams to design reliable microservices and distributed systems.

Advocate for SRE principles and drive operational excellence across engineering teams.

Mentor engineers on reliability practices, tooling, and automation strategies.

Qualifications:

Bachelor’s degree in Computer Science, Engineering, or equivalent experience.

3–7 years of experience in SRE, DevOps, or Systems Engineering roles.

Strong proficiency with Linux systems and shell scripting.

Experience with cloud platforms (AWS, Azure).

Hands-on experience with Kubernetes/ECS and container technologies (Docker).

Proficiency in at least one programming language (Python or Java).

Experience with CI/CD pipelines and DevOps tooling.

Strong understanding of distributed systems, networking, and security fundamentals.

Strong analytical and problem-solving skills.

Excellent communication and cross-team collaboration.

Ability to thrive in fast-paced, high-stakes environments.

A mindset focused on continuous improvement and operational excellence.

Experience with observability stacks (OpenTelemetry).

Knowledge of database management (PostgreSQL).

Experience with configuration management tools (Ansible, Chef, Puppet).

Familiarity with zero-downtime deployments and chaos engineering practices.

Location & Eligibility

Where is the job

Worldwide

Fully remote, anywhere in the world

Who can apply

Same as job location

Listed under

Worldwide

Listing Details

Posted: March 3, 2026
First seen: March 26, 2026
Last seen: May 13, 2026

Posting Health

Days active: 47
Repost count: 0
Trust Level: 39%
Scored at: May 13, 2026

Hatchit

Founded

2009


