techchaintalent
New

Senior Site Reliability Engineer

United StatesUnited States·New Yorksenior
EngineeringDevops Engineer
1 views0 saves0 applied

Quick Summary

Overview

About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper,

Technical Tools
EngineeringDevops Engineer

Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient than most blockchain-based systems. It's designed so Stellar's ecosystem can make a real-world, lasting impact.

About the Role

~1 min read

SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You'll ensure the reliability and scalability of our systems, design and improve the infrastructure behind our production environments, and automate operational work so developers can focus on building great products.

Responsibilities

~1 min read
  • Maintain, improve, scale and secure our AWS/GCP infrastructure and Linux systems.
  • Assist our development teams in running, packaging, deploying and troubleshooting applications
  • Work with developers on streamlining deployment processes with Jenkins and other CI/CD tooling.
  • Build, maintain, monitor and improve our Kubernetes clusters.
  • Work with development teams on migrating applications to Kubernetes.
  • Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK.
  • Monitor, triage and respond to alerts in our high availability environments.
  • Participate in design and code reviews, and ensure that the foundation for our services is best in class.
  • Evaluate new technologies, design and implement as appropriate.
  • Identify automation opportunities and implement by creating custom or by using off the shelf solutions.

Requirements

~1 min read

5+ years of experience of working in cloud-based systems operations, as a SRE or DevOps engineer.

First-hand experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform).

Proficient in utilizing SRE methodologies like capacity planning and disaster recovery testing to ensure the scalability, resilience, and availability of critical services.

Production experience building and maintaining Kubernetes clusters.

Will need to know how to code

Nice to Have

~1 min read
  • Ability to understand Go, Rust, C++ and TypeScript source code
  • Experience experimenting with AI-driven approaches to operations
  • Comfortable with participating in on-call rotations and conducting thorough root cause analyses to keep systems running smoothly.
  • Experienced in managing production workloads and skilled in using monitoring tools to detect issues early.
  • A strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
  • No blockchain needed
  • Experience using AI is a plus

Location & Eligibility

Where is the job
New York, United States
On-site at the office
Who can apply
US

Listing Details

First seen
June 17, 2026
Last seen
June 18, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
51%
Scored at
June 17, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

techchaintalentSenior Site Reliability Engineer