chainlink-labs
New

Senior Site Reliability Engineer, Observability

United StatesUnited StatesRemotefull-timesenior
EngineeringDevops Engineer
0 views0 saves0 applied

Quick Summary

Overview

About ChainlinkChainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi).

Key Responsibilities

Build and orchestrate Modern OTEL-based Observability Platform Support multiple telemetry types, like metrics, logs and traces. Define and support modern governance in observability and problems at scale.

Requirements Summary

7+ years of relevant professional experience. You probably have worked on a devops, infrastructure, SRE, and/or platform team before Ability to develop software outside of the scope of typical infrastructure requirements and configurations…

Technical Tools
argocdawscppgithub-actionsgografanajavakubernetespackerprometheuspythonrubysplunkterraformcode-reviewdistributed-systems

Requirements

~1 min read
  • Build and orchestrate Modern OTEL-based Observability Platform

  • Support multiple telemetry types, like metrics, logs and traces.

  • Define and support modern governance in observability and problems at scale.

  • Ensure reliability, security, and performance exceed our defined SLAs

  • Work with engineers from across the company to help troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load

  • Lead the design and deployment of monitoring/observability services to detect and alert the team of needed action.

  • Ingest, aggregate, transform, and utilize data from a multitude of sources in our real time data pipeline.

  • Oversee the availability, performance, and supportability of our observability infrastructure.

  • Create processes around alert response operations and support the team to ensure the reliable delivery of oracle data.

  • Make recommendations to ensure sufficient metrics are collected to create alerts with every new feature release.

  • Champion reliability and security by taking the time to do your work right the first time

Requirements

~1 min read
  • 7+ years of relevant professional experience. You probably have worked on a devops, infrastructure, SRE, and/or platform team before

  • Ability to develop software outside of the scope of typical infrastructure requirements and configurations

  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby

  • Expert knowledge in all aspects of designing, developing, and managing large real-time systems

  • Experience with monitoring and logging. You know how to export metrics using Prometheus, have built a Grafana dashboard or two, and have experience with a centralized logging solution like an ELK Stack, Splunk or Grafana Stack.

  • Experience with distributed systems and container orchestration. You have maintained or even built Kubernetes clusters before and feel comfortable deploying completely new services on them

  • Strong communication skills. You can give and receive constructive feedback, and you do not shy away from planning meetings and code reviews

Requirements

~1 min read
  • Excitement for blockchain, Web 3.0, and similar decentralized technologies.

  • Experience running any infrastructure in the blockchain/web3 space

  • Ability to scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity

  • Experience working remotely in a distributed team

  • A strong desire to grow and challenge yourself. We would expect you to constantly find ways to improve and automate services to reduce toil

  • AWS; Terraform/Terragrunt; Kubernetes, Calico and ArgoCD; Prometheus and Grafana; GitHub Actions; Packer

  • We expect you to be comfortable with most of those tools and very proficient in several of them.

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that you try to overlap some working hours with Eastern Standard Time (EST).

We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert, so we encourage you to take the time to thoughtfully prepare your application. We want to fully consider your experience and skills, and you will hear from us regarding the status of your application shortly after the closing date.

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you choose to submit, is subject to our Recruiting Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.

Location & Eligibility

Where is the job
United States
Remote within one country
Who can apply
US

Listing Details

Posted
December 24, 2025
First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
23%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

chainlink-labsSenior Site Reliability Engineer, Observability