ascendingdc4mo ago

Staff Site Reliability Engineer

United States·Fairfaxlead

OtherStaff Site Reliability Engineer

3 views0 saves0 applied

Apply Now

Quick Summary

Requirements Summary

7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.

Technical Tools

OtherStaff Site Reliability Engineer

Staff Site Reliability Engineer / Cloud SME

As the Staff SRE/Cloud SME, you will be a critical technical leader driving the rearchitecting of our existing monolithic system into a resilient, cloud-native architecture. This role requires deep expertise across multiple cloud platforms (Azure and AWS) and container orchestration (Kubernetes) to ensure the next-generation platform meets the highest standards of scalability, reliability, and security.

Responsibilities

~1 min read

Architecture & Transformation Leadership

→Lead the technical rearchitecting efforts, transforming a large-scale monolithic system into a modern microservices-based, cloud-native application.
→Collaborate with cross-functional teams (Engineering, Architecture, Product) to define and implement the new system architecture using domain-driven design (DDD) principles.
→Conduct technology evaluations and provide recommendations for new tools, frameworks, and cloud services to enhance our infrastructure.

Reliability Engineering & Cloud Operations

→Utilize Kubernetes (K8S) for container orchestration and management, ensuring extreme scalability, reliability, and high availability of the system.
→Implement robust, highly resilient, and highly available components for the system.
→Develop and implement comprehensive monitoring, logging, and alerting mechanisms to ensure optimal system performance and availability.
→Drive the adoption of DevOps principles and practices throughout the software development lifecycle, ensuring seamless integration and continuous deployment processes.

Technical Expertise & Mentorship

→Stay up-to-date with emerging technologies, frameworks, and industry trends related to systems and cloud computing.
→Mentor and provide technical guidance to junior team members, fostering a culture of continuous learning and professional growth.

Requirements

~1 min read

Cloud Platforms: 7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.
Cloud-Native Transformation: 7+ years of experience in rearchitecting large-scale monolithic applications to cloud-native architectures.
Container Orchestration: Strong expertise in Kubernetes (K8S) is required, including hands-on experience with both AKS (Azure Kubernetes Service) and EKS (Elastic Kubernetes Service).
Networking: Strong experience with Cloud Networking, with the ability to design and resolve complex cloud networking architecture problems.
IaC: Expert knowledge of Terraform for infrastructure-as-code deployment and management.
Security: Must possess strong knowledge of security best practices for containers and Kubernetes clusters.
Education: Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
Bonus Knowledge: Knowledge of load balancing algorithms.