✓100% Medical, Dental & Vision Coverage for Employees
✓Paid Time Off and Paid Holidays
✓401K match up to 5%
✓Educational Benefits for Career Growth
✓Employee Referral Bonus
✓Flexible Spending Accounts:
Healthcare (FSA)
✓Parking Reimbursement Account (PRK)
✓Dependent Care Assistant Program (DCAP)
✓Transportation Reimbursement Account (TRN)
✓Design and operate scalable, resilient, and secure infrastructure platforms across cloud and hybrid environments
✓Champion DevOps and SRE practices including Infrastructure as Code, CI/CD, observability, and reliability engineering
✓Build developer-friendly platforms (“golden paths”) that simplify deployments, reduce friction, and improve velocity
✓Enable and optimize infrastructure for AI/ML workloads, including:
Data pipelines and storage systems
✓Model training and inference environments
✓GPU-enabled and high-performance compute workloads
✓Develop and maintain automated CI/CD pipelines for applications, data, and ML workflows
✓Implement observability frameworks (metrics, logs, traces) to ensure system health and performance
✓Define and manage SLOs, SLIs, and error budgets to drive reliability improvements
✓Lead incident response, root cause analysis, and postmortems with a focus on continuous improvement
✓Automate provisioning, configuration, patching, and environment lifecycle management
✓Build and manage containerized and orchestrated platforms (Docker, Kubernetes)
✓Support cloud migration, modernization, and platform standardization initiatives
✓Ensure systems meet security, compliance, backup, and disaster recovery requirements
✓Evangelize DevOps practices to development community on the AI driven tool integrations for code standards, qualitative tools, security tools in the shift-left automation
✓Mentor engineers and promote best practices in DevOps, SRE, and platform engineering
✓Stay abreast of new technologies in your areas but not limited to AIOps, MLOps, cloud computing & deployment, site reliability engineering, infrastructure automation, security best practices, data engineering etc.
✓Must have 6+ years of Hands-on Linux experience that includes specific technical experience with Ubuntu/CentOS/Red Hat operating systems, containers, dependency management and administration support
✓Must have 4+ years of experience automating Infrastructure-as-Code (IaC) deployments to one of the following cloud platforms Amazon AWS, Google GCP and Microsoft Azure
✓Must have 4+ years in DevOps / SRE roles supporting production systems
✓Must have 4+ years with CI/CD and automation tools such as Terraform, Ansible, Chef, Puppet, Jenkins, GitHub Actions
✓Strong scripting skills (Python, Bash, PowerShell or similar)
✓Experience with monitoring and observability tools (Prometheus, Grafana, ELK, or cloud-native equivalents)
✓Must be proficient using vibe coding and coding assistants to develop scripts, tools and applications for the DevOps and SRE use cases
✓Must have proficiency to debug or troubleshoot and/or deploying SQL and/or NoSQL databases, object storage, web servers, open-source programming stack for Node.JS, R, Python, .NET Core, Java is desired but not mandatory
✓Must be willing to learn new technologies, adopt and adapt to emerging technologies or needs from a project to a project
✓Cloud certifications is preferred
✓Certifications in Docker, Kubernetes, Linux or Networking (CCNA or CCNP or similar) is preferred but optional