Senior Site Reliability Engineer
Quick Summary
WHO WE ARE 🌍 We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement, more sales, and real, sustainable growth.
We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement, more sales, and real, sustainable growth.
With a diverse team of 350+ people spread across three continents, we’re building the leading Chat Marketing platform that is used — and loved — by more than 1.5 million customers worldwide.
Responsibilities
~1 min read- →Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
- →Operate and evolve our EKS clusters powering Python-based AI services
- →Migrate existing services to Kubernetes using Terraform and Helm
- →Codify infrastructure with Terraform and manage host-level automation via Ansible
- →Build and improve CI/CD pipelines with GitHub Actions
- →Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
- →Support OS-level patching, certs, WAF rules, and general infra hygiene
- →Partner with engineers to guide best practices and drive platform reliability
- →Create clean, maintainable infrastructure documentation and playbooks
- →Occasionally support rare off-hours incidents (don’t worry, really rare)
- 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
- Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
- Comfort with running and debugging Python workloads in containers
- Solid understanding of networking, IAM, and cloud security best practices
- Hands-on Nginx experience (Ingress and reverse proxy setups)
- Excellent communication skills; you can explain complex infra to devs clearly
Nice to Have
~1 min read- Strong Ansible skills beyond the basics
- PostgreSQL or Amazon RDS tuning and operations experience
- Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
- Familiarity with PHP production environments
- Experience with TDD, CI/CD best practices, and agile development
- Any previous SRE-like exposure such as building resilience, automation, or incident tooling
What We Offer
~2 min readLocation & Eligibility
Listing Details
- Posted
- March 25, 2026
- First seen
- April 3, 2026
- Last seen
- May 16, 2026
Posting Health
- Days active
- 43
- Repost count
- 0
- Trust Level
- 31%
- Scored at
- May 17, 2026
Signal breakdown
ManyChat is a global Chat Marketing platform that enables businesses to automate conversations and drive sales on messaging apps like Instagram, WhatsApp, and Facebook Messenger. Founded in 2015, it serves over a million businesses worldwide with its user-friendly chatbot builder and automation tools.
View company profilePlease let Manychat know you found this job on Jobera.
Similar Devops Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.