Senior Customer Reliability Engineer, Infrastructure - Hyderabad, India
Quick Summary
Provide solutions to customers to make them successful using our products.
For this role, you will embrace a flexible hybrid work model, working at least 3 days per week at our Office in Hyderabad while de
Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 800 of the world's leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.
About the Role
~1 min readThe Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service.
The CRE are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.
As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents either raised by a customer, or from our monitoring system and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.
This role is directly customer-facing and gives exposure to very diverse problems and requirements. The CRE get the opportunity to interface with customers from a variety of industries across different cloud providers, and all with different expectations. Your contributions will directly impact customers' success with using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.
Responsibilities
~1 min readExperience managing a Production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure)
Strong Network Experience with one of the major Clouds
Strong Linux experience
Knowledge of how to operate and monitor issues for distributed systems
Experience with Observability tools
Previous experience in handling customers issues (internal and external)
Strong Communication Skills
DevOps or CI/CD experience
Good troubleshooting Skills
Nice to Have
~1 min readWorked with Kubernetes Custom Resources
Depth of knowledge with Azure
Airflow/Big Data Orchestration experience
IaC experience
#LI-Fulltime
#LI-Hybrid
At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Location & Eligibility
Listing Details
- Posted
- June 1, 2026
- First seen
- June 1, 2026
- Last seen
- June 2, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 54%
- Scored at
- June 1, 2026
Signal breakdown
Please let astronomer know you found this job on Jobera.
3 other jobs at astronomer
View all →Explore open roles at astronomer.
Similar Reliability Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.