Senior SRE/DevOps Engineer
Quick Summary
About TrueFoundryEvery production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure.A way to route between models. A way to manage tools and integrate them securely.
*** Experience with Golang or Python is must.** 4+ years work experience writing clean production code Well versed with maintaining infrastructure as code (Terraform, Cloudformation etc).
We're TrueFoundry, and we're building it. We're looking for a Senior SRE/DevOps Engineer to join the team.
Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.
The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.
You need a control plane that handles:
- Intelligent routing with observability, cost policies, and fallback logic
- Centralized tool and MCP server management with security and lifecycle controls
- Agent orchestration with governance and guardrails
- A unified compute layer to run self-hosted models, custom tools, and agents
We've built two products to solve this:
Responsibilities
~1 min read- →Write Terraform modules for deploying different component of infrastructure in AWS like Kubernetes, RDS, Prometheus, Grafana, Static Website
- →The SRE will work closely with TrueFoundry customers, gaining a deep understanding of the TrueFoundry platform to ensure smooth deployments, reliable operations, and best practices adoption. This role will also involve training and onboarding new customers, assisting them in implementing TrueFoundry effectively, and helping drive platform adoption and operational excellence across customer teams.
- →Configure networking, autoscaling. continuous deployment, security and multiple environments
- →Make sure the infrastructure is SOC2, ISO 27001 and HIPAA compliant
- →Automate all the steps to provide a seamless experience to developers.
Requirements
~1 min read- 4+ years work experience writing clean production code
- Well versed with maintaining infrastructure as code (Terraform, Cloudformation etc). High proficiency with Terraform / Terragrunt is absolutely critical
- Experience of setting CI/CD pipelines from scratch
- Experience with ETL pipelines, Bigdata infra
- Understanding of common security issues
1 Kubernetes Focused 2 Terraform Focused Round 3 Past Projects Discussion 4 Cultural Fit Round
What We Offer
~1 min read- An opportunity to work on something that really matters
- A fast-paced environment to learn and grow
- High transparency in decision-making
- High autonomy; freedom to take risks, to experiment, and to fail
- Full ownership and autonomy
- There is no glass ceiling for this role that limits your growth
- We promise a meaningful journey and opportunities to learn and grow
Founded by alumni from IIT Kharagpur, UC Berkeley, and ex-FaceBook Engineers, we have had folks from IITs, ISB, Facebook, Amazon, , GoJek, etc. Funded by top global Investors (Sequoia Capital, ENIAC) and angels (Naval Ravikant, Anthony Goldbloom). 2nd-time founders - their previous Postmanus startup (EntHire.co) was acquired by InfoEdge + was selected to be a part of Y Combinator.
Location & Eligibility
Listing Details
- Posted
- November 12, 2025
- First seen
- May 6, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 13%
- Scored at
- May 6, 2026
Signal breakdown
Please let truefoundry know you found this job on Jobera.
4 other jobs at truefoundry
View all →Explore open roles at truefoundry.
Similar Devops Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.