Field Reliability Engineer- LATAM
Quick Summary
What We’re Building Honeycomb is a service for the near and present future, defining observability and raising expectations of what developer tools can do!
- Own and operate customer-facing managed infrastructure including Refinery as a Service (RaaS) and Honeycomb Private Cloud (HnyPC) deployments across multiple AWS accounts and regions.
- Build and maintain Terraform modules, Helm charts, and deployment automation for provisioning and managing customer EKS clusters, collector pools, and Refinery instances.
- Design and implement monitoring, alerting, and observability for managed service infrastructure - using Honeycomb to monitor Honeycomb.
- Manage scaling, upgrades, and incident response for customer deployments, including capacity planning and cost optimization across AWS infrastructure.
- Building autonomous deployment and management tooling for field-operated managed services.
- Serve as the senior technical escalation point for our most challenging customer situations - production incidents, complex collector configurations, Refinery tuning, and architecture reviews that exceed the scope of standard technical roles.
- Diagnose and resolve deep infrastructure and observability issues spanning distributed systems, Kubernetes clusters, AWS networking (ALBs, PrivateLink, NLBs, VPCs), and polyglot service meshes.
- Partner directly with customer SRE, platform, and engineering teams to troubleshoot real-time production issues, often under time pressure and with direct revenue impact.
- Participate in an on-call rotation for managed services (Refinery as a Service, Honeycomb Private Cloud), providing Tier 2 escalation support for customer-facing infrastructure issues.
- Build and maintain SOPs, runbooks, and diagnostic frameworks that accelerate resolution for the broader field and support teams.
- Contribute to and maintain OpenTelemetry distributions, collectors, exporters, and instrumentation libraries that our customers depend on.
- Represent Honeycomb in the OpenTelemetry community - participating in SIGs, reviewing PRs, triaging issues, and driving adoption of best practices.
- Build reference architectures, sample collector configurations, and integration guides that demonstrate effective instrumentation patterns for common customer environments (Kubernetes, ECS, serverless).
- Identify gaps in the open source ecosystem that create friction for customers and either contribute fixes upstream or build bridging solutions.
- Contribute features and improvements to Honeycomb’s own open source projects (Refinery, Honeycomb Collector Distro) to support managed service capabilities.
- Be the person Solutions Architects call when a deal goes deeper than demo and design - you join calls to troubleshoot live production environments, validate architecture decisions, and provide the infrastructure credibility that closes technical evaluations.
- Tag-team with SAs on strategic accounts, owning the infrastructure and data pipeline conversations while they own the product narrative.
- Lead architecture reviews, SLO workshops, and instrumentation deep-dives for customers evaluating or expanding Honeycomb - especially in complex environments (multi-cluster Kubernetes, hybrid cloud, high-cardinality workloads).
- Step into customer-facing POCs and pilots as the hands-on technical lead, standing up collector pools, configuring Refinery pipelines, and proving out integrations in the customer’s actual environment.
- Create feedback loops between the field and product/engineering, surfacing patterns from customer environments that inform roadmap priorities.
Build internal tools and UIs that improve the operational efficiency of managed services - deployment dashboards, rule management interfaces, monitoring tooling.
Partner with Solutions Architecture, Customer Success, and Support to provide technical depth on complex accounts.
Collaborate with Product and Engineering on customer-impacting bugs, feature gaps, and integration challenges - bringing real-world production context.
Contribute to field enablement by training internal teams on advanced troubleshooting, collector configuration, Refinery internals, and emerging reliability patterns.
- A stake in our success - generous equity with employee-friendly stock program
- It’s not about how strong of a negotiator you are - our pay is based on transparent levels relative to experience
- Time to recharge with unlimited PTO
- A distributed-first mindset and culture (really!)
- Home office, co-working, and internet stipend
- Full benefits coverage for employees, with additional coverage available for dependents
- Up to 16 weeks of paid parental leave, regardless of path to parenthood
- Annual development allowance
- And much more...
- All communications will come from an @honeycomb.io email address
- We occasionally work with external recruiting agencies. These partners will use legitimate business email addresses—never personal accounts like Gmail or Yahoo.
- Our recruiting process will never ask you to provide financial or sensitive personal information, including but not limited to:
- Social security or tax identification numbers
- Credit card numbers
- Bank account information
Location & Eligibility
Listing Details
- Posted
- June 24, 2026
- First seen
- June 24, 2026
- Last seen
- June 25, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 68%
- Scored at
- June 24, 2026
Signal breakdown
Honeycomb is a premier observability platform that empowers engineering teams to gain insights into their software systems, offering tools for faster issue resolution and enhanced user experiences.
View company profilePlease let Honeycomb know you found this job on Jobera.
3 other jobs at Honeycomb
View all →Explore open roles at Honeycomb.
Similar Reliability Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.