Quality & Reliability Engineering Manager

United States·San Franciscolead

EngineeringOtherEngineering ManagerReliability EngineerTech Lead

0 views0 saves0 applied

Apply Now

Quick Summary

Overview

About Us Beast Industries is a multifaceted media and entertainment company founded by Jimmy Donaldson, popularly known as MrBeast, the most watched person in the world.

Technical Tools

EngineeringOtherEngineering ManagerReliability EngineerTech Lead

Beast Industries is a multifaceted media and entertainment company founded by Jimmy Donaldson, popularly known as MrBeast, the most watched person in the world. Renowned for revolutionizing digital content creation, Beast Industries encompasses a diverse portfolio of ventures that extend far beyond its origins on YouTube. With a mission to entertain, inspire, and create significant social impact, Beast Industries operates across various domains including digital media, philanthropy, consumer products, and innovative business initiatives. At Beast Industries, we believe in the transformative power of digital media and its potential to entertain, educate, and effect positive change. Our commitment to innovation, creativity, and philanthropy drives us to explore new frontiers, create unforgettable experiences, and build a legacy that inspires future generations.

Quality & Reliability Engineering Manager

Primary: Bay Area (San Francisco / Peninsula) | Secondary: NYC

We're doing an AI-first engineering rebuild for a company that already has an audience of 100M+ people. This is a zero-to-one build with no legacy constraints. You're here to reinvent what QA looks like in the AI era, giving a high-complexity engineering org the foundation to move faster than ever without sacrificing quality. Alongside that, you'll build the SRE processes and tooling that ensure the highest engineering standards as we scale to massive traffic.

You'll own how Beast Industries ships software that's both correct and resilient at scale, spanning quality engineering and site reliability across consumer-facing platforms. You set the standards and build the systems other teams rely on. That means:

Define and execute AI-driven QA plans and verification processes, combining automated validation with human oversight to catch what matters before it ships.
Build and own the automated test infrastructure: CI/CD test gates, regression suites, load-testing harnesses, and end-to-end coverage across unit, integration, performance, and chaos/resilience testing.
Establish AI-facilitated SRE processes that enforce the highest production quality at scale, serving as the gatekeeper for user experience across all live services.

Responsibilities

~1 min read

→Define and own the org's test strategy and the release-readiness criteria for high-risk launches.
→Establish SLO and error-budget frameworks for critical services, and be the technical authority on go/no-go calls across the team.
→Lead incident response for high-severity events, run blameless postmortems, and drive root-cause fixes on systemic fragility before it becomes a pattern.
→Grow and mentor a team of engineers on quality and reliability thinking, embedding best practices across the org so that high engineering standards become part of how every team ships, not just a gatekeeping function.

AI-Native: You're already using AI daily, and you have a real point of view on where AI-assisted testing and anomaly detection help versus where they just add noise.
Quality + Reliability Hybrid: 8+ year hands-on experience across both software quality engineering and site reliability, with test-automation architecture and reliability systems built for high-traffic distributed production.
Production Owner: You've defined SLO/error-budget frameworks and led incident response for severe production events, and you treat every escaped defect as a systemic problem, not an individual fault.
Builder Who Influences: You're a strong enough engineer to build the tooling and review systems-level code, and you move teams through working systems and evidence rather than mandates.

Deep fluency with observability stacks (metrics, logging, distributed tracing), CI/CD pipelines, automated test methodologies and tooling, and cloud infrastructure. Bonus points for experience in high-volume media or streaming environments, AI-assisted QA tools and processes, and contributions to open-source reliability or testing tooling.