Staff TPM - Capacity
Quick Summary
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. This architecture allows Cerebras to deliver industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference services.
This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.
Cerebras works with the leading model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.
About the Role
~1 min readCerebras is building the world's fastest AI inference platform. Every day, we serve billions of inference tokens for leading AI companies including Cognition, Mistral, AlphaSense, IFM, Block, and others on the industry's largest AI accelerator systems.
As demand for AI continues to accelerate, intelligent capacity management becomes one of the company's most strategic challenges. Every customer commitment, model launch, and infrastructure investment depends on making the right capacity decisions at the right time.
We're looking for an experienced Technical Program Manager to lead capacity planning and fleet strategy for our Inference Service organization. This is a highly visible role working directly with Engineering, Product, Infrastructure, SRE, Operations, and executive leadership to maximize utilization of one of the world's most advanced AI inference fleets.
Responsibilities
~1 min read· Run weekly capacity planning and daily capacity and deployment tracking with Engineering, product and operations team. Own fleet utilization reporting and forecasting
· Drive capacity planning for new customer deployments and major model launches
· Drive continuous improvement and stakeholder adoption of new capacity management platform
· Drive org level strategic initiatives related to capacity expansion, improving fleet efficiency and maximizing effective utilization of available systems
· Lead planning around major infrastructure events including but not limited to new customer commits, new model releases, change to DC/cluster architecture, etc. that impacts capacity and fleet utilization. Update capacity plans and forecasts accordingly.
· Maintain Jira EPICs and Confluence pages related to capacity planning, reporting and change management to ensure execution transparency across teams
Requirements
~1 min read· 5+ years of TPM, technical program management, or product operations experience in cloud infrastructure, large-scale ML serving, or hyperscaler capacity planning
· Experience leading large cross-functional programs involving Engineering, Product, and Operations
· Comfort with the inference serving stack: model replicas, batching, prefill/decode, KV cache, accelerator scheduling
· Strong data fluency: SQL, Grafana, basic Python or Flux to pull your own numbers without waiting for an analyst
· Track record of running a recurring cross-functional ritual involving senior engineers and LT
· Direct experience with AI accelerator fleet operations such as Habana, TPU pods, Inferentia, Trainium
What We Offer
~1 min readPeople who are serious about software make their own hardware. At Cerebras, we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.
Location & Eligibility
Listing Details
- Posted
- June 29, 2026
- First seen
- June 29, 2026
- Last seen
- June 29, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 57%
- Scored at
- June 29, 2026
Signal breakdown
Cerebras Systems is revolutionizing AI acceleration with its innovative hardware solutions designed to enhance deep learning capabilities.
View company profilePlease let Cerebras Systems know you found this job on Jobera.
3 other jobs at Cerebras Systems
View all →Explore open roles at Cerebras Systems.
Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.