Staff TPM - Capacity

Headquarters/sunnyvale Officefull-timelead
OtherTpm
0 views0 saves0 applied

Quick Summary

Overview

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.

Technical Tools
OtherTpm

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. This architecture allows Cerebras to deliver industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference services.

This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

Cerebras works with the leading model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.

About the Role

~1 min read

Cerebras is building the world's fastest AI inference platform. Every day, we serve billions of inference tokens for leading AI companies including Cognition, Mistral, AlphaSense, IFM, Block, and others on the industry's largest AI accelerator systems.

As demand for AI continues to accelerate, intelligent capacity management becomes one of the company's most strategic challenges. Every customer commitment, model launch, and infrastructure investment depends on making the right capacity decisions at the right time.

We're looking for an experienced Technical Program Manager to lead capacity planning and fleet strategy for our Inference Service organization. This is a highly visible role working directly with Engineering, Product, Infrastructure, SRE, Operations, and executive leadership to maximize utilization of one of the world's most advanced AI inference fleets.

Responsibilities

~1 min read

· Run weekly capacity planning and daily capacity and deployment tracking with Engineering, product and operations team. Own fleet utilization reporting and forecasting

· Drive capacity planning for new customer deployments and major model launches

· Drive continuous improvement and stakeholder adoption of new capacity management platform

· Drive org level strategic initiatives related to capacity expansion, improving fleet efficiency and maximizing effective utilization of available systems

· Lead planning around major infrastructure events including but not limited to new customer commits, new model releases, change to DC/cluster architecture, etc. that impacts capacity and fleet utilization. Update capacity plans and forecasts accordingly.

· Maintain Jira EPICs and Confluence pages related to capacity planning, reporting and change management to ensure execution transparency across teams

Requirements

~1 min read

· 5+ years of TPM, technical program management, or product operations experience in cloud infrastructure, large-scale ML serving, or hyperscaler capacity planning

· Experience leading large cross-functional programs involving Engineering, Product, and Operations

· Comfort with the inference serving stack: model replicas, batching, prefill/decode, KV cache, accelerator scheduling

· Strong data fluency: SQL, Grafana, basic Python or Flux to pull your own numbers without waiting for an analyst

· Track record of running a recurring cross-functional ritual involving senior engineers and LT

· Direct experience with AI accelerator fleet operations such as Habana, TPU pods, Inferentia, Trainium

What We Offer

~1 min read

People who are serious about software make their own hardware. At Cerebras, we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.
Publish and open source their cutting-edge AI research.
Work on one of the fastest AI supercomputers in the world.
Enjoy job stability with startup vitality.
Our simple, non-corporate work culture that respects individual beliefs.

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Location & Eligibility

Where is the job
Headquarters/sunnyvale Office
On-site at the office
Who can apply
Same as job location

Listing Details

Posted
June 29, 2026
First seen
June 29, 2026
Last seen
June 29, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
57%
Scored at
June 29, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Cerebras Systems

Cerebras Systems is revolutionizing AI acceleration with its innovative hardware solutions designed to enhance deep learning capabilities.

Employees
350
Founded
2016
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

Cerebras SystemsStaff TPM - Capacity