$180,000 – $300,000/yr

Member of Technical Staff - Infrastructure Engineer

Freiburg (germany), San Francisco (usa)lead
EngineeringDevOps & InfrastructureMember Of Technical Staff
0 views0 saves0 applied

Quick Summary

Overview

About Black Forest Labs We’re the team behind Latent Diffusion, Stable Diffusion, and FLUX—foundational technologies that changed how the world creates images and video.

Technical Tools
EngineeringDevOps & InfrastructureMember Of Technical Staff

We’re the team behind Latent Diffusion, Stable Diffusion, and FLUX—foundational technologies that changed how the world creates images and video. We’re creating the generative models that power how people make images and video—tools used by millions of creators, developers, and businesses worldwide. Our FLUX models are among the most advanced in the world, and we’re just getting started.

Headquartered in Freiburg, Germany with a growing presence in San Francisco, we’re scaling fast while staying true to what makes us different: research excellence, open science, and building technology that expands human creativity.

We're looking for engineers to build and maintain the engine that powers our mission to develop visual intelligence. From maintaining and scaling clusters, to building research platforms to accelerate the rate of innovation, this team operates with large breadth and depth. We build the systems to make multi-week/month long training possible, to orchestrate resources at scale, and at the same time efficiently, enabling the next breakthrough model. If you’re obsessed with distributed systems at scale, infrastructure reliability, scalability, security, and continuous improvement, this team would be perfect for you.

  • Maintain research infrastructure, ensuring health, and optimizing components to extract peak performance from the system (both on application, and infrastructure side)
  • Scale infrastructure to meet growing research demands while maintaining reliability and performance
  • Collaborate with research teams to deeply understand their infrastructure needs, and design solutions that balance performance with cost efficiency.
  • Identify and resolve performance bottlenecks and capacity hotspots through deep analysis of distributed systems at scale.
  • Build and evolve telemetry and monitoring systems to provide deep visibility into infrastructure performance, utilization, and costs across our cloud and datacenter fleets.
  • Participate in on-call rotations and incident response to maintain system reliability
  • Python, Bash, Go
  • Kubernetes
  • Nvidia GPU drivers, and operators
  • OTel, Prometheus
  • Experience building or operating large-scale training platforms
  • Worked with large scale compute clusters (GPUs)
  • Proven ability to debug performance and reliability issues across large distributed fleets
  • Strong problem-solving skills and ability to work independently
  • Strong communication skills and the ability to work effectively with both internal and external partners
  • Deep knowledge of modern cloud infrastructure including Kubernetes, Infrastructure as Code, AWS, and GCP
  • Experience with SLURM
  • Experience building or operating large-scale training platforms

We’re a distributed team with real offices that people actually use. Depending on your role, you’ll either join us in Freiburg or SF at least 2 days a week (or one full week every other week), or work remotely with a monthly in-person week to stay connected. We’ll cover reasonable travel costs to make this possible. We think in-person time matters, and we’ve structured things to make it accessible to all. We’ll discuss what this will look like for the role during our interview process.

  • Obsessed: We build beautifully crafted, scientifically rigorous products by deeply understanding problems from first principles; and never shipping anything we’re not proud of.
  • Low Ego: Prioritizing the best idea over personal ownership, where titles hold no authority, credit is shared, and no task is beneath anyone.
  • Bold: We ship bold ideas early, improve fast, and take ambitious bets, without sacrificing quality for speed.
  • Kind: We treat each other with genuine care, speaking directly and kindly even when conversations are hard.

If this sounds like work you’d enjoy, we’d love to hear from you.

 

Base Annual Salary: $180,000–$300,000 USD

 

This role is based in our Freiburg / San Francisco office. We operate a hybrid model and cover reasonable travel costs — relocation is encouraged but not required. We do expect a meaningful in-person presence, and we'll discuss what that looks like for your situation during the process.

Listing Details

Posted
April 14, 2026
First seen
March 26, 2026
Last seen
April 14, 2026

Posting Health

Days active
18
Repost count
0
Trust Level
61%
Scored at
April 14, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trustcandidate experience

3 other jobs at Blackforestlabs

View all →

Explore open roles at Blackforestlabs.

Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

B
Member of Technical Staff - Infrastructure Engineer$180k–$300k