Member of Technical Staff - Large Scale Data Infrastructure
Quick Summary
ffmpeg, PyAV, codec fundamentals What We’r
We’re the team behind Latent Diffusion, Stable Diffusion, and FLUX—foundational technologies that changed how the world creates images and video. We’re creating the generative models that power how people make images and video—tools used by millions of creators, developers, and businesses worldwide. Our FLUX models are among the most advanced in the world, and we’re just getting started.
Headquartered in Freiburg, Germany with a growing presence in San Francisco, we’re scaling fast while staying true to what makes us different: research excellence, open science, and building technology that expands human creativity.
We're looking for infrastructure engineers who want to work at peta-to-exabyte scale. You'll build the data systems behind the largest training runs on thousands of GPUs, where fixing one bottleneck lets researchers train the next breakthrough model.
- Scalable data loaders for training runs across thousands of GPUs
- Efficient storage and retrieval systems for petabyte-scale datasets
- Multi-cloud object storage abstraction
- Execute large-scale data migrations across storage systems and providers
- Debug and resolve performance bottlenecks in distributed data loading
- Python, PyTorch DataLoader internals
- Object storage (e.g. S3, Azure Blob, GCS)
- Parquet for metadata
- Video: ffmpeg, PyAV, codec fundamentals
- Built and operated data pipelines at petabyte scale
- Optimized data loading
- Worked with petabyte-scale video and image datasets
- Written processing jobs operating on millions of files
- Debugged distributed system bottlenecks across large fleets of machines
Nice to Have
~1 min read- Experience streaming dataset formats (e.g. WebDataset)
- Video codec internals and frame-accurate seeking
- Distributed systems experience
- Slurm and Kubernetes for job orchestration
- Experience with object storage performance tuning across providers
We’re a distributed team with real offices that people actually use. Depending on your role, you’ll either join us in Freiburg or SF at least 2 days a week (or one full week every other week), or work remotely with a monthly in-person week to stay connected. We’ll cover reasonable travel costs to make this possible. We think in-person time matters, and we’ve structured things to make it accessible to all. We’ll discuss what this will look like for the role during our interview process.
- Obsessed: We build beautifully crafted, scientifically rigorous products by deeply understanding problems from first principles; and never shipping anything we’re not proud of.
- Low Ego: Prioritizing the best idea over personal ownership, where titles hold no authority, credit is shared, and no task is beneath anyone.
- Bold: We ship bold ideas early, improve fast, and take ambitious bets, without sacrificing quality for speed.
- Kind: We treat each other with genuine care, speaking directly and kindly even when conversations are hard.
If this sounds like work you’d enjoy, we’d love to hear from you.
Base Annual Salary (SF based role) : $180,000–$300,000 USD + Equity
Listing Details
- Posted
- April 14, 2026
- First seen
- March 26, 2026
- Last seen
- April 14, 2026
Posting Health
- Days active
- 18
- Repost count
- 0
- Trust Level
- 61%
- Scored at
- April 14, 2026
Signal breakdown
Please let Blackforestlabs know you found this job on Jobera.
4 other jobs at Blackforestlabs
View all →Explore open roles at Blackforestlabs.
Similar Member of Technical Staff - Large Scale Data Infrastructure jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.