Fal

Software Engineer, Distributed Systems


Technical Tools
python, rust, concurrency, distributed-systems, networking

fal is the generative media ecosystem powering the next generation of AI products. We build the infrastructure, tools, and model access that teams need to move from idea to production, and do it at scale without compromise. For developers and enterprises, fal is the foundation that makes generative media not just possible, but practical: a unified platform where high-performance inference, orchestration, and observability come together to unlock new categories of AI-native products.

As generative media reshapes industries across a market projected to grow by hundreds of billions over the next decade, fal is becoming the ecosystem that ambitious teams build on.

About the Role

You are an experienced software engineer who thrives on building large-scale computing platforms. You have deep expertise in large-scale distributed systems that handle high complexity and large volumes of traffic and data. You know how to achieve reliability and scale with minimal operational load.

Responsibilities

  • Build our core Python/Rust platform: request routing, AI workload orchestration, scheduling, GPU autoscaling, large-scale file storage, queueing, etc.
  • Produce forward designs for platform evolution as we scale to 100x current traffic and need to provide low latency across the world
  • Use AI extensively to automate the mundane parts of building complex but reliable systems
  • Profile and tune low-level CPU and memory performance

Requirements

  • 3+ years of experience building distributed compute and orchestration platforms in Python or Rust
  • Strong understanding of distributed systems fundamentals: consensus, scheduling, fault tolerance, capacity planning
  • Deep understanding of computational complexity and memory allocation
  • Track record of designing systems that scale under real production load
  • Experience building and using observability to drive performance and reliability decisions
  • Excellent communication and ability to drive technical decisions across teams
  • Self-starter who executes quickly, takes ownership, and constantly seeks improvement

Nice to Have

  • Experience with AI/ML inference or training infrastructure
  • Experience with high-performance systems programming (async runtimes, zero-copy, memory-safe concurrency)
  • Background in building multi-tenant compute platforms
  • Understanding of networking fundamentals and performance characteristics
  • Familiarity with GPU workload characteristics and scheduling constraints

What We Offer

  • $180,000-$250,000 plus equity and benefits (this range spans all three levels: Mid, Senior, and Staff)
  • San Francisco, CA (willing to consider remote for Senior and Staff levels)
  • Interesting and challenging work
  • Many learning and growth opportunities
  • We are currently hiring in downtown San Francisco and offer relocation assistance to San Francisco
  • Health, dental, and vision insurance (US)
  • Regular team events and offsites

Location & Eligibility

Where is the job: San Francisco (on-site at the office)
Who can apply: Same as job location
Listed under: Worldwide

Listing Details

Posted: February 23, 2026
First seen: March 26, 2026
Last seen: May 8, 2026

Posting Health

Days active: 42
Repost count: 0
Trust Level: 31%
Scored at: May 8, 2026

Fal
greenhouse
Employees: 5
Founded: 2004