Software Engineer - Infrastructure

United States · San Francisco · Mid-level
Other · DevOps & Infrastructure · Software Engineer · Software Engineering

Technical Tools
anthropic, argocd, aws, datadog, dynamodb, gcp, github-actions, grafana, kafka, kubernetes, mongodb, mysql, new-relic, postgresql, prometheus, pulumi, python, redis, sql, terraform, ci-cd, distributed-systems, linux, microservices, networking, sql-optimization

Emergent builds autonomous coding agents that replace traditional software development by generating, testing, and deploying production applications directly from plain-language intent. Our systems run in production at global scale and are used to build millions of real applications.

Since our public launch, we've crossed $100M in ARR and grown to over 10M users across 190+ countries. We're backed by Khosla Ventures, SoftBank, Google, Lightspeed, Prosus, Together, and Y Combinator.

We're solving the hard part of AI-driven software creation: correctness, reliability, security, and scale in real production systems. The team is built by repeat founders, Olympiad medalists, IIT & IIM alumni, and leaders from Google, Amazon, and Dropbox.

We're hiring builders who want ownership, speed, and impact at global scale.

Responsibilities

Platform and Infrastructure

  • Maintain the stability of our platform, which consists of distributed microservices that interact closely with Kubernetes and cloud providers (GCP, AWS)
  • Manage Kubernetes workloads with ArgoCD (GitOps): deploy, monitor, and troubleshoot application syncs, resource trees, and rollouts
  • Debug and resolve complex Kubernetes issues across clusters
  • Manage CDN and edge infrastructure (Cloudflare) for performance, caching, and traffic management
  • Automate infrastructure lifecycle operations and workflows

Observability and Incident Response

  • Own the observability stack: Grafana (dashboards, Loki logs, Prometheus metrics) and New Relic (APM, golden metrics, transaction analysis)
  • Enhance monitoring, alerting, and distributed tracing across services
  • Participate in on-call rotation via PagerDuty, handle incident response, and perform root cause analysis
  • Proactively identify reliability risks before they become incidents

AI Agent Infrastructure

  • Support the platform that runs AI agent workloads including job scheduling, trajectory tracking, environment provisioning, deployments, and cost attribution
  • Develop Kubernetes controllers and operators to extend platform capabilities for agent orchestration

Collaboration and Internal Tooling

  • Work closely with product and backend teams to ensure platform scalability and reliability
  • Build internal tools, automate workflows, and integrate systems to improve team productivity
  • Stay current with Kubernetes releases, CNCF ecosystem updates, and cloud-native best practices

Core Requirements

  • 3+ years of software/platform engineering experience with production systems
  • Strong proficiency in Go or Python; you write production code in at least one of them daily
  • Hands-on experience building and deploying services on Kubernetes. Not just YAML: you've developed something that runs on K8s
  • Experience with GitOps tooling (ArgoCD, Flux, or similar)

Systems Fundamentals

  • Strong networking and DNS fundamentals: TCP/IP, HTTP, load balancing, DNS resolution, TLS, and debugging connectivity issues
  • Solid Linux/OS fundamentals: process management, filesystem, memory, systemd, and comfort debugging with tools like strace, tcpdump, and netstat

Data and Messaging Infrastructure

  • Relational databases: experience with PostgreSQL, MySQL, or similar; indexing, query optimization, replication, and backup/restore procedures
  • NoSQL databases: familiarity with MongoDB, DynamoDB, Redis, or similar for document/key-value workloads
  • Caching: experience with Redis, Memcached, or similar for application and infrastructure-level caching
  • Message queues and streaming: hands-on with Kafka, SQS, RabbitMQ, or similar for event-driven architectures
  • Strong SQL skills for debugging and operational queries

Infrastructure and Observability

  • Comfortable with the CNCF ecosystem: Helm, Kustomize, cert-manager, Ingress controllers, CNI/CSI interfaces
  • Hands-on with at least one observability stack (Grafana/Prometheus/Loki, New Relic, Datadog, or similar)
  • Familiarity with GCP and/or AWS: managed Kubernetes (GKE/EKS), networking, IAM, storage, and cloud-native services (SES, SQS, S3, etc.)
  • Experience with CDN/edge platforms (Cloudflare, CloudFront, or similar)
  • Experience building Kubernetes Operators (kubebuilder, operator-sdk, or controller-runtime)
  • Experience tuning Kubernetes core components (API server, kubelet, scheduler)
  • Familiarity with AI/LLM infrastructure: token management, cost tracking, agent orchestration
  • Experience with CI/CD pipelines (GitHub Actions, automated testing, deployment pipelines)
  • Infrastructure as Code experience (Terraform, Pulumi, or similar)
  • Previous work on large-scale distributed systems or platform-as-a-service
  • Startup experience; you thrive in fast-paced, ambiguous environments
  • A generalist who can context-switch between debugging a K8s deployment, setting up a Grafana alert, and configuring CDN rules, all in the same day
  • You enjoy solving complex infrastructure challenges and automating away toil
  • You dig deep: when something breaks, you find the root cause, not just the workaround
  • You communicate clearly and can collaborate effectively in a fast-moving, distributed team

What We Offer

401(k)
Health, dental, and vision insurance
Unlimited Paid Time Off: take the time you need to recharge and come back refreshed
Flexible Working Hours: work arrangements that fit your life and commitments

Location & Eligibility

Where is the job: San Francisco, United States (on-site at the office)
Who can apply: US
Listed under: Worldwide

Listing Details

Posted: April 20, 2026
First seen: April 20, 2026
Last seen: June 30, 2026

Posting Health

Days active: 71
Repost count: 0
Trust Level: 23%
Scored at: June 30, 2026
