Lemurian Labs6mo ago

Senior ML Performance Engineer

San FranciscoFull-timesenior

OtherPerformance Engineer

2 views0 saves0 applied

Apply Now

Quick Summary

Key Responsibilities

We're looking for a Senior ML Performance Engineer to architect and lead our Performance Testing Platform from the ground up.

Technical Tools

cppdockerkubernetespythonpytorchtensorflowterraformci-cd

About Us

At Lemurian Labs, we're on a mission to bring the power of AI to everyone—without leaving a massive environmental footprint. We care deeply about the impact AI has on our society and planet, and we're building a solid foundation for its future, ensuring AI grows sustainably and responsibly. Innovation should help the world, not harm it.

We are building a high-performance, portable compiler that lets developers "build once, deploy anywhere." Yes, anywhere. We're talking about seamless cross-platform compatibility, so you can train your models in the cloud, deploy them to the edge, and everything in between—all while optimizing for resource efficiency and scalability.

If the idea of sustainably scaling AI motivates you and you're excited about making AI development both powerful and accessible, then we'd love to have you. Join us at Lemurian Labs, where you can have fun building the future—without leaving a mess behind.

The Role

We're looking for a Senior ML Performance Engineer to architect and lead our Performance Testing Platform from the ground up. You'll be the technical authority on how we measure, validate, and optimize the performance of large language models (Llama 3.2 70B, DeepSeek, and others) before and after compiler optimization on modern GPU architectures.

This is a high-impact role where you'll directly influence our product quality and our customers' success. You'll work at the intersection of ML systems, GPU architecture, and performance engineering—building the infrastructure that proves our compiler delivers real value.

Design and build a comprehensive performance testing platform for evaluating LLM inference workloads across GPU clusters

Define and implement the benchmarking methodology, metrics, and test suites that measure latency, throughput, memory utilization, power consumption, and model accuracy

Establish baseline performance for unoptimized models (Llama 3.2 70B, DeepSeek, etc.) and validate post-optimization improvements

Develop automated testing pipelines for continuous performance validation across compiler releases and model updates

Investigate performance bottlenecks using profiling tools (ROCm profilers, GPU traces, system-level monitoring) and work with the compiler team to drive optimizations

Create dashboards and reporting that provide clear visibility into performance trends, regressions, and wins

Collaborate cross-functionally with compiler engineers, ML engineers, and DevOps to ensure performance testing is integrated into our development workflow

Document best practices for performance testing and optimization of ML workloads on GPU hardware

7+ years of experience in performance engineering, benchmarking, or systems engineering roles

Deep understanding of ML inference workloads, particularly transformer-based models and LLMs

Hands-on experience with GPU programming and optimization (CUDA, ROCm, or similar)

Strong programming skills in Python and C/C++

Proven track record of building performance testing infrastructure or benchmarking platforms from scratch

Experience with ML frameworks (PyTorch, TensorFlow, ONNX Runtime, vLLM, TensorRT-LLM, etc.)

Proficiency with profiling and debugging tools for GPU workloads

Strong analytical skills with the ability to design experiments, analyze results, and communicate findings clearly

Experience with CI/CD systems and test automation frameworks

Experience with AMD GPUs (Mi200/Mi300 series) and ROCm ecosystem

Knowledge of compiler optimization techniques and their impact on performance

Experience with distributed inference and multi-GPU workloads

Familiarity with ML model quantization, pruning, and other optimization techniques

Background in high-performance computing or systems-level optimization

Experience with infrastructure-as-code (Kubernetes, Docker, Terraform)

Contributions to open-source ML or systems projects

Obsessive about details — you notice the 2% regression that others miss

Self-driven — you take ownership and don't wait for permission to solve problems

Collaborative mindset — you work well across teams and help others succeed

Passionate about sustainability — you care about making AI more efficient and environmentally responsible

Clear communicator — you can explain complex technical concepts to both engineers and stakeholders

Location & Eligibility

Where is the job

Sf Bay Area

Hybrid — some on-site time required

Who can apply

Same as job location

Listed under

Worldwide

Listing Details

Posted: October 31, 2025
First seen: March 26, 2026
Last seen: May 15, 2026

Posting Health

Days active: 49
Repost count: 0
Trust Level: 25%
Scored at: May 15, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust

Apply for this position

Lemurian Labs

lever

Employees

Founded

2018

Domain

Jobs

External application · ~5 min on Lemurian Labs's site

Please let Lemurian Labs know you found this job on Jobera.

3 other jobs at Lemurian Labs

View all →

Explore open roles at Lemurian Labs.

Compiler Code Gen Engineer

Similar Performance Engineer jobs

View all →

supply-chain

Performance Engineer II

$55k–$90k/yr

fullTime

Nice

Senior Performance Engineer, CX (JMeter/Blazemeter)

Solar Performance Engineer (O&M)

mase

Performance Engineer (IC3)

indrive

Senior Performance Engineer

Full Time

Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

Join 12,000+ marketers

No spam. Unsubscribe at any time.

Senior ML Performance Engineer

Apply Now

Senior ML Performance Engineer

Quick Summary

Location & Eligibility

Listing Details

Posting Health

3 other jobs at Lemurian Labs

Similar Performance Engineer jobs

Browse Similar Jobs

Stay ahead of the market