Zoox
Zoox2d ago
New
USD 226000–307000/yr

Senior/Staff Software Engineer - Machine Learning & System Optimization

United StatesUnited States·Foster CityFull-timesenior
Software EngineerSoftware Engineering
0 views0 saves0 applied

Quick Summary

Overview

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

Technical Tools
Software EngineerSoftware Engineering

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

As a Machine Learning and System Optimization Engineer, you will orchestrate and allocate overall system capacity to various core perception models running on-bot, as well as drive large initiatives that allow for more efficient inference by sharing various parts of the perception stack with one another.

You will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience compressing, accelerating, and deploying complex models, including LLMs, VLMs, or foundation models, for power- and thermal-constrained vehicle SoCs.

In addition, you will optimize ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

  • Allocate and distribute system resources (CPU/GPU/interconnect) to various models and inference engines running on the robot.

  • Spearhead cross-cutting initiatives that allow for better compute utilization through sharing/fusing models and better scheduling strategies.

  • Optimize large-scale models (Multi-Modal Sensor Fusion models, LLMs, VLMs) using advanced quantization (PTQ, QAT), pruning, mixed-precision inference frameworks, and parameter-efficient fine-tuning (LoRA, QLoRA).

  • Architect and implement model conversion and compilation pipelines using TensorRT for edge deployment.

  • Write production-level, low-latency, and memory-safe C++ and CUDA code for real-time inference on vehicle systems.

  • Deep experience in system and performance optimization in CPU/GPU systems designed for low latency or high throughput.

  • Deep expertise in working with real-time systems & required constraints such as processing latency, memory utilization, and memory bandwidth pressure.

  • Deep expertise in model quantization (PTQ, QAT) and mixed-precision inference frameworks (INT8, FP8, FP4, BF16/FP16).

  • Proficiency in low-level programming for AI accelerators, specifically developing and optimizing custom ML OPs and TensorRT Plugins with efficient CUDA kernel implementations.

  • Production-level C++ (14/17/20) and Python programming skills, with experience developing concurrent, memory-safe, real-time inference code for edge devices.

  • Prior experience in high-performance robotics applications such as AV/drones/robots.

  • Familiarity with SOTA autonomous driving perception algorithms (temporal 3D object detection, BEV, 3D Occupancy Networks) and multi-modal sensor processing (Vision, LiDAR, Radar).

  • Experience with end-to-end autonomous driving paradigms (VLM/VLA models, Foundation models) and edge deployment technologies (e.g., TensorRT-LLM).

  • About Zoox
    Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.


    Accommodations
    If you need an accommodation to participate in the application or interview process please reach out to accommodations@zoox.com or your assigned recruiter.

    A Final Note:
    You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

    Location & Eligibility

    Where is the job
    Foster City, United States
    Hybrid — some on-site time required
    Who can apply
    US

    Listing Details

    Posted
    June 30, 2026
    First seen
    July 3, 2026
    Last seen
    July 3, 2026

    Posting Health

    Days active
    0
    Repost count
    0
    Trust Level
    81%
    Scored at
    July 3, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Zoox
    Zoox
    lever

    Zoox, a subsidiary of Amazon, designs fully autonomous vehicles focusing on making urban transportation safer and more efficient.

    Employees
    3k+
    Founded
    2014
    Domain
    zoox.com
    View company profile
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    ZooxSenior/Staff Software Engineer - Machine Learning & System OptimizationUSD 226000–307000