A
Asapp 22mo ago
USD 170000–190000/yr

Lead AI/ML Engineer

United StatesUnited StatesFull-timelead
Data ScienceOtherMachine Learning EngineerData
5 views0 saves0 applied

Quick Summary

Overview

At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes.

Technical Tools
anthropicawsdockerkubernetesopenaipythonpytorchtensorflowab-testingci-cdmachine-learningmentoringsystem-design
At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we’re guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed, ownership, and a relentless focus on outcomes. We work in tight, skilled teams, prioritize clarity over complexity, and continuously evolve through curiosity, data, and craftsmanship. We’re seeking technologists and problem solvers who thrive in fast-paced environments, love collaborating with great talent, and approach every day like it’s Day 1.
 
We're a globally diverse team with hubs in New York City, Mountain View, Latin America, and India—embracing both hybrid and remote work to bring the best minds together, wherever they are. If you're driven by continuous learning, rapid pivots, and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey.
 
You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio pipelines. This role requires a hands-on technical leader who can architect low-latency, highly reliable conversational voice systems and guide a team through ambiguity toward production excellence.
 
We are looking for someone who understands the unique constraints of voice experiences, latency, turn-taking, interruption handling, streaming inference, and audio quality, and can translate these into scalable, enterprise-grade systems.
 
This is a hybrid role with weekly in-person responsibilities. We have offices in New York City and Mountain View, CA
  • Build real-time conversational AI systems, including voice interfaces powered by speech-to-text, text-to-speech, and streaming inference pipelines
  • Design and optimize low-latency inference workflows for multimodal applications involving text, speech, and real-time interactions
  • Integrate and apply foundation models from major providers (OpenAI, AWS Bedrock, Anthropic, etc.) for prototyping and production use cases
  • Adapt, evaluate, and optimize LLMs for domain-specific enterprise applications
  • Build and maintain infrastructure for experimentation, deployment, and monitoring of AI models in production
  • Improve model performance and inference workflows with attention to latency, cost, and reliability
  • Provide technical leadership within the team, mentoring engineers and promoting best practices in ML engineering
  • Partner with product and cross-functional stakeholders to translate requirements into scalable ML solutions
  • Contribute to the evolution of internal standards for experimentation, evaluation, and deployment
  • 6+ years of experience in Machine Learning or AI systems, with hands-on experience in LLMs, speech, or conversational AI systems
  • Experience building on integrating speech-to-text and text-to-speech systems
  • Strong experience integrating voice models into production applications
  • Proficiency on Python and ML frameworks like PyTorch or TensorFlow
  • Proven experience leading complex, cross-functional AI initiatives
  • Deep understanding of latency-sensitive system design and distributed architectures
  • Strong proficiency in Python and ML frameworks such as PyTorch or TensorFlow
  • Understanding of RAG pipelines, prompt engineering, and vector search
  • Experience deploying and scaling AI systems using AWS (required), Docker, Kubernetes, and CI/CD practices
  • Strong communication skills with the ability to align engineering, product, and executive stakeholders
  • Comfortable operating in fast-paced environments and driving clarity in ambiguous problem spaces
  • Experience with speech model fine-tuning and acoustic/language model optimization
  • Experience with production applications of S2S models
  • Hands-on experience with real-time or streaming audio systems (WebRTC, gRPC streaming, or similar architectures)
  • Experience optimizing TTS prosody, pronunciation control, and voice customization
  • Background in MLOps, experimentation platforms, or evaluation frameworks for speech and conversational systems
  • Contributions to open-source AI or speech tooling
  • Graduate degree (MS or PhD) in Computer Science, Machine Learning, Speech Processing, or related field
  • Location & Eligibility

    Where is the job
    United States
    Hybrid within the country
    Who can apply
    US
    Listed under
    United States

    Listing Details

    Posted
    March 5, 2026
    First seen
    March 26, 2026
    Last seen
    May 15, 2026

    Posting Health

    Days active
    49
    Repost count
    0
    Trust Level
    37%
    Scored at
    May 15, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    A
    Lead AI/ML EngineerUSD 170000–190000