cantina
cantina6mo ago
$120,000 – $180,000/yr

Media Software Engineer, Speech (All Levels)

United StatesUnited States·San Francisco,Sunnyvale,Sunnyvalefull-timemid
Software EngineerSoftware Engineering
1 views0 saves0 applied

Quick Summary

Overview

About Cantina: Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems.

Technical Tools
cppjavascriptcode-reviewdistributed-systemsmachine-learning

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the Role

~1 min read

The Media Team at Cantina is building the real-time infrastructure powering live conversations between people and AI. Our goal is simple but technically challenging: make interacting with AI feel fast, natural, and truly conversational.

We’re looking for a Software Engineer to help improve the speech, audio, and media systems at the heart of the Cantina experience. A major focus of this role is reducing latency and improving responsiveness so AI bots can hear users, process intent, and respond in real time — without awkward pauses or delays.

This team works across everything from low-level media pipelines and WebRTC frameworks to globally distributed infrastructure supporting real-time voice and video interactions across iOS, Android, and web.

If you’re excited by high-performance C++, real-time systems, speech technologies, and building the future of conversational AI, we’d love to talk.

Responsibilities

~1 min read
  • Improve the real-time speech and media systems powering live AI conversations.

  • Reduce latency and optimize responsiveness across audio streaming and speech pipelines.

  • Build new voice and video capabilities that enable more immersive interactions between users and AI bots.

  • Improve and extend our custom WebRTC infrastructure across iOS, Android, and web.

  • Work closely with product and platform teams to shape the future of conversational AI experiences.

While we offer fully remote and hybrid employment opportunities, our Media Engineering team strongly desires candidates to be available (or willing to relocate) to work in the Bay Area. For reference, 95% of the Media Engineering team works from the Bay Area.

What We Offer

~1 min read

The anticipated annual base salary range for this role is between $120,000-$180,000. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Competitive salary and generous company equity
Medical, dental, and vision insurance – 99.99% of premiums covered by Cantina
42 days of paid time off, including:15 PTO days
10 sick days
15 company holidays
2 floating holidays
Generous parental leave & fertility support
401(k) retirement savings plan
Lifestyle spending account – $500/month to use however you’d like
Complimentary lunch and snacks for in-office employees
One Medical membership, and more!

Location & Eligibility

Where is the job
San Francisco, United States
Hybrid — some on-site time required
Who can apply
Open to applicants worldwide

Listing Details

Posted
November 24, 2025
First seen
May 6, 2026
Last seen
June 18, 2026

Posting Health

Days active
43
Repost count
0
Trust Level
29%
Scored at
June 18, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

cantinaMedia Software Engineer, Speech (All Levels)$120k–$180k