Software Engineer, Data Infrastructure & Acquisition
Quick Summary
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books,
The mission of Speechify is to make sure that reading is never a barrier to learning.
Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.
Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.
We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.
Responsibilities
~1 min read- →Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
- →Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
- →Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
- →Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of industry experience in software development.
- Proficiency with bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
- Experience with web crawlers, large-scale data processing workflows is a plus
- Ability to handle multiple tasks and adapt to changing priorities.
- Strong communication skills, both written and verbal.
What We Offer
~1 min readTell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.
Refer them!
Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
Listing Details
- Posted
- April 19, 2026
- First seen
- March 26, 2026
- Last seen
- April 19, 2026
Posting Health
- Days active
- 23
- Repost count
- 0
- Trust Level
- 64%
- Scored at
- April 19, 2026
Signal breakdown

Speechify is an AI-powered text-to-speech application that converts text from various formats into natural-sounding audio, helping users read faster and comprehend more. Founded by Cliff Weitzman to overcome his dyslexia, the platform aims to make reading accessible to everyone.
View company profilePlease let Speechify know you found this job on Jobera.
4 other jobs at Speechify
View all →Explore open roles at Speechify.
Similar Software Engineer, Data Infrastructure & Acquisition jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.