Senior Software Engineer - API Gateway
Quick Summary
About the Role Featherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.
The API gateway is managed by the Platform Team, who aim to make Featherless the best place to find and use models. As a member of the platform team, you will undertake feature development and bug fixes to keep up with clients, resolve user issues,…
About the Role
~1 min readFeatherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.
We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible for
authentication and inference to all models
subscription management and subscription entitlement (e.g. context-length, concurrency limits)
and providing the necessary API surface for applications and builders
API Gateway is constantly evolving in response to the unending stream of new models, modalities, clients and inference load.
Responsibilities
~1 min readThe API gateway is managed by the Platform Team, who aim to make Featherless the best place to find and use models. As a member of the platform team, you will
- →
undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models
- →
improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)
- →
respond to availability incidents
- →
triage and resolve issues of inference quality and reliability
- →
manage the infrastructure on which our gateway runs
first-hand experience of the user’s we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLM)
experience with the technologies and paradigms of the web (REST, websockets, DNS, networking, opentelemetry)
experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)
ability to debug complex issues across a wide stack and build instrumentation as necessary
desire to work collaboratively as part of a skilled team
Alignment with team and company values, including
bias to action
responsiveness to users (bug-fixes over features)
instinct to iterate
subscribing to that done means proven by usage data
This team operates on Eastern Time. We are remote, but with a preference to hire in Toronto, Canada.
Location & Eligibility
Listing Details
- Posted
- January 14, 2026
- First seen
- May 6, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 23%
- Scored at
- May 6, 2026
Signal breakdown
Please let featherlessai know you found this job on Jobera.
4 other jobs at featherlessai
View all →Explore open roles at featherlessai.
Similar Software Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.