Quick Summary
Architect and lead the development of our internal evaluation platform, moving the needle from manual testing to a fully automated lifecycle (from LLM-as-a-judge creation to production monitoring).
At Gorgias, we’re building the platform that makes this real: a unified AI agent that sells, supports, and re-engages customers across the entire journey. Conversational Commerce is the future of ecommerce, and we’re leading that shift.
Our mission is to turn every interaction between a brand and its customers into a relationship: personal, seamless, and intelligent. By combining deep product expertise with the latest in AI, we’re making shopping feel more natural, human, and connected than ever before.
To win, we focus relentlessly on:
About the Role
~1 min readMentor & Level Up: Bridge the gap between traditional software engineering and AI. You’ll mentor engineers on how to apply rigorous system design to the world of LLMs and agents.
Continuous Observability: Take ownership of the feedback loop, ensuring that production insights from our agents directly inform the next iteration of our evaluation datasets.
8+ Years of Engineering Excellence: You are a Staff-level engineer first. You’ve built systems that handle high scale, and you know how to architect for long-term maintainability and performance.
Agentic Curiosity: You’ve moved beyond the "chatbot" phase and are actively experimenting with AI Agents. You understand that the challenge isn't the prompt, but the orchestration, state management, and reliability of the agent's actions.
Systems Thinker (Non-Deterministic Mindset): You recognize that AI is probabilistic. You are excited by the challenge of building deterministic "wrappers" and Evaluation loops around models to make them safe for production.
The "Applied" Edge: You likely come from a background in distributed systems, internal platforms, or developer tooling, and you're now applying that rigor to the AI stack.
Beyond the Wrapper: You have serious experience moving beyond simple API calls to architecting multi-stage AI orchestrations (agents, chained workflows, or custom runtime logic).
Orchestration Experience: Even if you aren't an AI researcher, you have experience building complex, multi-step workflows (e.g., temporal systems, state machines, or event-driven architectures) and want to apply this to Agentic loops.
Reliability Obsession: You understand why "vibes-based" testing doesn't work. You’ve started exploring or building Eval frameworks to measure how models perform against real-world data.
Infrastructure Mindset: You are comfortable with the "glue" that makes AI work: vector databases, semantic caching, and API integration with third-party tools.
Strong backend experience (Python preferred)
Experience with distributed systems and event-driven architectures
Familiarity with tools like Kafka, Pub/Sub, or equivalent
Experience working with LLMs (prompting, RAG, agents, evaluation workflows)
Experience building APIs and scalable services
Understanding of monitoring, observability, and system performance
Recruiter phone screen
HM Interview
System Design Interview
AI Case Study (take-home, ~1–2 hours)
Technical Deep Dive of case study
Final Leadership Interview
What We Offer
~2 min read🏖️ 5-week vacation (We follow each country's appropriate PTO Laws)
🤕 Paid sick leave
🧸 Paid parental leave (16 weeks)
💻 MacBook Pro
🏥 Private health insurance and retirement pension (RRSP with Gorgias matching up to 4%)
✈️ For a smooth onboarding, we invite you to our Toronto office for one week (flights and accommodation handled by Gorgias)
💆🏻♀️ Get up to $900 CAD to set up your workstation at home
📚 Get up to $2,600 CAD of learning material per year (books, courses, training, and individual coaching)
🥰 Every quarter, we organize a company-wide summit to discuss where we're going and strengthen social bonds. Once per year we organize offsite team retreats and company retreats!
AI at Gorgias
At Gorgias, AI is a natural extension of how we work and build. Our teams use it every day to research, write, analyze, code, and craft better customer experiences. Everyone has access to premium AI tools (ChatGPT, Claude, Granola & others) and an annual L&D budget to explore new ones.
The real magic happens when we share what we learn. Our #powerup Slack channel is a digital petri dish of new tools and workflows, and each team has AI champions who showcase fresh ideas during weekly company-wide standups, now practically AI demo sessions.
We see AI not as a replacement for creativity or empathy, but as a multiplier, helping us move faster, think deeper, and serve customers better.
AI use in Recruiting at Gorgias
By submitting your application, you agree that Gorgias may collect and process your personal data for recruiting, workforce planning, and related purposes. For more information about how we process your data and your rights, please refer to our Applicant Privacy Policy.
Diversity & Inclusion at Gorgias
We’re committed to creating an inclusive environment where everyone can thrive. We welcome applicants from all backgrounds, experiences, and perspectives because diverse teams drive innovation and better decision-making.
If you need accommodations during the application or interview process, please contact us at accommodation@gorgias.com.
Location & Eligibility
Listing Details
- Posted
- June 24, 2026
- First seen
- June 25, 2026
- Last seen
- June 26, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 52%
- Scored at
- June 25, 2026
Signal breakdown
Please let gorgias know you found this job on Jobera.
3 other jobs at gorgias
View all →Explore open roles at gorgias.
Similar Machine Learning Engineer jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.