Staff Machine Learning Engineer
Quick Summary
Define and lead the technical vision for Cresta’s next-generation Agentic AI systems, including Agentic Assist and enterprise AI Agents. Architect scalable,
Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. strongly preferred. 7+ years of experience building and deploying machine learning systems in production,
About the Role
~1 min readMachine Learning Engineers at Cresta work across several high-impact AI initiatives. Final team placement is determined based on experience, strengths, and business needs.
Current focus areas include:
- Agentic Assist: Lead and build next-generation agentic AI systems that augment contact center agents in real time. This track requires strong pre-LLM ML foundations, deep expertise in LLMs and modern prompting techniques, a rapid prototyping mindset, and a proven ability to translate cutting-edge research into scalable, production-grade systems.
- Agent & System Quality: Design evaluation frameworks and improve the reliability, robustness, and performance of LLM-powered agents. This includes diagnosing and mitigating failure modes such as hallucinations, retrieval errors, tool misuse, context drift, prompt brittleness, and multi-step reasoning breakdowns, while defining measurable quality metrics (e.g., accuracy, faithfulness, task completion, latency, and cost) for complex, non-deterministic systems.
- Insights: Architect and scale LLM and retrieval-augmented generation pipelines that ground models in enterprise data. This track focuses on building high-performance ML systems that process complex data, extract structured insights, and deliver real-time, actionable intelligence at scale.
Responsibilities
~1 min read- →Define and lead the technical vision for Cresta’s next-generation Agentic AI systems, including Agentic Assist and enterprise AI Agents.
- →Architect scalable, production-grade LLM systems that integrate reasoning, retrieval, planning, tool use, and real-time decision-making into cohesive, intelligent workflows.
- →Design and evolve multi-agent orchestration frameworks that combine RAG, structured knowledge, domain-adapted models, and automated actions.
- →Establish best practices for building robust, reliable, and cost-efficient LLM-powered systems in high-scale production environments.
- →Own evaluation strategy for complex, non-deterministic AI systems, including offline benchmarking, online experimentation, LLM-as-a-judge methodologies, and systematic failure analysis.
- →Proactively identify and mitigate agent failure modes such as hallucinations, tool misuse, retrieval errors, prompt brittleness, context drift, and multi-step reasoning breakdowns.
- →Define measurable quality standards (accuracy, faithfulness, task completion, latency, cost efficiency, robustness) and drive continuous system improvement.
- →Influence cross-team architecture decisions across ML, backend, and product engineering to ensure seamless integration of AI capabilities.
- →Mentor senior engineers, raise the technical bar, and contribute to long-term AI strategy and roadmap planning.
- →Translate cutting-edge research advances into practical, high-impact production systems.
Requirements
~1 min read- Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. strongly preferred.
- 7+ years of experience building and deploying machine learning systems in production, including deep hands-on experience with LLMs at scale.
- Demonstrated leadership in architecting complex AI systems, particularly agentic or multi-step LLM workflows.
- Deep expertise in transformer-based models, embeddings, retrieval systems, and Retrieval-Augmented Generation (RAG) pipelines.
- Experience designing evaluation frameworks for LLM systems beyond single-turn prompts, including robustness testing and production monitoring.
- Strong systems thinking: ability to design for scalability, latency constraints, cost efficiency, security, and long-term maintainability.
- Extensive experience with modern ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face) and distributed/cloud-based infrastructure.
- Proven ability to influence technical direction across teams as a senior individual contributor.
- A strong bias toward action — able to prototype rapidly while maintaining production rigor.
What We Offer
~1 min readWhat We Offer
~1 min readCresta’s approach to compensation is simple: recognize impact, reward excellence, and invest in our people. We offer competitive, location-based pay that reflects the market and what each individual brings to the table.
The posted base salary range represents what we expect to pay for this role in a given location. Final offers are shaped by factors like experience, skills, education, and geography. In addition to base pay, total compensation includes equity and a comprehensive benefits package for you and your family.
OTE Range: $230,000–$300,000 + Offers Equity
Listing Details
- Posted
- April 17, 2026
- First seen
- March 26, 2026
- Last seen
- April 17, 2026
Posting Health
- Days active
- 21
- Repost count
- 0
- Trust Level
- 74%
- Scored at
- April 17, 2026
Signal breakdown
Please let Cresta know you found this job on Jobera.
4 other jobs at Cresta
View all →Explore open roles at Cresta.
Similar Machine Learning Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.
