Paytm
Paytm14d ago

Staff AI Platform Engineer - Inference & Agentic Systems

CanadaCanada·TorontoFull-time Employmentlead
Data ScienceOtherDevOps & InfrastructurePlatform EngineerAi Platform EngineerInfrastructure & Cloud
1 views0 saves0 applied

Quick Summary

Overview

About the Role We are a small team of AI builders in Paytm Labs. As a Staff AI Platform Engineer, you will work across inference and agentic systems.

Technical Tools
Data ScienceOtherDevOps & InfrastructurePlatform EngineerAi Platform EngineerInfrastructure & Cloud
About the Role

We are a small team of AI builders in Paytm Labs.
As a Staff AI Platform Engineer, you will work across inference and agentic systems. You will
contribute to Paytm's AI inference platform (Pi), serving internal teams and enterprise customers
- running our own coding and domain-specific models (voice, vision, risk, fintech workflows) as
well as third-party models. You will also architect and build the platform that enables
autonomous AI agents to operate safely and reliably in production - the runtime, orchestration,
and developer tooling for agents to reason, plan, use tools, and execute complex multi-step
workflows, automating both software development and business processes.

You will work at the intersection of LLMs, distributed systems, and production fintech
infrastructure, helping define how inference and agentic AI are built and deployed across
payments, risk, fraud, collections, support, and developer experience.
  • Inference & Model Serving
  • Build and operate multi-model serving across modalities (text, voice, code, vision) on shared infrastructure
  • Own the model lifecycle: download, deploy, serve, monitor, update, swap
  • Drive inference optimization: latency, throughput, cost - including quantization, batching, caching, and routing strategies
  • Ensure inference is fast and reliable for the agents and systems that depend on it

  • Agentic Systems
  • Architect and build the Agentic AI Platform - runtime infrastructure, orchestration systems, and developer tooling for autonomous agents
  • Design multi-agent coordination systems enabling agents to collaborate and solve complex workflows
  • Build robust tool-use infrastructure that allows agents to interact with APIs, databases, and services safely
  • Implement workflow automation: agents that execute multi-step business and engineering tasks with appropriate guardrails
  • Build safety and guardrail systems including permissioning, sandboxing, and human-in-the-loop workflows
  • Develop evaluation and observability frameworks to measure agent behaviour, detect regressions, and debug failures
  • Develop SDKs and APIs that allow internal teams to build and deploy agents quickly and safely

  • Platform & Technical Leadership
  • Define technical direction and architecture for agentic systems across the organization
  • Build patterns and standards for agent design, tool calling, and evaluation
  • Partner closely with ML, product, and security teams to deliver production-grade agent systems
  • Mentor engineers and contribute to best practices for agent system design
  • 8+ years of software engineering experience, with 3+ years in AI systems or LLM applications
  • Strong understanding of LLM-based agent architectures (ReAct, RAG, tool use, multi-agent systems)
  • Experience building highly reliable distributed systems
  • Proficiency in Python and experience working with modern LLM APIs or open-source models
  • Experience with or strong interest in model serving (vLLM, TensorRT-LLM, Triton)
  • Understanding of distributed systems: task queues, event-driven architectures, state management
  • Experience with cloud platforms (AWS, GCP) and containerized deployments
  • Strong understanding of security risks in agentic systems (prompt injection, privilege escalation, data leakage)
  • Demonstrated experience leading complex technical initiatives
  • Strong written and verbal communication skills
  • Experience building agentic systems in regulated industries (fintech, healthcare, enterprise)
  • Familiarity with Model Context Protocol (MCP) or agent communication standards
  • Experience with model fine-tuning, quantization, or LoRA
  • Experience building CI/CD automation and developer tooling
  • Experience adapting workflow orchestration systems (Temporal, Airflow, Prefect) for AI workloads
  • Experience with voice models, multimodal models, or edge inference
  • Experience designing human-in-the-loop or oversight systems
  • Interest in testing and verification for non-deterministic AI systems
  • Location & Eligibility

    Where is the job
    Toronto, Canada
    Hybrid — some on-site time required
    Who can apply
    CA
    Listed under
    Canada

    Listing Details

    Posted
    April 16, 2026
    First seen
    April 17, 2026
    Last seen
    May 1, 2026

    Posting Health

    Days active
    13
    Repost count
    0
    Trust Level
    38%
    Scored at
    May 1, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Paytm
    Paytm
    lever

    Indian fintech company providing digital payments, financial services, and merchant solutions

    Employees
    10,000+
    Founded
    2010
    Domain
    paytm.com
    View company profile
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    PaytmStaff AI Platform Engineer - Inference & Agentic Systems