Paytm
Paytm10d ago

Staff Platform Engineer - AI Infrastructure

CanadaCanada·TorontoFull-time Employmentlead
EngineeringPlatform EngineerDevops EngineerInfrastructure & Cloud
0 views0 saves0 applied

Quick Summary

Overview

About the Role As a Staff Platform Engineer - AI Infrastructure, you will build and scale the infrastructure behind Paytm's AI inference platform,

Technical Tools
EngineeringPlatform EngineerDevops EngineerInfrastructure & Cloud
About the Role

As a Staff Platform Engineer - AI Infrastructure, you will build and scale the infrastructure behind
Paytm's AI inference platform, serving internal teams and enterprise customers and supporting
new customer use cases from the ground up. You will own GPU infrastructure, model hosting
and serving, and multi-model routing across modalities. This includes running our own coding
and domain-specific models (voice, vision, risk, fintech workflows) as well as third-party models
on shared GPU and accelerator clusters.

You will also build self-service platforms that let teams provision, compute, deploy and
customize models, and manage resources through APIs and control planes, so they can use AI
without rebuilding infrastructure each time.

Your work will form the AI control plane for Paytm Intelligence (Pi): policy-driven routing, quotas,
observability, and usage and cost visibility. It will directly affect how fast we ship agents and AI
features, how reliably they run, and how efficiently we use our hardware across payments, risk,
fraud, collections, support, and developer experience.
  • Design and operate GPU infrastructure for model hosting, including provisioning, scheduling, and cost optimization across cloud and on-premise environments
  • Build and scale model serving systems using vLLM, TensorRT-LLM, Triton, or equivalent, supporting real-time inference with strong latency and availability guarantees
  • Implement multi-model routing to serve multiple models across modalities (text, voice, code, vision) on shared infrastructure
  • Own the model lifecycle end to end: download, deploy, serve, monitor, swap, and scale
  • Drive inference optimization including quantization strategies (AWQ, GPTQ), batching, caching, and cold start reduction
  • Build self-service infrastructure platforms where teams provision compute, storage, and model endpoints through APIs and control planes
  • Implement infrastructure-as-code at scale using Terraform, Pulumi, or CDK
  • Build observability and reliability for inference systems: SLIs/SLOs, GPU utilization
  • monitoring, latency tracking, automated capacity planning, and alerting
  • Define platform standards and governance including multi-tenant isolation, cost attribution, and resource quotas
  • Lead architectural design and influence engineering direction across the AI infrastructure stack
  • 8+ years of software engineering experience, including 3+ years building infrastructure platforms or ML/AI infrastructure
  • Deep experience with cloud infrastructure (AWS, GCP) and Kubernetes
  • Hands-on experience with GPU workloads and model serving (vLLM, TensorRT-LLM, Triton, or similar)
  • Strong software engineering fundamentals in Python, Go, or C++
  • Experience with infrastructure-as-code (Terraform, Pulumi, CDK)
  • Experience designing self-service platforms or internal developer tooling
  • Understanding of model optimization: quantization, batching, serving architectures
  • Proven ability to lead complex cross-team technical initiatives
  • Strong communication skills and the ability to influence technical direction
  • Experience building or operating inference infrastructure at scale
  • Experience with CUDA, GPU scheduling, or hardware-level optimization
  • Experience with multi-model serving across different modalities
  • Experience with edge inference or on-device model deployment
  • Experience with model fine-tuning infrastructure (LoRA, QLoRA, PEFT)
  • Background in fintech or regulated industries
  • Location & Eligibility

    Where is the job
    Toronto, Canada
    Hybrid — some on-site time required
    Who can apply
    CA
    Listed under
    Canada

    Listing Details

    Posted
    April 20, 2026
    First seen
    April 20, 2026
    Last seen
    May 1, 2026

    Posting Health

    Days active
    10
    Repost count
    0
    Trust Level
    47%
    Scored at
    May 1, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Paytm
    Paytm
    lever

    Indian fintech company providing digital payments, financial services, and merchant solutions

    Employees
    10,000+
    Founded
    2010
    Domain
    paytm.com
    View company profile
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    PaytmStaff Platform Engineer - AI Infrastructure