Unlimit

Site Reliability Engineer (SRE)

Belgrade · Full-Time · Mid-level
Engineering · DevOps & Infrastructure · Site Reliability Engineer · DevOps Engineer · Infrastructure & Cloud

About Unlimit
 
Unlimit is a global fintech ecosystem built to eliminate financial borders holding businesses back. The company provides the extensive infrastructure needed to scale globally, integrating payment processing, multi-currency business accounts, BaaS and crypto gateways into a single, intelligent platform.
 
Across 17 offices globally, Unlimit bridges hyper-local expertise with a high-capacity financial network, giving companies the agility to expand across regions with operational confidence and speed. Driving the evolution of payments, Unlimit is transforming its infrastructure from human-operated fintech into AI-native financial infrastructure — where APIs are consumed by machines, integrations are negotiated by agents, and systems evolve continuously through intelligent automation. Our next users are not only humans. They are AI agents acting on behalf of humans and businesses.
 
Unlimit serves more than the needs of businesses today; we are building the nervous system for a borderless global economy.

As a Site Reliability Engineer (SRE) at Unlimit, you will help ensure the reliability, scalability, and performance of our core platform and services. You’ll work closely with Engineering and other stakeholders to design, build, and operate cloud-based infrastructure and distributed systems—while continuously improving automation, observability, and incident response.
This role blends Linux systems engineering, cloud infrastructure, Kubernetes, and automation with a strong focus on service availability, operational excellence, and continuous improvement.

Responsibilities

  • Platform reliability & operations:
      • Ensure the availability, resilience, and performance of the platform and supporting services.
      • Own and improve incident management, including troubleshooting, escalation handling, and follow-ups aligned to SLAs.
      • Participate in an on-call rotation, supporting production systems and driving reliability improvements from real incidents.
  • Infrastructure engineering (Linux / Cloud / Kubernetes):
      • Design, deploy, configure, and manage Linux-based system architecture across environments.
      • Build and support platform implementations using AWS and other cloud technologies (compute-centric services and related infrastructure).
      • Design and implement large and complex technology projects, from initial design through production rollout and operational handover.
      • Support and maintain Kubernetes-based workloads and platform components.
  • Automation & Infrastructure as Code:
      • Build tooling and solutions to automate recurring operational tasks.
      • Use Infrastructure as Code (IaC) to standardize and scale: Terraform for provisioning, Ansible for configuration management and automation.
      • Improve reliability by reducing manual steps and enabling repeatable deployments.
  • CI/CD & developer enablement:
      • Manage and maintain CI/CD pipelines across 20+ repositories spanning multiple technology stacks.
      • Partner with Engineering teams to improve build/release consistency, pipeline reliability, and deployment safety.
  • Observability & operational readiness:
      • Implement and enhance monitoring, logging, and alerting, using tools such as Prometheus, Grafana, Zabbix, Splunk, and PagerDuty (or equivalent incident alerting/response tooling).
      • Use metrics and incident learnings to reduce noise, improve signal, and shorten time-to-detect/time-to-recover.
  • Documentation & standards:
      • Produce clear, formal documentation, including configuration standards, troubleshooting runbooks, and infrastructure and architecture design documentation.
      • Contribute to internal standards that improve consistency, security, and operational maturity.

Requirements

  • 5+ years of hands-on experience in Linux systems administration / engineering in production environments.
  • Strong working knowledge of the following (or equivalents): Linux, Kubernetes, GitLab, Terraform, Ansible.
  • Experience working in Agile (Scrum) teams.
  • Experience with AWS (compute-focused services) and/or Google Cloud Platform.
  • Proven experience with distributed systems design, maintenance, and troubleshooting.
  • Strong scripting/coding ability in at least one of: Python, Go, Bash.
  • Experience with observability and incident response tooling such as: Zabbix, Splunk, Prometheus, Grafana, PagerDuty.
  • Strong communication skills in English, with the ability to work effectively with customers, vendors, partners, and internal teams across levels.
  • Working knowledge of datastores and messaging systems such as PostgreSQL, MongoDB, and RabbitMQ, and of web/application infrastructure components such as Apache and Nginx.
  • Demonstrated ability to learn quickly, work independently, make good decisions, and collaborate as a team player in fast-changing environments.
  • Strong AI-driven mindset and curiosity about emerging AI technologies.
  • Hands-on experience using AI tools (e.g., LLMs, automation frameworks, AI-assisted development tools) to enhance productivity or system performance.
  • Experience operating highly available, high-volume web services.
  • Strong initiative and a self-starter attitude, with the ability to work under minimal supervision.
  • Demonstrated success reducing operational toil through automation and better tooling.
  • Experience improving SLOs/SLIs, error budgets, or formal reliability practices (if applicable to your background).

Listing Details

    Posted: February 5, 2026
    First seen: March 26, 2026
    Last seen: April 24, 2026

    Unlimit

    Unlimit is a global fintech company offering comprehensive payment solutions to businesses, enabling them to expand and optimize their payment processes.

    Employees: 350
    Founded: 2009