akveloncom

Lead/Senior/Middle DevOps Engineer with GPU experience

United States · Tbilisi · Remote · Employee · Senior
Engineering · DevOps Engineer

Technical Tools
Azure, Grafana, Kubernetes, Prometheus, B2B, CI/CD

We are looking for a Lead, Middle, or Senior DevOps Engineer to join a research infrastructure team building an on-demand GPU platform for advanced compute workflows. The role focuses on enabling secure, scalable, and user-friendly access to high-performance GPU resources through automation, scheduling, and modern platform tooling.

Responsibilities

  • Build and improve an on-demand GPU workstation platform with lightweight containerization or virtualization;
  • Implement scheduling, reservation, registration, image management, storage mounting, SSH with SSO, and developer-friendly access flows;
  • Automate cluster namespace configuration across CPU, GPU, memory, and storage allocations;
  • Support hierarchical capacity allocation models with RBAC-based administration;
  • Automate storage import, export, and archival workflows as allocations change;
  • Build monitoring, alerts, and automated incident ticket creation for large-scale cluster environments;
  • Improve integrations between source control, CI/CD, package distribution, and GPU-connected development workflows;
  • Contribute automation, scripts, and agentic tooling that improve infrastructure and day-to-day research workflows.

Nice to Have

  • Experience with Prometheus, Grafana, incident automation, or on-call paging workflows;
  • Experience with developer platforms, devcontainers, or remote development tooling such as VS Code integrations;
  • Exposure to AI-assisted monitoring, trend analysis, or agentic infrastructure tooling.

Working Conditions

  • B2B contract;
  • Remote work from Serbia, Georgia, Armenia, Kazakhstan, Poland, Croatia, Portugal, or Egypt;
  • European working hours;
  • Occasional availability for meetings until 10:00 AM PST (US overlap).

Location & Eligibility

Where is the job: Tbilisi, United States (remote within one country)
Who can apply: US

Listing Details

Posted: May 6, 2026
First seen: May 6, 2026
Last seen: May 7, 2026

Posting Health

Days active: 0
Repost count: 0
Trust Level: 61%
Scored at: May 6, 2026
