Nebius
Nebius~29d ago

HPC System Engineer

OtherSystem Engineer
0 views0 saves0 applied

Quick Summary

Requirements Summary

Proficient in Unix/Linux , plus Python and Bash for automation. Good understanding of the GPU stack: CUDA, NCCL , drivers,

Technical Tools
OtherSystem Engineer

Responsibilities

~1 min read
  • Work closely with hardware, development teams to profile and analyze GPU performance at the system and kernel level.
  • Evaluate and compare GPU performance across different platforms, architectures, and software stacks (e.g., CUDA, ROCm).
  • Perform acceptance testing for new GPU clusters, ensuring hardware and software meet performance, stability, and compatibility requirements for AI workloads.
  • Perform experiments across diverse GPU system configurations to assess the impact of varying interconnect strategies and system-level optimizations on performance and scalability.

 

  • Proficient in Unix/Linux, plus Python and Bash for automation.
  • Good understanding of the GPU stack: CUDA,NCCL, drivers, and relevant libraries
  • Proven ability to troubleshoot complex system issues including hardware, software, and networking problems.
  • Familiarity with containerized environments (e.g., Docker, Kubernetes).

 

  • Experience with modern deep learning frameworks (PyTorch, JAX, vLLM, Tensort-LLM)
  • Experience with job schedulers and resource managers (Slurm, Volcano, etc.).

What We Offer

~1 min read
Competitive salary and comprehensive benefits package.
Opportunities for professional growth within Nebius.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.

Location & Eligibility

Where is the job
Amsterdam, Netherlands
On-site at the office
Who can apply
NL
Listed under
Netherlands

Listing Details

First seen
April 3, 2026
Last seen
May 2, 2026

Posting Health

Days active
29
Repost count
0
Trust Level
31%
Scored at
May 2, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Nebius
Nebius
greenhouse

Nebius is a cutting-edge AI cloud platform that offers scalable infrastructure for developing and deploying AI solutions.

Employees
350
Founded
2022
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

NebiusHPC System Engineer