Thinkahead
Thinkahead2mo ago

Sr Engineer -Compute

GurugramRemoteFull Timesenior
OtherEngineer
1 views0 saves0 applied

Quick Summary

Overview

AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery,

Technical Tools
OtherEngineer
AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation.

At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. 

We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. 

We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD. 

The High-Performance Computing Compute Engineer is primarily responsible for the overall health and maintenance of the physical cluster and server technologies in our managed services customer's environments. Our Compute Engineers are a valued member of the Managed Services Infrastructure Practice responsible for Tier 3 incident management, service request management and change management infrastructure support for all Managed Services customers.    
  • Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities 
  • Plan and perform software and firmware maintenance activities 
  • Assess customer environments for performance and design issues and propose resolutions 
  • Work across technical teams to troubleshoot complex infrastructure issues 
  • Create and maintain detailed documentation 
  • Serve as a subject matter expert and escalation point for compute technologies 
  • Work with vendors to resolve compute issues 
  • Communicate with customers and internal team with transparency 
  • Participate in on-call rotation 
  • Completion of training and certification as assigned to further skills and knowledge 
  • Bachelor’s degree or equivalent Information Systems or related field. Unique education, specialized experience, skills, knowledge, training, or certification may be substituted for education 
  • 5+ years of advanced Linux administration and troubleshooting 
  • 5+ years managing RedHat OpenShift Kubernetes and Virtualization clusters 
  • 5+ years of expert level experience managing infrastructure in high-performance computing environments including configuration, troubleshooting, and best practice 
  • 2+ years of experience with Nvidia DGX preferred 
  • Experience with HPC schedulers (e.g., SLURM, Kubernetes, PBS, Run:ai) required 
  • Proficient in physical server environments 
  • Experience configuring, maintaining and troubleshooting containers 
  • Experience with storage technology (e.g., Ceph or Vast Data Platform) and distributed file systems (e.g., Lustre, GPFS, NFS, GlusterFS) 
  • Experience with machine learning or data science workflows in HPC/AI environments 
  • 1+ years working with monitoring platforms (e.g., Prometheus, Grafana); Elastic Observability experience is a bonus 
  • 1+ years working with an enterprise ITSM system: Service Now is a bonus 
  • Previous experience with automation tools such as Ansible, Puppet, or Chef a plus 
  • Managed Services or consulting experience is required 
  • Strong background with customer service 
  • High level problem-solving and communication skills 
  • Strong oral and written communications skills 
  • Related Linux, Nvidia, Scheduler, Containerization, Virtualization, and Clustering certifications are a bonus 
  • Listing Details

    Posted
    February 20, 2026
    First seen
    March 26, 2026
    Last seen
    April 21, 2026

    Posting Health

    Days active
    25
    Repost count
    0
    Trust Level
    39%
    Scored at
    April 21, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Thinkahead
    Employees
    5
    Founded
    2020
    View company profile
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    ThinkaheadSr Engineer -Compute