alephalpha
alephalpha21d ago
New

Senior AI Software Engineer - Model Evaluation (f/m/d)

Heidelbergfull-timesenior
OtherAi Software Engineer
0 views0 saves0 applied

Quick Summary

Overview

Aleph Alpha Research’s mission is to deliver category-defining AI innovation that enables open, accessible, and trustworthy deployment of GenAI in industrial applications.

Requirements Summary

Understanding of foundation model training - how data, scale, and architecture affect capabilities. Experience with large-scale data processing or ML infrastructure.

Technical Tools
pythonpytorchdistributed-systemsmachine-learning

At Aleph Alpha, we foster a culture built on ownership, autonomy, and empowerment. Teams and individual contributors are trusted to take responsibility for their work and drive meaningful impact. We maintain a flat organizational structure with efficient, supportive management that enables quick decision‑making, open communication, and a strong sense of shared purpose.

About the Role

~1 min read

As a Senior AI Engineer in Pre-training Evaluation, you will work across the full stack of evaluation - from methodology design to implementation to analysis. Some weeks you'll be deep in benchmark curation, understanding what a given eval actually measures and whether it predicts downstream performance. Other weeks you'll be optimising pipeline throughput or building dashboards that surface training signals.

We are looking for someone that combines significant research experience (in industry or academia) with high engineering competence.

Your work sits at high leverage: the evaluations you design and build determine which training runs we pursue, which data mixtures we prioritise, and how we allocate compute. You'll have direct influence on the models we ship.

Responsibilities

~1 min read
  • Experience with LLM evaluation, benchmark design, evaluation dataset curation, and experimental design.

  • Familiarity with statistical methods for evaluation and experiment design.

  • Track record of shipping impactful technical work - whether that's research, infrastructure, or both.

  • Strong Python skills and comfort with ML tooling (PyTorch, evaluation frameworks, distributed systems).

  • Ability to reason about what an evaluation measures and whether it matters - not just run benchmarks, but understand them.

  • Ownership mentality: you see problems through from diagnosis to solution to deployment.

  • Willingness to relocate to Heidelberg or travel regularly (potentially weekly).

Requirements

~1 min read
  • Understanding of foundation model training - how data, scale, and architecture affect capabilities.

  • Experience with large-scale data processing or ML infrastructure.

  • German language proficiency (helpful for evaluating German capabilities, not required).

  • PhD in machine learning, NLP, statistics, or a related field (valued but not required - we care about what you can do).

What We Offer

~1 min read
Become part of an AI revolution!
30 days of paid vacation
Access to a variety of fitness & wellness offerings via Wellhub
Mental health support through nilo.health
Substantially subsidized company pension plan for your future security
Subsidized Germany-wide transportation ticket
Budget for additional technical equipment
Flexible working hours for better work-life balance and hybrid working model
Virtual Stock Option Plan
JobRad® Bike Lease

Location & Eligibility

Where is the job
Heidelberg
Hybrid — some on-site time required
Who can apply
Same as job location

Listing Details

Posted
April 17, 2026
First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
21%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

alephalphaSenior AI Software Engineer - Model Evaluation (f/m/d)