Apolloresearch · 5mo ago
Full-stack Software Engineer (Research team)
London, London · Full-time · mid
Other · Software Engineer · Full Stack Software Engineer · Software Engineering
Technical Tools
anthropic, python, react
Application deadline: We are conducting interviews actively and aim to fill this role as soon as we find someone suitable.
ABOUT THE OPPORTUNITY
We’re looking for Full-stack Software Engineers who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.
REPRESENTATIVE PROJECTS
Your main objective is to develop tooling for analyzing model evaluation results. Here is a list of features that you might build and ship in your first 6 months:
- LLM-powered search that finds interesting fragments in evaluation transcripts
- Comparison views that show how conversations and scores differ between two evaluation runs
- Ability to view and analyse conversations with coding agents (Cursor, Claude Code, etc.) in addition to evaluation transcripts
- Results streaming for evaluations that are currently being run
- Collaborative editing of evaluation logs that automatically updates metrics and other derived data
Think of this as developing an “IDE for evaluations”.
Besides this, here are example auxiliary projects which you might do:
- Automated evaluation pipelines to minimize the time from getting access to a new model for pre-deployment testing to analyzing the most important results and sharing them.
- LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent major failures
- Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability
- Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.
For example, we might be impressed if you have:
The following would be a bonus:
We want to emphasize that people who feel they don’t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.
Location & Eligibility
Where is the job: London (on-site at the office)
Who can apply: Same as job location
Listed under: Worldwide
Listing Details
- Posted: December 5, 2025
- First seen: March 26, 2026
- Last seen: May 15, 2026
Posting Health
- Days active: 49
- Repost count: 0
- Trust Level: 23%
- Scored at: May 15, 2026