Software Engineer
Quick Summary
About Mechanize Mechanize builds reinforcement learning environments that frontier AI labs use to train and evaluate their coding models. Learn more at mechanize.work.
You'll design, build, and quality-assure RL tasks. Each task is a self-contained software engineering challenge with a prompt, an environment, and an automated grader.
Mechanize builds reinforcement learning environments that frontier AI labs use to train and evaluate their coding models. Learn more at mechanize.work.
AI models have gotten good at narrow coding tasks but still fail at the complex, judgment-heavy parts of software engineering. We build the environments that expose those failures and help models improve.
Responsibilities
~1 min readYou'll design, build, and quality-assure RL tasks. Each task is a self-contained software engineering challenge with a prompt, an environment, and an automated grader. You own the full lifecycle: ideation, grading infrastructure, running frontier models against the task, failure analysis, and iteration. At this level, we expect you to consistently produce tasks that target meaningful capability gaps in frontier models, and to develop a strong sense for what makes a task informative versus merely difficult.
You will use coding agents heavily, and a large part of the job is directing them well, evaluating their output, and knowing when they are failing in subtle ways. You may also contribute to shared infrastructure: improving our build pipeline, automating parts of QA, or building tooling for other engineers.
Strong technical fundamentals combined with a well-calibrated intuition for AI model behavior. You need to anticipate where a model will take shortcuts, distinguish genuine capability gaps from grader issues, and understand how a model will interpret a prompt. At this level, we expect extensive familiarity with what frontier coding agents can and can't do.
Can code in Python
Are confident working independently at a consistent pace
Have developed an intuition for what coding agents can and can't do
No prior ML or AI experience required
Want a product engineering role building features for end users
Prefer a highly collaborative team environment with shared ownership
Want extensive structured mentorship
This is independent, high-ownership work. You own your tasks from start to finish, with regular check-ins and feedback.
What We Offer
~1 min readCompensation includes a $350,000 base salary, equity, and performance bonuses. Top performers can earn more in bonuses than in base salary.
Strong performers are recognized and promoted quickly. Benefits include health, dental, vision, and life insurance.
About Mechanize. ~20 person team in San Francisco. Backed by Patrick Collison, Nat Friedman, Daniel Gross, Jeff Dean, Dwarkesh Patel, and Sholto Douglas. Featured in the New York Times, the Dwarkesh Podcast and Hard Fork.
Learn more about the interview process: https://www.mechanize.work/how-our-interview-process-works
Learn more about the work: https://www.mechanize.work/what-working-here-is-like
Location & Eligibility
Listing Details
- Posted
- February 18, 2026
- First seen
- May 8, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 25%
- Scored at
- May 8, 2026
Signal breakdown
Please let mechanize know you found this job on Jobera.
3 other jobs at mechanize
View all →Explore open roles at mechanize.
Similar Software Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.