Software Engineer – RL Environments — AfterQuery
Location: San Francisco, CA (Onsite)
Compensation: $180,000 – $220,000 base | ~$500,000 total cash + equity
About AfterQuery
AfterQuery is an AI infrastructure company building training data and evaluation systems for frontier AI labs. The company works directly with leading labs to improve model performance through datasets and experimentation. It has raised $30M at a ~$300M valuation, and its founding team comes from Jane Street, Citadel, Google, Goldman Sachs, and Stanford AI Lab.
About the Role
This is a high-impact engineering role focused on building the datasets, evaluation systems, and reward frameworks that directly influence how frontier AI models are trained. You will operate at the intersection of software engineering, data pipelines, and reinforcement learning environments. Your output will directly impact model capability, alignment, and performance across real-world domains.
- Design data slices that expose meaningful model failure modes across domains including finance, code, and enterprise workflows
- Build and refine evaluation rubrics and reward signals for RLHF and RLVR pipelines
- Run experiments to analyze model behavior and improve capabilities
- Develop frameworks to measure dataset quality, diversity, and downstream impact
- Build and manage real-world and synthetic data pipelines
- Work directly with research teams at leading AI labs — translating training objectives into concrete data and evaluation systems
Requirements
- 1–4 years of software engineering experience with strong technical depth
- Strong interest in how data structure and quality influence model behavior
- Ability to design experiments and extract insights from imperfect data
- Experience building and shipping production systems
- Comfort working across domains including finance, engineering, and policy
Nice to Have
- Experience with RL environment companies, AI safety, or benchmarking organizations
- Experience building data pipelines and working with ML infrastructure
- Familiarity with RLHF or RLVR training pipelines
- Startup or early engineer experience
Not a Fit
- Pure research profiles without engineering output
- Those who prefer traditional product engineering work
- Candidates unable to operate in ambiguous environments
Additional Notes
- The role is fully onsite in San Francisco; please apply only if you can commit to this
- Multiple headcount with active hiring demand
Shortlisted candidates will be contacted by David Joseph & Co., the recruiting partner managing this search on behalf of AfterQuery.
Listing Details
- First seen: May 5, 2026
- Last seen: May 8, 2026
Posting Health
- Days active: 0
- Repost count: 0
- Trust Level: 51%
- Scored at: May 6, 2026