Research Scientist - Post Training
Quick Summary
About AfterQuery AfterQuery builds the training data and evaluation infrastructure that frontier AI labs use to make their models better. We work with the world's leading labs to design high signal datasets and run rigorous evaluations that go beyond static benchmarks.
Your job is to prove that our data works. You will design and run training experiments that isolate the impact of our datasets on model behavior.
Strong familiarity with LLM training and evaluation methodologies. Genuine obsession with how data structure, selection, and quality drive model behavior.
AfterQuery is an applied research lab curating data solutions for foundation model development.
We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve.
This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown well over a $100M revenue run rate.
We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator and angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI and are based in San Francisco.
Your job is to prove that our data works. You will design and run training experiments that isolate the impact of our datasets on model behavior. This includes SFT and RL-based post-training, where you’ll measure how different data sources shift capability, generalization, and alignment. Working closely with partner labs, you will turn our datasets into clear, defensible evidence: this data → this improvement → under these conditions. This is experimental, high-leverage work.
Responsibilities
~1 min read- →
Run controlled SFT and RL experiments to measure the impact of our datasets on model performance.
- →
Help build public evals and new data types that push the frontier.
- →
Publish external-facing research, blog posts, and technical reports.
- →
Work with internal SPLs to iterate on data quality based on your results.
Strong familiarity with LLM training and evaluation methodologies.
Genuine obsession with how data structure, selection, and quality drive model behavior.
Ability to design lightweight experiments, move fast, and extract actionable insights from messy results.
Comfort working across domains (you'll touch finance, software engineering, policy, and more).
A bias toward building over theorizing.
Great candidates are undergrad research or master's research (but haven't done a phd).
What We Offer
~1 min readLocation & Eligibility
Listing Details
- Posted
- April 14, 2026
- First seen
- May 6, 2026
- Last seen
- June 22, 2026
Posting Health
- Days active
- 47
- Repost count
- 0
- Trust Level
- 26%
- Scored at
- June 22, 2026
Signal breakdown
Please let afterquery know you found this job on Jobera.
4 other jobs at afterquery
View all →Explore open roles at afterquery.
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.