Member of Technical Staff - Post Training
Quick Summary
Reinforcement Learning: Research and Execution
Profile, optimize, and scale RL training runs to reduce iteration time. Integrate new optimization techniques as they emerge from the research community.
About ai&
ai& is a new global AI technology company dedicated to meeting the world's growing demand for AI. Our vision is twofold: to serve as a premier AI lab specializing in localization, and to act as a global infrastructure and compute provider. We are building a unified, optimized global platform that integrates next-generation data centers and infrastructure, heterogeneous compute serving, and advanced model services. We believe that the most effective way to build and scale AI is to own the stack from top to bottom.
At ai&, we empower small teams with the autonomy needed to tackle significant challenges. Our approach is to deconstruct large problems into manageable components and solve complex issues collaboratively. We seek highly motivated, mission-driven individuals who demonstrate strong personal agency. We value curiosity as the foundation of talent, and we are looking for people eager to develop alongside our evolving technology and expanding business.
We are actively hiring worldwide, with a presence in Tokyo, San Francisco, Austin, and Toronto. We are more than happy to meet exceptional talent where they are.
Responsibilities
Requirements
Reinforcement Learning in Practice
You have actually run RL on language models. You have implemented reward models, dealt with reward hacking, tuned KL penalties, and shipped models that are meaningfully better as a result. You understand the theory and you have applied it.
Post-Training Engineering Depth
Hands-on experience with data generation and evaluation for LLM post-training. You have run SFT, preference alignment, and RL workflows on real models and you know where these pipelines break.
Framework Proficiency
Strong Python and PyTorch proficiency with hands-on experience optimizing training pipelines. Experience with DeepSpeed, FSDP, vLLM, or similar frameworks for efficient model training and inference.
Data Quality Instinct
Strong intuition for what good training data looks like. Experience designing and executing data generation, filtering, curation, and quality assessment processes at scale.
End-to-End Thinking
You reason across data generation, training, alignment, and evaluation as a single system. You do not optimize one stage in isolation from the others.
Customer and Communication Fluency
Comfortable working directly with enterprise customers. You can translate between customer needs and internal technical teams, push back when needed, and be trusted as the technical owner of a delivery.
Continual Learning Familiarity
Familiar with the challenges of continual and lifelong learning in neural networks. You have thought seriously about catastrophic forgetting and how to build models that stay current without degrading.
Great Team Spirit
A mission-driven approach to engineering, valuing clear communication, hands-on execution, and collective success over individual silos.
Location & Eligibility
Listing Details
- Posted: March 20, 2026
- First seen: May 6, 2026
- Last seen: May 8, 2026