Senior Staff Research Engineer – Reinforcement Learning for AI Agents
Quick Summary
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics.
Responsibilities
- Reinforcement learning methods for LLM-driven agents and decision systems.
- Policy optimization for long-horizon reasoning and planning.
- Learning from human or AI feedback (RLHF / RLAIF).
- Agent training pipelines built on top of our agent infrastructure platform.
- Evaluation and benchmarking systems for agent capabilities.
- Learning loops that integrate real-world and simulation data.
- Contribute to AI systems that continuously improve after deployment.
Requirements
- MS or PhD in Computer Science, AI, Machine Learning, Robotics, or a related field.
- Strong background in reinforcement learning or machine learning.
- Experience implementing RL algorithms such as PPO, Actor-Critic, or policy gradient methods.
- Strong programming skills in Python with PyTorch or JAX.
- Experience building ML training systems or infrastructure.
Preferred Qualifications
- Experience with RLHF or preference learning.
- Experience with LLM agents or tool-using AI systems.
- Multi-agent systems or long-horizon planning.
- Simulation environments for RL.
- Publications in NeurIPS, ICML, ICLR, ACL, or related venues.
Benefits
- A fun, supportive, and engaging environment.
- Opportunity to make a significant impact on the transportation revolution by advancing autonomous driving.
- Opportunity to work on cutting-edge technologies with top talent in the field.
- Competitive compensation package.
- Snacks, lunches, and fun activities.
Listing Details
- First seen: March 26, 2026
- Last seen: May 8, 2026