Reinforcement learning engineer
Quick Summary
Dexmate is building the foundation for physical AI — a unified platform that combines high-quality robotic hardware with a universal Physical AI OS, making robots as easy to build and deploy as software.
Design and implement reinforcement learning algorithms for various robotics tasks Develop and optimize RL training pipelines in both simulation and real-world environments Collaborate with robotics engineers to integrate RL models into production…
Strong experience with reinforcement learning (PPO, SAC, TD3, DDPG, etc.) Hands-on experience with robotics systems (simulation or real robots) Proven track record applying RL to manipulation, locomotion, or navigation tasks Proficiency in Python…
Dexmate is building the foundation for physical AI — a unified platform that combines high-quality robotic hardware with a universal Physical AI OS, making robots as easy to build and deploy as software. Today, robotics is fragmented, slow, and closed: most builders are forced to reinvent the same stack again and again, and most ideas never make it past the prototype stage. We exist to change that. Our mission is to democratize robotics by lowering the barrier to entry, delivering a plug-and-play platform for developers, researchers, and enterprises, and cultivating an open ecosystem that accelerates the evolution of physical AI. If you want to help shape the next layer of human capability — and believe the future of robotics should be built together, not in isolation — we'd love to build it with you.
We're seeking Reinforcement Learning experts to develop and deploy cutting-edge RL algorithms that enhance our robots' capabilities.
Responsibilities
~1 min read- →
Design and implement reinforcement learning algorithms for various robotics tasks
- →
Develop and optimize RL training pipelines in both simulation and real-world environments
- →
Collaborate with robotics engineers to integrate RL models into production systems
- →
Conduct experiments to evaluate and improve algorithm performance
- →
Scale training infrastructure for efficient learning across multiple robots
Requirements
~1 min readStrong experience with reinforcement learning (PPO, SAC, TD3, DDPG, etc.)
Hands-on experience with robotics systems (simulation or real robots)
Proven track record applying RL to manipulation, locomotion, or navigation tasks
Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, JAX)
Strong understanding of robot kinematics, dynamics, and control
Experience with GPU-based simulation such as Isaac Gym, Isaac Lab, SAPIEN, etc.
Experience with distributed RL training systems
Experience with sim-to-real transfer techniques
Publications in robotics or RL conferences (CoRL, ICRA, RSS, NeurIPS, ICLR, ICML, etc.)
Location & Eligibility
Listing Details
- Posted
- January 19, 2026
- First seen
- May 6, 2026
- Last seen
- May 28, 2026
Posting Health
- Days active
- 21
- Repost count
- 0
- Trust Level
- 15%
- Scored at
- May 28, 2026
Signal breakdown
Please let dexmate know you found this job on Jobera.
4 other jobs at dexmate
View all →Explore open roles at dexmate.
Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.