Ray / Distributed AI Scheduling Engineer
We are looking for an experienced Ray / Distributed AI Scheduling Engineer. This role owns the distributed AI scheduling layer.
What This Person Will Build
- Ray-based distributed task graphs for inference, training, RAG, vector search, document retrieval, video, and compound workloads.
- Ray head and worker deployment patterns on Kubernetes.
- Per-hardware workload drivers for GPU inference, vector search, storage retrieval, video acceleration, and edge inference.
- Model-locality and cache-awareness logic so workloads run near the right model or data.
- Integration between scheduling decisions, capacity state, SLA rules, billing events, and observability traces.
- End-to-end workload tests for compound RAG and multi-plane AI jobs.
Requirements
- Strong Python distributed-systems experience.
- Real Ray experience, preferably production Ray, KubeRay, or distributed AI workload orchestration.
- Understanding of AI inference, training, RAG pipelines, model serving, model locality, and accelerator-aware scheduling.
- Kubernetes experience for deploying distributed compute systems.
- Debugging ability across Python, Kubernetes, GPUs, network movement, and application latency.
Nice to Have
- Experience with vLLM, TensorRT-LLM, KubeRay, vector search, RDMA storage, GPU scheduling, video pipelines, edge inference, or large-scale model-serving platforms.
Send your Resume / CV to apply online
Founded in California in 2004, Codeminders develops software for high-tech companies in Silicon Valley. Our expertise spans a wide range of industries, with a primary focus on modern technologies such as AI, mobile applications, video conferencing, and cloud computing.
As a member of the Codeminders team, you’ll have the unique opportunity to work on innovative projects. Whether it’s collaborating with dynamic startups or established companies serving millions of users, every project is a chance to shape the future of technology.
We believe in empowering our team members with the tools, opportunities, and culture to thrive. Here's what you can expect when you join us:
- Innovation at Its Core: Work on transformative projects that utilize the latest technologies, tools, and methodologies.
- Global Collaboration: Partner with world-class engineers from both the US and Ukraine, fostering diverse perspectives and international exposure.
- Strong Ethical Foundation: We stand firm in our values by maintaining zero business ties with Russia, Belarus, and temporarily occupied Ukrainian territories (Crimea, Donbas, etc.).
What We Offer
Location & Eligibility
Listing Details
- Posted: April 30, 2026
- First seen: May 8, 2026
- Last seen: May 8, 2026