AI / ML Engineer/Lead
Quick Summary
At Codvo, software and people transformations go hand-in-hand. We are a global empathy-led technology services company where product innovation and mature software engineering are embedded in our core DNA.
Generative AI Pipeline Development • Design and implement scalable and modular pipelines for data ingestion, transformation, and orchestration across GenAI workloads.
• Experience with OCR, document parsing, and layout-aware chunking. • Hands-on with MLOps and LLMOps tools for Generative AI. • Contributions to open-source GenAI or AI infrastructure projects.
About the Role
~1 min readYou will also be expected to apply modern LLMOps practices, handle schema-constrained generation, optimize cost and latency trade-offs, mitigate hallucinations, and ensure robust safety, personalization, and observability across GenAI systems.
Responsibilities
~1 min read- • Design and implement scalable and modular pipelines for data ingestion, transformation, and orchestration across GenAI workloads.
- • Manage data and model flow across LLMs, embedding services, vector stores, SQL sources, and APIs.
- • Build CI/CD pipelines with integrated prompt regression testing and version control.
- • Use orchestration frameworks like LangChain or LangGraph for tool routing and multi-hop workflows.
- • Monitor system performance using tools like Langfuse or Prometheus.
- • Develop systems to ingest unstructured (PDF, OCR) and structured (SQL, APIs) data.
- • Apply preprocessing pipelines for text, images, and code.
- • Ensure data integrity, format consistency, and security across sources.
- • Integrate external and internal LLM APIs (OpenAI, Claude, Mistral, Qwen, etc.).
- • Build internal APIs for smooth backend-AI communication.
- • Optimize performance through fallback routing to classical or smaller models based on latency or cost budgets.
- • Use schema-constrained prompting and output filters to suppress hallucinations and maintain factual accuracy.
- • Build hybrid RAG pipelines using vector similarity (FAISS/Qdrant) and structured data (SQL/API).
- • Design custom retrieval strategies for multi-modal or multi-source documents.
- • Apply post-retrieval ranking using DPO or feedback-based techniques.
- • Improve contextual relevance through re-ranking, chunk merging, and scoring logic.
- • Manage prompt engineering, model interaction, and tuning workflows.
- • Implement LLMOps best practices: prompt versioning, output validation, caching (KV store), and fallback design.
- • Optimize generation using temperature tuning, token limits, and speculative decoding.
- • Integrate observability and cost-monitoring into LLM workflows.
- • Design and maintain scalable backend services supporting GenAI applications.
- • Implement monitoring, logging, and performance tracing.
- • Build RBAC (Role-Based Access Control) and multi-tenant personalization.
- • Support containerization (Docker, Kubernetes) and autoscaling infrastructure for production.
Requirements
~1 min read- • 5+ years of experience in AI/ML engineering with end-to-end pipeline development.
- • Hands-on experience building and deploying LLM/RAG systems in production.
- • Strong experience with public cloud platforms (AWS, Azure, or GCP).
- • Proficient in Python and libraries such as Transformers, SentenceTransformers, PyTorch.
- • Deep understanding of GenAI infrastructure, LLM APIs, and toolchains like LangChain/LangGraph.
- • Experience with RESTful API development and version control using Git.
- • Knowledge of vector DBs (Qdrant, FAISS, Weaviate) and similarity-based retrieval.
- • Familiarity with Docker, Kubernetes, and scalable microservice design.
- • Experience with observability tools like Prometheus, Grafana, or Langfuse.
- • Knowledge of LLMs, VAEs, Diffusion Models, GANs.
- • Experience building structured + unstructured RAG pipelines.
- • Prompt engineering with safety controls, schema enforcement, and hallucination mitigation.
- • Experience with prompt testing, caching strategies, output filtering, and fallback logic.
- • Familiarity with DPO, RLHF, or other feedback-based fine-tuning methods.
- • Strong analytical, problem-solving, and debugging skills.
- • Excellent collaboration with cross-functional teams: product, QA, and DevOps.
- • Ability to work in fast-paced, agile environments and deliver production-grade solutions.
- • Clear communication and strong documentation practices.
Requirements
~1 min read- • Experience with OCR, document parsing, and layout-aware chunking.
- • Hands-on with MLOps and LLMOps tools for Generative AI.
- • Contributions to open-source GenAI or AI infrastructure projects.
- • Knowledge of GenAI governance, ethical deployment, and usage controls.
- • Experience with hallucination suppression frameworks like Guardrails.ai, Rebuff, or Constitutional AI.
Location & Eligibility
Listing Details
- First seen
- May 6, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 59%
- Scored at
- May 6, 2026
Signal breakdown
Please let codvo-team know you found this job on Jobera.
4 other jobs at codvo-team
View all →Explore open roles at codvo-team.
Similar Machine Learning Engineer jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.