codvo-team
codvo-team~2d ago
New

Data Scientist (Remote)

IndiaIndia·PuneRemotemid
Data ScientistData
0 views0 saves0 applied

Quick Summary

Overview

Data Scientist

Key Responsibilities

Model Development & Training Maintain and improve the physics-based simulation engine — 19 equipment families, 64+ fault signatures, first-principles governing equations Run model training pipelines — dataset generation, feature engineering, model…

Technical Tools
pandaspythonscikit-learnab-testingmachine-learning

At Codvo, we are committed to building scalable, future-ready data platforms that power business impact. We believe in a culture of innovation, collaboration, and growth, where engineers can experiment, learn, and thrive. Join us to be part of a team that solves complex data challenges with creativity and cutting-edge technology.

Model development, training pipeline, and analytics backend. Works in close coordination with

the on-site Data Scientist — the on-site person provides site context and validation feedback,

the offshore person implements model improvements, retraining logic, and drift detection.

Responsibilities

~1 min read
  • Maintain and improve the physics-based simulation engine — 19 equipment families,
  • 64+ fault signatures, first-principles governing equations
  • Run model training pipelines — dataset generation, feature engineering, model fitting, hyperparameter tuning, MLflow experiment tracking
  • Implement model retraining triggers — drift detection (PSI-based), accuracy degradation monitoring, scheduled recalibration
  • Build and maintain the champion/challenger evaluation framework — shadow scoring, A/B testing, promotion guardrails
  • Develop new fault signatures as customer feedback identifies gaps
  • Implement probability calibration — Platt scaling, isotonic regression, ECE monitoring
  • Build the adaptive threshold controller — feedback-driven alarm threshold adjustment based on false alarm rate and recall
  • Develop the CMMS label linking pipeline — match work orders to predictions with confidence scoring
  • Analyze prediction outcomes — precision, recall, F1 by equipment family, by fault type, by site
  • Produce the weekly and monthly accuracy reports
  • Define and maintain feature sets for each equipment family — physics-informed features, rolling statistics, cross-tag correlations
  • Monitor data quality metrics — null rates, stale timestamps, schema violations, sensor drift
  • Build the healthy baseline update pipeline — daily computation of per-tag statistics from healthy operating data
  • Implement the training data snapshot pipeline — versioned, reproducible dataset extraction with manifest tracking
  • 4+ years in machine learning engineering or applied data science
  • Strong Python skills — pandas, scikit-learn, XGBoost/LightGBM, MLflow
  • Experience with time-series data, anomaly detection, or predictive maintenance modeling
  • Understanding of model deployment patterns — model registry, versioning, A/B testing, canary deployments
  • Experience with statistical process control, calibration, or reliability engineering is a plus

Location & Eligibility

Where is the job
Pune, India
Remote within one country
Who can apply
IN

Listing Details

First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
46%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

codvo-teamData Scientist (Remote)