Data Scientist (Remote)
Quick Summary
Data Scientist
Model Development & Training Maintain and improve the physics-based simulation engine — 19 equipment families, 64+ fault signatures, first-principles governing equations Run model training pipelines — dataset generation, feature engineering, model…
At Codvo, we are committed to building scalable, future-ready data platforms that power business impact. We believe in a culture of innovation, collaboration, and growth, where engineers can experiment, learn, and thrive. Join us to be part of a team that solves complex data challenges with creativity and cutting-edge technology.
Model development, training pipeline, and analytics backend. Works in close coordination with
the on-site Data Scientist — the on-site person provides site context and validation feedback,
the offshore person implements model improvements, retraining logic, and drift detection.
Responsibilities
~1 min read- Maintain and improve the physics-based simulation engine — 19 equipment families,
- 64+ fault signatures, first-principles governing equations
- Run model training pipelines — dataset generation, feature engineering, model fitting, hyperparameter tuning, MLflow experiment tracking
- Implement model retraining triggers — drift detection (PSI-based), accuracy degradation monitoring, scheduled recalibration
- Build and maintain the champion/challenger evaluation framework — shadow scoring, A/B testing, promotion guardrails
- Develop new fault signatures as customer feedback identifies gaps
- Implement probability calibration — Platt scaling, isotonic regression, ECE monitoring
- Build the adaptive threshold controller — feedback-driven alarm threshold adjustment based on false alarm rate and recall
- Develop the CMMS label linking pipeline — match work orders to predictions with confidence scoring
- Analyze prediction outcomes — precision, recall, F1 by equipment family, by fault type, by site
- Produce the weekly and monthly accuracy reports
- Define and maintain feature sets for each equipment family — physics-informed features, rolling statistics, cross-tag correlations
- Monitor data quality metrics — null rates, stale timestamps, schema violations, sensor drift
- Build the healthy baseline update pipeline — daily computation of per-tag statistics from healthy operating data
- Implement the training data snapshot pipeline — versioned, reproducible dataset extraction with manifest tracking
- 4+ years in machine learning engineering or applied data science
- Strong Python skills — pandas, scikit-learn, XGBoost/LightGBM, MLflow
- Experience with time-series data, anomaly detection, or predictive maintenance modeling
- Understanding of model deployment patterns — model registry, versioning, A/B testing, canary deployments
- Experience with statistical process control, calibration, or reliability engineering is a plus
Location & Eligibility
Listing Details
- First seen
- May 6, 2026
- Last seen
- May 8, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 46%
- Scored at
- May 6, 2026
Signal breakdown
Please let codvo-team know you found this job on Jobera.
4 other jobs at codvo-team
View all →Explore open roles at codvo-team.
Similar Data Scientist jobs
View all →Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.