Associate Principal Engineer (India Office)
Quick Summary
Data professional with experience in building and validating end-to-end data pipelines across Azure Data Factory, Databricks, and ADLS. Skilled in data quality assurance,
Data professional with experience in building and validating end-to-end data pipelines across Azure Data Factory, Databricks, and ADLS. Skilled in data quality assurance, automation using Python and PySpark, and integrating tests into CI/CD workflows. Adept at large-scale data validation, monitoring data health, and ensuring reliable batch and streaming processes, with additional expertise in developing scalable BI dashboards and reporting solutions.
- Design and implement end-to-end data validation strategies for pipelines built on Azure Data Factory, Databricks (Delta Lake), and ADLS
- Perform source-to-target reconciliation across ingestion, transformation, and consumption layers
- Implement robust checks for Data Quality Dimensions
- Build scalable data testing frameworks using python, pyspark and good knowledge of automation tools like playwright & pyspark based automation testing.
- Integrate automated data tests into CI/CD pipelines (Azure DevOps / GitHub Actions)
- Implement data quality-as-code practices using tools like Soda, dbt etc.
- Validate Spark transformations, Delta Lake tables, and streaming pipelines
- Perform large-scale data validation using PySpark and SQL
- Optimize validation logic for high-volume datasets
- Ensure correctness of: Batch and streaming jobs, Incremental loads (CDC pipelines), Slowly changing dimensions (SCD)
- Validate data movement, orchestration workflows, and failure handling in Azure Data factory & Azure Databricks services
- Define and implement data quality SLAs and KPIs
- Build dashboards to track data health and pipeline reliability
- Proactively identify anomalies and data drifts
- Implement alerting mechanisms for data failures
- Design, develop, and optimize interactive dashboards and reports using BI tools (e.g., Power BI, Tableau, Looker).
- Translate business requirements into technical reporting solutions.
- Identify bottlenecks and improve report/query performance.
- Implement best practices for dashboard design and scalability.
- Establish BI standards, frameworks, and best practices.
Requirements
~1 min read- 8-10 years experience in enterprise data modeling and data architecture roles.
- Strong hands-on experience with Azure Data Factory & Pyspark in large-scale environments is preferred.
- Experience with Databricks (Delta Lake, Spark, Unity Catalog).
- Advanced SQL and strong understanding of pyspark scripts & using pyspark for Data validation.
- Experience integrating major enterprise applications (ERP, CRM, OMS, MDM, AR systems).
- Strong understanding of data governance, data quality, metadata, and lineage.
- Excellent communication skills across business and technical audiences.
- Analytics Platform: Databricks (Delta Lake, Unity Catalog), Microsoft PowerBI
- Languages: SQL, Spark SQL, Python / PySpark
- Governance/Data Quality: Unity Catalog/Informatica DG/DQ
#LI-KS1
Location & Eligibility
Listing Details
- Posted
- June 11, 2026
- First seen
- June 11, 2026
- Last seen
- June 11, 2026
Posting Health
- Days active
- 0
- Repost count
- 1
- Trust Level
- 45%
- Scored at
- June 11, 2026
Signal breakdown
Please let CSS know you found this job on Jobera.
4 other jobs at CSS
View all →Explore open roles at CSS.
Similar Principal Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.