Job Title: SDET – Data / ETL (Offshore)
Experience: 6+ Years
Location: Remote
Duration: ~3 Months (Contract)
Notice Period: Immediate Joiners Preferred
Role Overview
We are seeking a highly skilled SDET – Data/ETL to support testing and validation of complex data pipelines and ETL workflows. The ideal candidate will have strong expertise in PySpark, Python, and SQL, along with hands-on experience in data validation, reconciliation, and large-scale data testing across cloud platforms.
Key Responsibilities
- Perform end-to-end testing and validation of data pipelines and data extract workflows.
- Validate ETL transformations and ensure accurate data reconciliation between source and target systems.
- Develop and maintain automated validation scripts using PySpark and Python.
- Execute SQL-based data validation and reconciliation across large datasets.
- Support and enhance automation frameworks for data pipeline testing.
- Collaborate closely with Data Engineering, QA, and Product teams in Agile environments.
- Identify defects, perform troubleshooting, and support root cause analysis (RCA).
- Ensure data quality, integrity, and consistency across all stages of data processing.
Required Skills
- Strong hands-on experience with PySpark and Python.
- Advanced proficiency in SQL for data validation and reconciliation.
- Proven experience in ETL/Data pipeline testing.
- Experience building automation scripts/frameworks for data validation.
- Familiarity with cloud data platforms (AWS preferred).
- Strong understanding of data quality, data validation, and large dataset testing.
- Experience working in Agile/Scrum environments.
- Hands-on experience with Databricks, Spark pipelines, or AWS Glue.
Preferred Skills
- Experience in Healthcare or Payer domain.
- Exposure to data lake/lakehouse architectures.
- Experience with CI/CD pipelines for data testing.
Key Competencies
- Strong analytical and problem-solving skills
- Ability to work independently in a remote setup
- Excellent communication and collaboration skills