Employment Type
Full Time
Data Engineer (US Healthcare)
About the Role
We are looking for an offshore Data Engineer with experience in US healthcare data to help design, build, and maintain scalable data infrastructure that powers analytics, reporting, and operational decision making. This role partners closely with US based analytics and product teams and requires strong technical fundamentals, a strong data modeling mindset, and familiarity with healthcare data privacy standards.
What You Will Do
- Build, optimize, and maintain reliable data pipelines that support analytics, reporting, and downstream applications
- Translate raw healthcare data (claims, eligibility, enrollment, clinical, and financial data) into well defined core data models that serve as a source of truth for analytics and reporting
- Design and develop reusable, abstracted data models and datasets to enable consistent analytics and self service reporting
- Define and implement a federated data validation and data quality framework to proactively identify inconsistencies and anomalies
- Partner with analytics and business stakeholders to understand healthcare data semantics and translate business requirements into scalable technical solutions
- Ensure data privacy, security, and compliance through adherence to best practices and regulatory requirements
- Monitor, troubleshoot, and improve data pipeline performance and reliability
Technologies We Use
- SQL
- Python
- Microsoft Fabric (Lakehouse, Pipelines, Warehouses)
- Cloud based data warehouses and orchestration tools
What You Will Bring
- 2 plus years of experience in data engineering or software engineering
- Strong expertise in SQL and Python
- Hands on experience building scalable data pipelines, query optimization, and defining reusable datasets
- Experience translating raw healthcare data sources (claims, eligibility, enrollment, clinical, and financial files) into normalized and analytics ready core data models
- Experience with distributed data processing, data warehousing, and data governance concepts
- Experience working with orchestration tools (Airflow preferred)
- Experience with relational databases (PostgreSQL preferred) and cloud data warehouses (Snowflake preferred)
- Familiarity with medallion or event driven data architectures and telemetry or monitoring practices
- Experience working with US healthcare or insurance data
- Understanding of data security best practices and HIPAA compliance requirements
- Strong attention to data quality, documentation, and operational excellence
- Ability to collaborate effectively with US based teams across time zones