Job Description
We’re seeking a certified Data Engineer to lead the way in turning complex data into real healthcare impact. You’ll build smart pipelines, manage public data ingestion, and help shape decision-making across hospitals and health systems.
What You’ll Do:
Build scalable ETL pipelines using Databricks (incl. Azure Databricks)
Automate and optimize monthly data updates from public sources
Clean, transform, and prep data for analysis and business insight
Partner with stakeholders to define data needs and deliver solutions
Ensure high data quality and pipeline reliability
Document workflows and follow best practices for cloud data engineering
What You Bring:
Solid experience with Databricks, Spark, and Python
Skilled in building efficient ETL pipelines
Hands-on with web data extraction (APIs, scraping, bulk downloads)
Comfortable with both structured and unstructured data
Strong cloud experience (Azure preferred)
SQL proficiency and data validation know-how