This project demonstrates an end-to-end healthcare data pipeline on Azure, built with Azure Data Factory, Azure Data Lake Storage, Azure Databricks, and Azure SQL Database.
- Azure Data Factory (Orchestration)
- Azure Data Lake Storage (Raw & Processed Data)
- Azure Databricks (Transformation/Analytics)
- Azure SQL Database (Reporting Layer)
- Infrastructure as Code (Bicep)
- Python (ETL scripts, Databricks jobs)
- pipelines/ – Azure Data Factory pipeline definitions
- src/ – ETL Python scripts
- infra/ – Bicep templates for infrastructure deployment
- notebooks/ – Databricks notebooks for analysis
- config/ – Pipeline configuration
- tests/ – Unit tests
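The ETL scripts in src/ might follow a pattern like the minimal sketch below. The field names (patient_id, visit_date), the clean_records helper, and the date format are illustrative assumptions, not taken from this repository:

```python
import csv
import io
from datetime import datetime

def clean_records(raw_rows):
    """Normalize raw patient rows: parse visit dates to ISO 8601 and
    drop rows missing a patient_id. Field names are hypothetical."""
    cleaned = []
    for row in raw_rows:
        if not row.get("patient_id"):
            continue  # skip rows we cannot attribute to a patient
        row["visit_date"] = datetime.strptime(
            row["visit_date"], "%m/%d/%Y"
        ).date().isoformat()  # e.g. "03/15/2024" -> "2024-03-15"
        cleaned.append(row)
    return cleaned

# Example: cleaning a small in-memory CSV extract
raw_csv = "patient_id,visit_date\nP001,03/15/2024\n,01/02/2024\n"
rows = list(csv.DictReader(io.StringIO(raw_csv)))
print(clean_records(rows))  # the row with no patient_id is dropped
```

The same cleaning function can be unit-tested under tests/ and invoked from a Databricks job without modification, since it operates on plain dictionaries.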
- Deploy the infrastructure:

  az deployment sub create --location <region> --template-file infra/main.bicep --parameters infra/parameters.json

- Configure Azure Data Factory using pipelines/adf_pipeline.json.
- Run the ETL scripts locally or as Databricks jobs.
- Explore and analyze data using provided notebooks.
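A script meant to run both locally and as a Databricks job typically reads its settings from config/ and falls back to defaults when the file is absent. A minimal sketch, assuming a JSON config file; the file name (pipeline_config.json) and the keys shown are hypothetical:

```python
import json
import os

def load_config(path="config/pipeline_config.json"):
    """Load pipeline settings from a JSON file, falling back to
    defaults so the script also runs locally without the file.
    The file name and keys here are hypothetical."""
    defaults = {"input_path": "raw/", "output_path": "processed/"}
    if os.path.exists(path):
        with open(path) as f:
            defaults.update(json.load(f))  # file values override defaults
    return defaults

cfg = load_config()
print(cfg["input_path"], "->", cfg["output_path"])
```

Keeping configuration out of the code this way lets the same script target different storage paths in development and production runs.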