The Azure Formula1 project implements a data pipeline that consumes data from the Ergast API and makes F1 driver and constructor standings available for Business Intelligence consumption. The pipeline is built on Microsoft Azure, with ADLS Gen2 as the data lake, Databricks/Spark as the data transformation framework, and Data Factory as the orchestrator.
The project ingests the following files from the Ergast API. Because each file arrives in a different format, each one requires a different approach with the Spark reader API.
| File Name | Format |
|---|---|
| Races | CSV |
| Constructors | Single Line JSON |
| Drivers | Single Line Nested JSON |
| Results | Single Line JSON |
| Pit Stops | Multi Line JSON |
| Lap Times | Split CSV Files |
| Qualifying | Split Multi Line JSON Files |
Ingestion requirements:
- Ingest all 8 files into the data lake.
- Ingested data must have a schema applied.
- Ingested data must have audit columns.
- Ingested data must be stored in a columnar format.
- It must be possible to analyze the ingested data via SQL.
- The ingestion logic must be able to handle incremental loads.
Transformation requirements:
- Join the key information required for reporting to create a new table.
- Join the key information required for analysis to create a new table.
- Transformed tables must have audit columns.
Analysis requirements:
- Driver standings for each year.
- Constructor standings for each year.
- Most dominant drivers over the years.
- Most dominant constructors over the years.
- Visualize the outputs.
Prerequisites:
- Microsoft Azure account
- Azure Databricks Service
- Azure Data Factory

