Data Engineering Specialization

This repository contains resources for the Data Engineering Specialization, a MOOC offered by deeplearning.ai that provides a comprehensive guide to developing and deploying data systems that deliver real business value. It includes course subtitles and solutions to the lab exercises for all courses in the specialization.

Specialization Overview

This specialization equips learners with the skills to approach data engineering problems systematically, focusing on the full data engineering lifecycle. Key topics include building effective data pipelines, designing scalable architectures, and applying data transformations to serve business needs.

Courses in the Specialization

1. Introduction to Data Engineering

Think critically about the components of an end-to-end data architecture that satisfies requirements while remaining flexible for the future.
Evaluate technologies and tools against the context of requirements and good data architecture.
Design a data architecture and implement a batch and streaming pipeline on AWS.

2. Source Systems, Data Ingestion, and Pipelines

Identify data formats and appropriate source systems for various use cases.
Differentiate between relational and NoSQL databases, and understand ACID compliance.
Perform batch and streaming data ingestion using ETL and ELT patterns.
Interact with REST APIs, object storage, and event-streaming platforms for ingestion.
Learn DataOps concepts such as CI/CD, Infrastructure as Code, and observability.
Orchestrate data pipelines using Airflow and integrate data quality tests.

3. Data Storage and Queries

Explore storage systems (object, block, and file) and their impact on performance.
Understand the differences between row-oriented and column-oriented databases.
Learn about data warehouses, data lakes, and the lakehouse architecture.
Implement advanced SQL queries and strategies for query performance optimization.
Execute aggregate and join queries on streaming data.

4. Data Modeling, Transformation, and Serving

Define data modeling techniques and apply normalization to data.
Understand warehouse modeling approaches (Inmon, Kimball, Data Vault) and transform data for analytics.
Prepare tabular, textual, and image data for machine learning models.
Compare batch and streaming transformation frameworks such as Spark and Pandas.
Serve processed data to stakeholders using modern architectures.

Repository Contents

Transcripts
Lab solutions

Disclaimer

This repository is intended for educational purposes only. The contents of this repository, including subtitles, are not my own. The solutions to the labs are provided for reference and are not endorsed by the course instructors or platform. Please adhere to the course’s honor code and guidelines when using these resources.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
1. Introduction to Data Engineering		1. Introduction to Data Engineering
2. Source Systems, Data Ingestion, and Pipelines		2. Source Systems, Data Ingestion, and Pipelines
3. Data Storage and Queries		3. Data Storage and Queries
4. Data Modeling, Transformation, and Serving		4. Data Modeling, Transformation, and Serving
.gitignore		.gitignore
README.md		README.md
specialization certificate.pdf		specialization certificate.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data Engineering Specialization

Specialization Overview

Courses in the Specialization

1. Introduction to Data Engineering

2. Source Systems, Data Ingestion, and Pipelines

3. Data Storage and Queries

4. Data Modeling, Transformation, and Serving

Repository Contents

Disclaimer

About

Uh oh!

Releases

Packages

Languages

mshisheh/data-engineering-professional-certificate

Folders and files

Latest commit

History

Repository files navigation

Data Engineering Specialization

Specialization Overview

Courses in the Specialization

1. Introduction to Data Engineering

2. Source Systems, Data Ingestion, and Pipelines

3. Data Storage and Queries

4. Data Modeling, Transformation, and Serving

Repository Contents

Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages