Data Engineering on AWS is a 3-day intermediate course designed for professionals who want to master data engineering solutions using AWS services.

Through hands-on labs, theory, and real-world exercises, participants learn how to design, build, optimize, and secure modern data architectures — including data lakes, warehouses, and pipelines (batch and streaming).

The course prepares learners for the AWS Certified Data Engineer – Associate certification.

We can organize this training at your preferred date and location. Contact Us!

Prerequisites

Participants should have:

Basic understanding of machine learning concepts,
Familiarity with Python and common data libraries (NumPy, Pandas, Scikit-learn),
Knowledge of cloud computing and AWS fundamentals,
Experience with SQL recommended,
Familiarity with Git is beneficial.

Who Should Attend

This course is ideal for:

Data engineers and architects working with AWS,
Professionals building or managing large-scale data platforms,
Teams migrating on-premises data infrastructure to AWS,
Candidates preparing for the AWS Data Engineer Associate (DEA-C01) exam.

What You Will Learn

By the end of the course, participants will be able to:

Understand data engineering roles and key AWS services,
Design and implement data lakes and warehouses on AWS,
Apply performance tuning, orchestration, and security best practices,
Build and manage batch and streaming data pipelines,
Optimize data architectures for scalability, cost, and compliance.

Training Outline

Day 1

Module 1: Data Engineering Roles and Key Concepts

Role of a Data Engineer
Key functions of a Data Engineer
Data Personas
Data Discovery
AWS Data Services

Module 2: AWS Data Engineering Tools and Services

Orchestration and Automation
Data Engineering Security
Monitoring
Continuous Integration and Continuous Delivery
Infrastructure as Code
AWS Serverless Application Model
Networking Considerations
Cost Optimization Tools

Module 3: Designing and Implementing Data Lakes

Data lake introduction
Data lake storage
Ingest data into a data lake
Catalog data
Transform data
Server data for consumption
Hands-on lab: Setting up a Data Lake on AWS

Module 4: Optimizing and Securing a Data Lake Solution

Open Table Formats
Security using AWS Lake Formation
Setting permissions with Lake Formation
Security and governance
Troubleshooting
Hand-on lab: Automating Data Lake Creation using AWS Lake Formation Blueprints

Day 2

Module 5: Data Warehouse Architecture and Design Principles

Introduction to data warehouses
Amazon Redshift Overview
Ingesting data into Redshift
Processing data
Serving data for consumption
Hands-on Lab: Setting up a Data Warehouse using Amazon Redshift Serverless

Module 6: Performance Optimization Techniques for Data Warehouses

Monitoring and optimization options
Data optimization in Amazon Redshift
Query optimization in Amazon Redshift
Orchestration options

Module 7: Security and Access Control for Data Warehouses

Authentication and access control in Amazon Redshift
Data security in Amazon Redshift
Auditing and compliance in Amazon Redshift
Hands-on lab: Managing Access Control in Redshift

Module 8: Designing Batch Data Pipelines

Introduction to batch data pipelines
Designing a batch data pipeline
AWS services for batch data processing

Module 9: Implementing Strategies for Batch Data Pipeline

Elements of a batch data pipeline
Processing and transforming data
Integrating and cataloging your data
Serving data for consumption
Hands-on lab: A Day in the Life of a Data Engineer

Day 3

Module 10: Optimizing, Orchestrating, and Securing Batch Data Pipelines

Optimizing the batch data pipeline
Orchestrating the batch data pipeline
Securing the batch data pipeline
Hands-on lab: Orchestrating Data Processing in Spark using AWS Step Functions

Module 11: Streaming Data Architecture Patterns

Introduction to streaming data pipelines
Ingesting data from stream sources
Streaming data ingestion services
Storing streaming data
Processing Streaming Data
Analyzing Streaming Data with AWS Services
Hands-on lab: Streaming Analytics with Amazon Managed Service for Apache Flink

Module 12: Optimizing and Securing Streaming Solutions

Optimizing a streaming data solution
Securing a streaming data pipeline
Compliance considerations
Hands-on lab: Access Control with Amazon Managed Streaming for Apache Kafka

Why Choose Bilginç IT Academy

At Bilginç IT Academy, we combine our strong presence in both the UK and Türkiye to deliver high-quality, practical training solutions for organizations worldwide.

International Presence with Local Expertise
With operations in the United Kingdom and Türkiye, we bring together global standards and local market understanding to deliver effective training experiences across regions.

Expert Instructors with Real-World Experience
Our courses are delivered by certified trainers with extensive industry experience, ensuring you gain practical knowledge that can be applied immediately.

Corporate-Focused Training Approach
We specialize in training corporate teams, tailoring our programs to meet your organization’s goals, technologies, and project requirements.

Flexible Training Delivery Worldwide
We offer classroom, virtual classroom, and onsite training options globally, tailored to your organization’s needs.

Hands-On, Practical Learning
Our training sessions include real-world scenarios, case studies, and interactive exercises to ensure lasting understanding and skill development.

Proven Track Record
With over 10 years of experience, we have successfully trained professionals from leading organizations across different industries and regions.

Why have you chosen us?

I have attended a training from Bilginc IT Academy before and I was satisfied.

I have attended a training from a different provider and it was not helpful.

Other

How many employees do you have in your IT department?

0 – 50

50 – 250

250 – 1000

1000+

Data Engineering on AWS Training

Prerequisites

Who Should Attend

What You Will Learn

Training Outline

Why Choose Bilginç IT Academy

Avaible Training Dates