Modernizing Data Lakes and Data Warehouses with Google Cloud Training in United States of America

  • Learn via: Classroom
  • Duration: 1 Day
  • Level: Intermediate
  • Price: From €1,365+VAT
We can host this training at your preferred location. Contact us!

Data lakes and data warehouses are the two key components of any data pipeline. This course highlights use cases for each type of storage and examines the data lake and data warehouse solutions available on Google Cloud in technical detail. It also describes the role of a data engineer, explains the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment.

This is the first course of the Data Engineering on Google Cloud series. After completing this course, enroll in the Building Batch Data Pipelines on Google Cloud course.

Introduction to Data Engineering

This module discusses the role of data engineering and explains why data engineering should be done in the cloud.

  • Module introduction
  • The role of a data engineer
  • Data engineering challenges
  • Introduction to BigQuery
  • Data lakes and data warehouses
  • Transactional databases versus data warehouses
  • Partner effectively with other data teams
  • Manage data access and governance
  • Demo: Finding PII in your dataset with the DLP API
  • Build production-ready pipelines
  • Google Cloud customer case study
  • Lab Intro: Using BigQuery to do Analysis
  • LAB: Using BigQuery to do Analysis: In this lab, you analyze two public datasets, querying them first separately and then in combination, to derive insights.
  • QUIZ

Building a Data Lake

In this module, we describe what a data lake is and how to use Cloud Storage as your data lake on Google Cloud.

  • Module Introduction
  • Introduction to data lakes
  • Data storage and ETL options on Google Cloud
  • Build a data lake using Cloud Storage
  • Secure Cloud Storage
  • Store all sorts of data types
  • Cloud SQL as a relational data lake
  • Lab Intro: Loading Taxi Data into Google Cloud SQL
  • LAB: Loading Taxi Data into Google Cloud SQL 2.5: In this lab, you will import data from CSV text files into Cloud SQL and then carry out some basic data analysis using simple queries.
  • QUIZ

Building a Data Warehouse

In this module, we discuss BigQuery as a data warehousing option on Google Cloud.

  • Module Introduction
  • The modern data warehouse
  • Introduction to BigQuery
  • Demo: Querying TB of data in seconds
  • Get started with BigQuery
  • Load data into BigQuery
  • Lab Intro: Loading Data into BigQuery
  • LAB: Loading data into BigQuery: This lab focuses on how to ingest data into tables inside of BigQuery.
  • Explore schemas
  • Demo: Exploring Schemas
  • Schema design
  • Nested and repeated fields
  • Demo: Nested and repeated fields
  • Design the optimal schema for BigQuery
  • Lab Intro: Working with JSON and Array data in BigQuery
  • LAB: Working with JSON and Array data in BigQuery 2.5: In this lab you will work with semi-structured data (ingesting JSON, Array data types) inside of BigQuery. You will practice loading, querying, troubleshooting, and unnesting various semi-structured datasets.
  • Optimize with partitioning and clustering
  • Lab Intro: Partitioned Tables in BigQuery
  • LAB: Partitioned Tables in Google BigQuery: This lab focuses on how to query partitioned datasets and how to create your own dataset partitions to improve query performance and reduce cost.
  • Review
  • QUIZ



Contact us for more details about our training courses and for all other enquiries!

Upcoming Trainings

Join our public courses at our facilities in the United States of America. Private classes can be organized at the location of your preference, according to your schedule.

  • 10 January 2025 (1 Day), United States of America, Classroom / Virtual Classroom
  • 24 January 2025 (1 Day), United States of America, Classroom / Virtual Classroom
  • 28 January 2025 (1 Day), United States of America, Classroom / Virtual Classroom
  • 10 February 2025 (1 Day), United States of America, Classroom / Virtual Classroom

Price: €1,365 +VAT
Modernizing Data Lakes and Data Warehouses with Google Cloud Training Course in the United States

The United States of America (USA) is a country in North America and a federal republic of 50 states. At almost 9.8 million square kilometers, the United States is one of the world’s largest and most populous countries. While the capital is Washington, D.C., some of its well-known cities are New York, Los Angeles, Miami, Chicago, Orlando, Las Vegas, Dallas, San Francisco and Kansas City.

The country’s most iconic symbol is probably the Statue of Liberty in New York, which was a gift from France. Although English is the most widely used language in the United States, the country has no official language. Independent since July 4, 1776, the USA has the motto “In God We Trust”, and its current president is Joe Biden. Some of the best places to visit in the United States are the Grand Canyon, Yosemite, Maui, New Orleans, Honolulu, Zion National Park, Kauai, Lake Tahoe, Aspen, Big Sur and Santa Fe.

Achieve your IT goals through our versatile courses, spanning programming, data analytics, software development, business skills, cloud computing, cybersecurity, and project management. Benefit from the flexibility of hosting training at your preferred location within the United States, where our experienced instructors will provide hands-on learning and practical expertise.