Practical Machine Learning for Cloudera Platform Training

  • Learn via: Classroom / Virtual Classroom / Online
  • Duration: 1 Day
  • Download PDF
  • We can host this training at your preferred location. Contact us!

Cloudera University’s one-day Introduction to Machine Learning with Spark ML and MLlib will teach you the key language concepts to machine learning, Spark MLlib, and Spark ML. The course includes coverage of collaborative filtering, clustering, classification, algorithms, and data volume.

There are no prerequisites for this course.

This course is intended for software engineers who have basic Linux experience in addition to experience with either the Scala or Python programming languages (code examples and exercises are presented in both languages, so students can choose whichever language they prefer).

Through instructor-led discussion, as well as hands-on exercises, participants will learn topics including: 

  • Data types, statistics support, feature extraction, transforming vectors, using the StandardScaler class 
  • An overview of dimensionality reduction 
  • Machine learning models, regression, linear regression support, and regularization. 
  • Finally, the course discusses machine learning with Spark ML topics such as using data frames, transformers and estimators, an introduction to pipelines, using pipelines to generate models, and regularization.

1. Machine Learning Overview

  • Introduction
  •  Collaborative Filtering
  •  Clustering
  •  Classification
  •  Relationship of Algorithms and Data Volume

2. Machine Learning with Spark MLlib

  • Introduction
  •  Data Types
  •  Basic Statistics
  •  Feature Extraction
  •  Dimensionality Reduction
  •  Models
  •  Regression

3. Machine Learning with Spark ML

  • Overview of Spark ML
  •  DataFrames
  •  Transformers and Estimators
  •  Pipelines
  •  Decision Tree Classifiers
  •  k-Means Clustering


Contact us for more detail about our trainings and for all other enquiries!