With the advent of big data, there is an increased focus on data mining and the value that can be derived from large data sets. Data mining is the process of selecting, exploring, and modeling large amounts of data to uncover previously unknown information for business benefit.
R is an open-source software environment for statistical computing and graphics, and it is very popular with data scientists. R is used for data analysis, extracting and transforming data, fitting models, drawing inferences, making predictions, plotting, and reporting results. Learn R basics and how to work with data frames, reshape data, compute basic statistics, create graphs, fit linear and non-linear models, perform clustering, and run model diagnostics.
There are no prerequisites for this course.
This course is intended for anyone who wants to use data mining techniques to find insights in data and who has at least some statistical and programming experience.
Modules:
Module 1 Overview
Module 2 Overview
Module 3 Overview
Module 4 Overview
Module 5 – Summary