Data Science with R and SQL Server Training in Hong Kong

  • Learn via: Classroom / Virtual Classroom / Online
  • Duration: 4 Days
  • Price: Please contact for booking options
We can host this training at your preferred location. Contact us!

R is the most popular environment and language for statistical analyses, data mining, and machine learning. Managed and scalable version of R runs in SQL Server, Power BI, and Azure ML. The main topic of this 4-day course is the R language. However, the course also shows how to use the languages and tools available in MS BI suite for data science applications, including Python, T-SQL, Power BI, Azure ML, and Excel. The labs focus on R; the demos also show the code in other languages.

Attendees should have basic understanding of data analysis and basic familiarity with SQL Server tools.

Course Format

This seminar consists of instructor presentations and individual work during labs. During labs, the attendees use mainly the R language.

Every attendee gets a .PDF printout of all slides and all code and solutions for the demos presented and for the lab exercises.

In addition, every attendee gets an electronic version of the Data Science with SQL Server Quick Start Guide book by Dejan Sarka, Packt, 2018.

Each attendee works on a pre-prepared computer on a virtual machine with the following software pre-installed:

  • SQL Server 2017 or 2019 Database Engine with ML Services (In-Database)
  • AdventureWorksDW2017 demo database
  • Microsoft R Client
  • RStudio IDE
  • SQL Server Management Studio

Learning Outcomes

Attendees of this course learn to program with R from the scratch. Basic R code is introduced using the free R engine and RStudio IDE. A lifecycle of a data science project is explained in details. The attendees learn how to perform the data overview and do the most tedious task in a project, the data preparation task. After data overview and preparation, the analytical part begins with intermediate statistics in order to analyze associations between pairs of variables. Then the course introduces more advanced methods for researching linear dependencies.

Too many variables in a model can make its own problem. The course shows how to do feature selection, starting with the basics of matrix calculations. Then the course switches more advanced data mining and machine learning analyses, including supervised and unsupervised learning. The course also introduces the currently modern topics, including forecasting, text mining, and reinforcement learning.

Finally, the attendees also learn how to use the R code in SQL Server, Azure ML, and Power BI through labs, and how to use Python for inside all of the tools mentioned through demos.

Course Outline

Following an introduction the modules will be as follows:

  1. Introducing data science and R
    1. What are statistics, data mining, machine learning…
    2. Data science projects and their lifetime
    3. Introducing R
    4. R tools
    5. R data structures

Lab 1

  1. Introducing Python
    1. Basic syntax and objects
    2. Data manipulation with NumPy and Pandas
    3. Visualizations with matplotlib and seaborn libraries
    4. Data science with Scikit-Learn

Discussion: R vs Python

  1. Data overview
    1. Datasets, cases and variables
    2. Types of variables
    3. Introductory statistics for discrete variables
    4. Descriptive statistics for continuous variables
    5. Basic graphs
    6. Sampling, confidence level, confidence interval

Lab 2

  1. Data preparation
    1. Derived variables
    2. Missing values and outliers
    3. Smoothing and normalization
    4. Time series
    5. Training and test sets

Lab 3

  1. Associations between two variables and visualizations of associations
    1. Covariance and correlation
    2. Contingency tables and chi-squared test
    3. T-test and analysis of variance
    4. Bayesian inference
    5. Linear models

Lab 4

  1. Feature selection and matrix operations
    1. Feature selection in linear models
    2. Basic matrix algebra
    3. Principal component analysis
    4. Exploratory factor analysis

Lab 5

  1. Unsupervised learning
    1. Hierarchical clustering
    2. K-means clustering
    3. Association rules

Lab 6

  1. Supervised learning
    1. Neural Networks
    2. Logistic Regression
    3. Decision and regression trees
    4. Random forests
    5. Gradient boosting trees
    6. K-nearest neighbors

Lab 7

  1. Modern topics
    1. Support vector machines
    2. Time series
    3. Text mining
    4. Deep learning
    5. Reinforcement learning

Lab 8

  1. R in SQL Server and MS BI
    1. ML Services (In-Database) structure
    2. Executing external scripts in SQL Server
    3. Storing a model and performing native predictions
    4. R in Azure ML and Power BI

Lab 9



Contact us for more detail about our trainings and for all other enquiries!

Upcoming Trainings

Join our public courses in our Hong Kong facilities. Private class trainings will be organized at the location of your preference, according to your schedule.

Classroom / Virtual Classroom
04 July 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
15 July 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
08 August 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
19 August 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
19 August 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
23 August 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
24 August 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Classroom / Virtual Classroom
11 September 2024
Hong Kong, Kowloon, Tsuen Wan
4 Days
Data Science with R and SQL Server Training Course in Hong Kong

Hong Kong is officially known as the Hong Kong Special Administrative Region of the People's Republic of China (HKSAR) and is a city and special administrative region of China on the eastern Pearl River Delta in South China. Hong Kong is one of the most densely populated places in the world, with over 7.5 million population. The official languages of the HKSAR are Chinese and English. Hong Kong is a highly developed territory and ranks fourth on the United Nations Human Development Index and the residents of Hong Kong have the highest life expectancies in the world.

The best time to visit Hong Kong is from September to December, since the temperatures, averaging between 19 to 28 degree Celsius. During this outdoor activities-friendly travelling season, you can take a walk along Victoria Harbour, visit the islands of Lantau, Lamma and Cheung Chau and participate in the Mid-Autumn Festival. Top choices of the tourists to visit in Hong Kong are Big Buddha statue, Wong Tai Sin Temple, Repulse Bay and the Beaches and Hong Kong Disneyland.

Explore our diverse range of IT courses, encompassing programming, software development, cyber security, data science, business skills, and Agile/Scrum. Wherever you are in Hong Kong, our seasoned instructors will bring practical training and expert knowledge to your preferred training venue.
By using this website you agree to let us use cookies. For further information about our use of cookies, check out our Cookie Policy.