1. Introduction to Mathematics for Data Science
- Discuss the source code in commonly used Data Science Python or R libraries
- Identify use cases for a range of Mathematical techniques used in Data Science, Machine Learning, Big Data, AI, and Quantum Computing such as Probability and Statistics, Linear Algebra (Vectors and Matrices), Calculus, Graphs and Networks, and Complex Numbers.
2. Matrices and Programming
- Describe the structure of a Matrix.
- Use Python NumPy Arrays or R Matrices to build Mathematically structured Matrices
- Explore NumPy or R functions to create various Matrices
3. Matrix Arithmetic
- Explain the conditions that matrices need to meet in order to apply mathematical operations.
- Apply Arithmetic with matrices using NumPy or R
- Apply Multiplication with scalars and matrices using NumPy or R
4. Inverses and Solving Simultaneous Equations
- Apply a Multiplicative Inverse for dividing matrices
- Calculate the determinant of a matrix
- Solve Simultaneous equations using matrices
- Interpret the meaning of a zero determinant in the context of solving simultaneous equations
5. Matrix Transformations
- Identify how matrices can be transformed
- Perform multiple vector transformations using NumPy or R
- Calculate and interpret eigenvalues and eigenvectors
6. Vectors and Dot Product
- Identify the purpose of vectors and dot products in Data Science
- Create and visualise vectors using NumPy or R
- Calculate dot products and use visualisations to explain how dot product can be used to indicate how similar two columns or rows of data are
7. Vector Spaces
- Describe what a vector basis is
- Identify possible vectors that can be used to form a basis for a particular space and choose the most efficient basis
- Use dot product to check the vectors in the basis are orthogonal
8. Differentiation and Gradients
- Identify notation used for differentiation
- Describe how gradients can be calculated numerically (using NumPy or R) or algebraically
- Discuss stationary points and the second differential
- Recognise the Chain rule and Partial Differentiation
9. Gradient Descent
- Identify types of stationary points
- Describe the concept behind a gradient descent algorithm
- Use differentiation and NumPy or R to locate a minimum point using a gradient descent algorithm
10. Linear Regression with Gradient Descent
- Calculate residuals and identify functions that could be used to aggregate these to select a loss function for performing linear regression
- Use gradient descent to calculate a regression line
11. Set Building and Mathematical Notation
- Identify a wide range of mathematical notation and its purpose
- Create sets using set builder notation
- Identify symbols used for common functions and processes
- Practice interpreting formulae or expressions in mathematical journal articles or books
12. Graphs, Networks, Deep Learning and Neural Network Calculations
- Investigate software packages for Deep Learning
- Identify parts of Graphs and Networks using Mathematical Terminology
- Investigate how Graphs and Networks are used to provide an abstraction to a problem in order to interpret algorithms including Big Data Analysis problems
- Examine how Matrices and Calculus are used to carry out Neural Network Calculations
13. Complex Numbers and Quantum Computing Development
- Store complex numbers in either Python or R
- Explain the meaning of i, complex numbers, complex conjugates, and perform arithmetic calculations with complex numbers by checking results using Python or R
- Match quantum computing terms to their meanings. Including qubit, state vector, Hilbert space, superposition, quantum gate, measurement, quantum circuit, unitary transformation
- Represent quantum gates as matrices and calculate the effect of transforming basis states.
Related learning
Data Science Learning Pathways can be selected by choosing either Python or R and a Cloud Platform certification:
- QAIDSDP Introduction to Data Science for Data Professionals
- Sourcing and handling data:
- QADHPYTHON Data Handling with Python
- QADHR Data Handling with R
- QAPDHAI Python Data Handling with AI APIs
- Statistics for Data Analysis:
- QASDAPY Statistics for Data Analysis with Python
- QASDAR Statistics for Data Analysis with R
- Programming and Software Development skills:
- QAPYTH3 Python Programming
- QARPROG R Programming
- Machine Learning Development:
- QADSMLP Data Science and Machine Learning with Python
- QADSMLR Data Science and Machine Learning with R
- Forecasting:
- QATSFP Time Series and Forecasting with Python
- QATSFR Time Series and Forecasting with R
Suggested courses leading to Certification:
- MDP100 Designing and Implementing a Data Science Solution on Azure (DP-100)
- AMWSMLP Machine Learning Pipelines on AWS
- GCPMLGC Machine Learning on Google Cloud