Data Handling in Python Training in South Africa

  • Learn via: Classroom
  • Duration: 3 Days
  • Level: Intermediate
  • Price: From €2,580+VAT
We can host this training at your preferred location. Contact us!

This three day course is aimed at those wishing to learn how to use Python to work with and handle Data. When combined with our Introduction to Data Science course you would be set up well to follow a Python learning journey into Data Engineering, Advanced Data Analytics, Data Science, Machine Learning, and Artificial Intelligence.

During the programme you will be introduced to Python and specific development environments and packages for working with Data, with a focus on NumPy, Pandas, Matplotlib, and Seaborn.

Along the way you will see how to clean and manipulate tabular data, apply simple statistical techniques and data visualisations, and learn about how to control the flow of your program in order to automate processes.

Throughout the course you will engage with activities and discussions with one of our Data Science technical specialists and complete technical lab activities to practice the techniques you have learnt and develop ideas for further practice.

No prior experience with Python is necessary, though it is assumed that you will be familiar with core data concepts such as simple table structures and data types – all the pre-requisites you need are covered by our Data Essentials course.

Target Audience

This course is intended for Data Analysts, Data Engineers, Data Ops roles, and those training to consume AI Services or become Data Scientists and tune and develop Machine Learning and AI models on our subsequent Data Science learning pathway.

This course covers the key pre-requisites for a large range of further learning opportunities involving Python, Data, and AI.

  • Benefit from the speed and functionality of the NumPy and Pandas python packages
  • Create and control Data Visualisations using Matplotlib and Seaborn
  • Use Python with the Jupyter development environment
  • Retrieve, clean, and prepare data from multiple types of sources
  • Gain a firm grounding in Python with Data in order to progress to further study to connect to AI models, Engineer data pipelines, and develop Data Science solutions

1. Introduction to Programming for Data Handling

  • Describe the pros and cons of using programming languages to work with data
  • Identify the languages most suitable for data handling
  • Explain the challenges of using programming languages versus data analysis tools

2. Introduction to Python and IDEs

  • Describe the key attributes of the Python programming language.
  • Explain the role of the Jupyter IDE for Python programming.
  • Use the Jupyter IDE to write a basic Python program.
  • Write a program which uses string, integer, float and boolean data types.

3. Data Structures, Flow Control, Functions, and Basic Types

  • Construct collections to solve data problems.
  • Utilise selection and iteration syntax to control the flow of a Python program.
  • Write reusable functions which can be used to alter data & automate repetitive tasks.
  • Use Python's built-in open function to create, read, and edit files.

4. Mathematical and Statistical Programming with NumPy

  • Describe the core features of NumPy arrays.
  • Create, index, and manipulate NumPy arrays to solve data problems.
  • Use masking and querying syntax to retrieve desired values.
  • Use vectorised ufuncs.

5. Introduction to Pandas

  • Create, manipulate, and alter Series and DataFrames with Pandas.
  • Define and change the indices of Series & Dataframes.
  • Use Pandas' functions and methods to change column types, compute summary statistics and aggregate data.
  • Read, manipulate, and write data from csv, xlsx, json and other structured file formats.

6. Data Cleaning with Pandas

  • Identify missing data and apply techniques to deal with it.
  • Deduplicate, transform and replace values.
  • Use DataFrame string methods to manipulate text data.
  • Write regular expressions which munge text data.

7. Data Manipulation with Pandas

  • Construct Pivot tables in Pandas.
  • Time series manipulation.
  • Stream data into Pandas to handle data size problems.

8. Methods for Visualising Data

  • Construct and tailor basic data visualisations using Matplotlib & Seaborn for both numeric & non-numeric data.
  • Meaningfully visualise aggregate data using Matplotlib and Seaborn.

Related learning

Data Science Learning Pathways can be selected by choosing either Python or R and a Cloud Platform certification:

  • QAIDSDP Introduction to Data Science for Data Professionals
  • Sourcing and handling data:
    • QADHPYTHON Data Handling with Python
    • QADHR Data Handling with R
    • QAPDHAI Python Data Handling with AI APIs
  • Statistics for Data Analysis:
    • QASDAPY Statistics for Data Analysis with Python
    • QASDAR Statistics for Data Analysis with R
  • Programming and Software Development skills:
    • QAPYTH3 Python Programming
    • QARPROG R Programming
  • Machine Learning Development:
    • QADSMLP Data Science and Machine Learning with Python
    • QADSMLR Data Science and Machine Learning with R
  • Mathematics for Developing Algorithms for AI models, Big Data Mining, and working with Neural Networks:
    • QAMFDS Mathematics for Data Science
  • Forecasting:
    • QATSFP Time Series and Forecasting with Python
    • QATSFR Time Series and Forecasting with R

Suggested courses leading to Certification:

  • MDP100 Designing and Implementing a Data Science Solution on Azure (DP-100)
  • AMWSMLP Machine Learning Pipelines on AWS
  • GCPMLGC Machine Learning on Google Cloud


Contact us for more detail about our trainings and for all other enquiries!

Upcoming Trainings

Join our public courses in our South Africa facilities. Private class trainings will be organized at the location of your preference, according to your schedule.

13 January 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
€2,580 +VAT
Book Now
03 February 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
€2,580 +VAT
Book Now
09 February 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
13 January 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
€2,580 +VAT
Book Now
03 February 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
€2,580 +VAT
Book Now
09 February 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
07 March 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
11 March 2025 (3 Days)
Cape Town, Durban, Johannesburg
Classroom / Virtual Classroom
Data Handling in Python Training Course in South Africa

Formerly known as Union of South Africa, now officially known as Republic of South Africa is the Southernmost country in Africa. South Africa's population is over 60 million people, which makes the country the world's 23rd-most populous nation. South Africa has three capital cities: executive Pretoria, judicial Bloemfontein and legislative Cape Town, while the largest city is Johannesburg. The official languages of South Africa are Afrikaans, English, Ndebele, Pedi, Sotho, Swati, Tsonga, Tswana, Venda, Xhosa and Zulu.

South Africa can be rainy from November to February, so the best time to visit South Africa is from May to September. Despite the rainy season South Africa is a year-round destination, with varying regional climates. Blyde River Canyon, Durban, Drakensberg, Kruger National Park and of course, Cape Town are the tourist attractions of the country.

Expand your IT knowledge with our comprehensive range of courses, including programming, software development, business skills, data science, cybersecurity, cloud computing and virtualization. Our skilled instructors will facilitate hands-on training and share practical insights, all conveniently conducted at your preferred location within South Africa.
By using this website you agree to let us use cookies. For further information about our use of cookies, check out our Cookie Policy.