Data Pipelines: Workflow and Dataflow for Todays Data Architectures Training in South Africa

  • Learn via: Classroom / Virtual Classroom / Online
  • Duration: 1 Day
  • Price: Please contact for booking options
We can host this training at your preferred location. Contact us!

Data-driven is the modern mantra of business management, but enabling a data-driven organization is complex and challenging. Abundant data sources and multiple use cases result in many data pipelines—maybe as many as one for each use case. Capabilities to find the right data, manage data flow and workflow, and deliver the right data in the right forms for analysis are essential for all organizations that seek to become data-driven.

Multiple and complex data pipelines can quickly become chaotic under pressure from agile development, democratization, self-service, and organizational pockets of analytics. The resulting difficulty in governance and uncertainty of data usage are only the beginning of the troubles. Therefore, data pipeline management must ensure that data analysis results are traceable, reproducible, and of production strength, whether enterprise-level or self-service. Robust pipeline management works across a variety of platforms from relational to Hadoop, and recognizes today’s bidirectional data flows where any data store may function in both source and target roles.

Analytics architects, BI architects, data warehouse architects, data architects, and anyone in an architect role that intersects with data; data engineers who define, design, and develop data warehouses, data lakes, operational data stores, data sandboxes, master data hubs, or other enterprise data stores; data integration and preparation professionals who define, design, and develop the processes that move data through pathways from sources to consumers.

  • The challenges and complexities of modern data pipelines
  • Why data flow and workflow are critical parts of—and how they fit into—your analytics architecture
  • How to define and design data pipelines
  • The roles and functions of metadata in pipeline management
  • The important relationships between pipeline management and data governance
  • The state of tools and technologies to support pipeline management

Part One: Today’s Data Challenges

  • Variety and Complexity
    • Sources
    • Ingestion
    • Persistence
    • Management Topology
    • Utility
    • Use Cases
  • Time to Value
    • Storing Data
    • From Origin to Destination
    • Finding Data
    • Learning about Data
    • Data Cataloging
    • Data Preparation
    • Analysis and Communication

Part Two: Modern Data Solutions

  • Growth and Scalability
    • Data Scalability
    • Process Scalability
    • People Scalability
    • Analytic Scalability
  • Rethinking Data Architecture
    • Persistence and Topology
    • Data Flow
    • Services
    • Governance
    • What does this mean for your architecture?
  • Building for the Future
    • Future of Databases
    • Future of Data
    • Future of Analytics

Part Three: Data Pipeline Design

  • The Big Picture
    • Pipeline Components
  • Destination
    • Purpose and End Point
    • Timeliness
  • Origin
    • Data Supply and Begin Point
    • Data Type and Velocity
  • Data Flow
    • Data in Motion
    • Pipeline Boundaries
    • Blending Batch and Real Time
  • Data Storage
    • Data at Rest
    • Choosing Data Storage
  • Processing
    • Data Products and Data Value
    • Ingestion
    • Persistence
    • Transformation
    • Delivery
  • Workflow
    • Sequence of Activities
  • Monitoring
    • Pipeline Health
  • Technology
    • Pipeline Tools
    • Abundance of Tools
  • Design Summary
    • 7 Steps


Contact us for more detail about our trainings and for all other enquiries!

Upcoming Trainings

Join our public courses in our South Africa facilities. Private class trainings will be organized at the location of your preference, according to your schedule.

Classroom / Virtual Classroom
10 July 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
16 July 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
19 July 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
04 August 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
18 August 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
24 August 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
22 September 2024
Cape Town, Durban, Johannesburg
1 Day
Classroom / Virtual Classroom
04 October 2024
Cape Town, Durban, Johannesburg
1 Day
Data Pipelines: Workflow and Dataflow for Todays Data Architectures Training Course in South Africa

Formerly known as Union of South Africa, now officially known as Republic of South Africa is the Southernmost country in Africa. South Africa's population is over 60 million people, which makes the country the world's 23rd-most populous nation. South Africa has three capital cities: executive Pretoria, judicial Bloemfontein and legislative Cape Town, while the largest city is Johannesburg. The official languages of South Africa are Afrikaans, English, Ndebele, Pedi, Sotho, Swati, Tsonga, Tswana, Venda, Xhosa and Zulu.

South Africa can be rainy from November to February, so the best time to visit South Africa is from May to September. Despite the rainy season South Africa is a year-round destination, with varying regional climates. Blyde River Canyon, Durban, Drakensberg, Kruger National Park and of course, Cape Town are the tourist attractions of the country.

Expand your IT knowledge with our comprehensive range of courses, including programming, software development, business skills, data science, cybersecurity, cloud computing and virtualization. Our skilled instructors will facilitate hands-on training and share practical insights, all conveniently conducted at your preferred location within South Africa.
By using this website you agree to let us use cookies. For further information about our use of cookies, check out our Cookie Policy.