We can host this training at your preferred location. Contact us!
14 April 2021
This training course is designed for developers who need to create real-time applications to ingest and process streaming data sources using Hortonworks Data Platform (HDP) and Hortonworks Data Flow (HDF) environments. Specific technologies covered includes: Apache Hadoop, Apache Kafka, Apache Storm & Trident, Apache Spark and Apache HBase as well as Apache NiFi. The highlight of the course is the custom workshop-styled labs that will allow participants to build streaming applications with Storm and Spark Streaming.
Students should be familiar with programming principles and have experience in software development. Java programming experience is required. SQL and light scripting knowledge is also helpful. No prior Hadoop knowledge is required.
Developers and data engineers who need to understand and develop real-time / streaming applications on HDP and HDF.
Real-time architecture & overview of the class
Identify the relevant HDP/HDF components
Spark ecosystem overview
Pair RDD Programming
Building Storm Topologies
Advanced Storm Features
Introduction to HDF/NiFi
Using HDFS Commands
Introduction to SPARK REPLs and Zeppelin
Create and Manipulate RDDs
Create and Manipulate Pair RDDs
Spark Streaming Using HDFS Directories and TCP Sockets
Spark Streaming Transformations
Spark Streaming Window Transformations
Integrating Storm with Kafka
Storm Workshop with Kafka and HBase
NiFi User Interface
Building a NiFi Data Flow
Remote Processor Group
Join our public courses in our Istanbul, London and Ankara facilities. Private class trainings will be organized at the location of your preference, according to your schedule.