HDP Developer: Enterprise Apache Spark I Training in United States of America

  • Learn via: Classroom / Virtual Classroom / Online
  • Duration: 4 Days
  • Price: Please contact for booking options
We can host this training at your preferred location. Contact us!

This course is designed as an entry point for developers who need to create applications that analyze Big Data stored in Apache Hadoop using Spark. Topics include an overview of the Hortonworks Data Platform (HDP), including HDFS and YARN; using Spark Core APIs for interactive data exploration; Spark SQL and DataFrame operations; Spark Streaming and DStream operations; data visualization, reporting, and collaboration; performance monitoring and tuning; building and deploying Spark applications; and an introduction to the Spark Machine Learning Library.

Students should be familiar with programming principles and have previous experience in software development using either Python or Scala. Previous experience with data streaming, SQL, and HDP is also helpful, but not required.

This course is intended for software engineers who want to develop in-memory applications for time-sensitive and highly iterative workloads in an enterprise HDP environment.

At the completion of the course students will be able to:

  • Describe Hadoop, HDFS, YARN, and the HDP ecosystem
  • Describe Spark use cases
  • Explore and manipulate data using Zeppelin
  • Explore and manipulate data using a Spark REPL
  • Explain the purpose and function of RDDs
  • Employ functional programming practices
  • Perform Spark transformations and actions
  • Work with Pair RDDs
  • Perform Spark queries using Spark SQL and DataFrames
  • Use Spark Streaming stateless and window transformations
  • Visualize data, generate reports, and collaborate using Zeppelin
  • Monitor Spark applications using Spark History Server
  • Learn general application optimization guidelines/tips
  • Use data caching to increase performance of applications
  • Build and package Spark applications
  • Deploy applications to the cluster using YARN
  • Understand the purpose of Spark MLlib

Hands-on labs in this course:

  • Use common HDFS commands
  • Use a REPL to program in Spark
  • Use Zeppelin to program in Spark
  • Perform RDD transformations and actions
  • Perform Pair RDD transformations and actions
  • Utilize Spark SQL
  • Perform stateless transformations using Spark Streaming
  • Perform window-based transformations
  • Use Zeppelin for data visualization and reporting
  • Monitor applications using Spark History Server
  • Cache and persist data
  • Configure checkpointing, broadcast variables, and executors
  • Build and submit a Spark application to YARN
  • Run Spark MLlib applications 


Contact us for more detail about our trainings and for all other enquiries!

Upcoming Trainings

Join our public courses in our United States of America facilities. Private class trainings will be organized at the location of your preference, according to your schedule.

Classroom / Virtual Classroom
26 November 2024
United States of America
4 Days
Classroom / Virtual Classroom
27 November 2024
United States of America
4 Days
Classroom / Virtual Classroom
11 March 2025
United States of America
4 Days
Classroom / Virtual Classroom
20 March 2025
United States of America
4 Days
Classroom / Virtual Classroom
24 March 2025
United States of America
4 Days
HDP Developer: Enterprise Apache Spark I Training Course in the United States

The United States of America (USA) is a country in North America and a federal republic of 50 states. At almost 9.8 million square kilometers, the United States is one of the world’s biggest and most populous countries. Its capital is Washington, D.C., and its well-known cities include New York, Los Angeles, Miami, Chicago, Orlando, Las Vegas, Dallas, San Francisco and Kansas City.

The country’s most iconic symbol is probably the Statue of Liberty in New York, which was a gift from France. Although English is the most widely used language in the United States, there is no official language. Independent since July 4, 1776, the USA has the motto “In God We Trust”, and its current president is Joe Biden. Some of the best places to visit in the United States are the Grand Canyon, Yosemite, Maui, New Orleans, Honolulu, Zion National Park, Kauai, Lake Tahoe, Aspen, Big Sur and Santa Fe.

Achieve your IT goals through our versatile courses, spanning programming, data analytics, software development, business skills, cloud computing, cybersecurity, and project management. Benefit from the flexibility of hosting training at your preferred location within the United States, where our experienced instructors will provide hands-on learning and practical expertise.