Apache Spark Training Courses

Apache Spark Training

Apache Spark - an engine for big data processing training

Apache Spark Course Outlines

Code Name Duration Overview
68780 Apache Spark 14 hours Why Spark? Problems with Traditional Large-Scale Systems Introducing Spark Spark Basics What is Apache Spark? Using the Spark Shell Resilient Distributed Datasets (RDDs) Functional Programming with Spark Working with RDDs RDD Operations Key-Value Pair RDDs MapReduce and Pair RDD Operations The Hadoop Distributed File System Why HDFS? HDFS Architecture Using HDFS Running Spark on a Cluster Overview A Spark Standalone Cluster The Spark Standalone Web UI Parallel Programming with Spark RDD Partitions and HDFS Data Locality Working With Partitions Executing Parallel Operations Caching and Persistence RDD Lineage Caching Overview Distributed Persistence Writing Spark Applications Spark Applications vs. Spark Shell Creating the SparkContext Configuring Spark Properties Building and Running a Spark Application Logging Spark, Hadoop, and the Enterprise Data Center Overview Spark and the Hadoop Ecosystem Spark and MapReduce Spark Streaming Spark Streaming Overview Example: Streaming Word Count Other Streaming Operations Sliding Window Operations Developing Spark Streaming Applications Common Spark Algorithms Iterative Algorithms Graph Analysis Machine Learning Improving Spark Performance Shared Variables: Broadcast Variables Shared Variables: Accumulators Common Performance Issues
sparkdev Spark for Developers 21 hours OBJECTIVE: This course will introduce Apache Spark. The students will learn how  Spark fits  into the Big Data ecosystem, and how to use Spark for data analysis.  The course covers Spark shell for interactive data analysis, Spark internals, Spark APIs, Spark SQL, Spark streaming, and machine learning and graphX. AUDIENCE : Developers / Data Analysts Scala primer A quick introduction to Scala Labs : Getting know Scala Spark Basics Background and history Spark and Hadoop Spark concepts and architecture Spark eco system (core, spark sql, mlib, streaming) Labs : Installing and running Spark First Look at Spark Running Spark in local mode Spark web UI Spark shell Analyzing dataset – part 1 Inspecting RDDs Labs: Spark shell exploration RDDs RDDs concepts Partitions RDD Operations / transformations RDD types Key-Value pair RDDs MapReduce on RDD Caching and persistence Labs : creating & inspecting RDDs;   Caching RDDs Spark API programming Introduction to Spark API / RDD API Submitting the first program to Spark Debugging / logging Configuration properties Labs : Programming in Spark API, Submitting jobs Spark SQL SQL support in Spark Dataframes Defining tables and importing datasets Querying data frames using SQL Storage formats : JSON / Parquet Labs : Creating and querying data frames; evaluating data formats MLlib MLlib intro MLlib algorithms Labs : Writing MLib applications GraphX GraphX library overview GraphX APIs Labs : Processing graph data using Spark Spark Streaming Streaming overview Evaluating Streaming platforms Streaming operations Sliding window operations Labs : Writing spark streaming applications Spark and Hadoop Hadoop Intro (HDFS / YARN) Hadoop + Spark architecture Running Spark on Hadoop YARN Processing HDFS files using Spark Spark Performance and Tuning Broadcast variables Accumulators Memory management & caching Spark Operations Deploying Spark in production Sample deployment templates Configurations Monitoring Troubleshooting

Upcoming Courses

CourseCourse DateCourse Price [Remote / Classroom]
Apache Spark - DresdenTue, 2017-03-07 09:304530EUR / 5030EUR
Apache Spark - ErfurtTue, 2017-03-07 09:304530EUR / 4810EUR
Apache Spark - Berlin Thu, 2017-03-09 09:304530EUR / 5130EUR

Other regions

Weekend Apache Spark courses, Evening Apache Spark training, Apache Spark boot camp, Apache Spark instructor-led , Apache Spark on-site, Apache Spark trainer , Apache Spark training courses,Weekend Apache Spark training, Apache Spark coaching, Apache Spark classes, Apache Spark instructor, Apache Spark private courses, Evening Apache Spark courses

Course Discounts

Course Venue Course Date Course Price [Remote / Classroom]
Docker and Kubernetes Stuttgart Wed, 2017-02-22 09:30 3653EUR / 4303EUR
Git für Benutzer Köln Thu, 2017-03-02 09:30 891EUR / 1241EUR
Excel Data Analysis München Tue, 2017-03-21 09:30 1416EUR / 1916EUR
Python Programming Köln Tue, 2017-07-18 09:30 3285EUR / 4085EUR
Marketing Analytics using R Hannover Mon, 2017-07-31 09:30 2475EUR / 3125EUR
Forecasting with R Berlin Tue, 2017-08-08 09:30 1836EUR / 2436EUR

Course Discounts Newsletter

We respect the privacy of your email address. We will not pass on or sell your address to others.
You can always change your preferences or unsubscribe completely.

Some of our clients