SkillSoft Explore Course

Apache Kafka integrates with Apache Spark, which lets you process data published to Kafka with Spark. In this course, you will learn how to integrate Kafka with Spark.

Objectives

Spark Integration

  • start the course
  • install and configure the Spark Streaming package for Kafka
  • read data into Spark from Kafka
  • read data in parallel into Spark from Kafka
  • write data back to Kafka from Spark
  • write data back to Kafka from Spark in parallel
  • create a direct stream to access Kafka data from Spark
  • use LocationStrategies and ConsumerStrategies to improve performance
  • use an RDD in cases where batch processing would be a better solution
  • use offsets to achieve exactly-once semantics
  • use Kafka and Spark to split words from sentences
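
To give a feel for the last two objectives, here is a minimal sketch in plain Python. It does not use the Spark or Kafka APIs. The (offset, sentence) record layout, the `state` dictionary, and the function names `split_words` and `process_batch` are assumptions made for this illustration only. It simulates splitting sentences into words and tracking offsets so that a replayed batch is not counted twice.

```python
from collections import Counter


def split_words(sentence):
    """Split a sentence into lowercase words on whitespace."""
    return sentence.lower().split()


def process_batch(records, state):
    """Count words from (offset, sentence) records.

    `state` is a hypothetical store holding the running word counts and
    the next offset expected. Records whose offset is below
    `next_offset` were already processed, so they are skipped.
    """
    for offset, sentence in records:
        if offset < state["next_offset"]:
            continue  # replayed record, for example after a restart
        state["counts"].update(split_words(sentence))
        state["next_offset"] = offset + 1
    return state


# Simulated records, standing in for messages read from one partition.
state = {"counts": Counter(), "next_offset": 0}
batch = [(0, "Kafka feeds Spark"), (1, "Spark splits words")]

process_batch(batch, state)
process_batch(batch, state)  # the same batch again: every record is skipped
```

In a real Spark job the counts and the offsets would have to be stored together atomically, for example in one database transaction, for the result to be exactly-once. The in-memory dictionary here only stands in for that store.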