SkillSoft Explore Course

Dataproc integrates with several platforms, including Pig and Hive, to perform a range of operations. This course digs further into Dataproc architecture and introduces the use of Pig and Hive.

Objectives

Continued Study of Cluster Management

  • start the course
  • describe how to create a cluster with the Dataproc CLI
  • recognize implementations using the Dataproc REST API

Architecture and Machine Types

  • describe the various Dataproc architecture types in GCP and common use cases
  • define Dataproc machine types and their uses
  • configure a custom machine type

Dataproc Jobs

  • describe how and when to execute Dataproc jobs
  • recognize connections between Apache Hadoop HDFS and Cloud Storage

Pig and Hive

  • describe the use of Pig and Hive
  • configure and execute a job using Pig and Hive with Dataproc

Practice: Dataproc Implementations

  • recall concepts of Dataproc jobs, including implementation of Pig and Hive