RiseUpp Logo
Educator Logo

Leveraging Unstructured Data with Dataproc on GCP

Master Dataproc for big data analysis on the Google Cloud Platform. Learn Hadoop, Spark & ML integration.

Master Dataproc for big data analysis on the Google Cloud Platform. Learn Hadoop, Spark & ML integration.

This intensive course builds on the Data Engineering on Google Cloud Platform specialization, teaching you to create and manage compute clusters for Hadoop, Spark, Pig, and Hive jobs on Google Cloud. Through video lectures, demos, and hands-on labs, you'll learn to access cloud storage options from compute clusters and integrate Google's machine learning capabilities into your analysis programs. Practical labs cover creating and managing Dataproc clusters, running Spark and Pig jobs, and using iPython notebooks with BigQuery and storage integration.

4.6

(13 ratings)

4 already enrolled

English

Deutsch, Português (Brasil), 日本語, 2 more

Powered by

Provider Logo
Leveraging Unstructured Data with Dataproc on GCP

This course includes

10 Hours

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

2,435

Audit For Free

What you'll learn

  • Criar e gerenciar clusters do Dataproc usando o console da Web e a CLI

  • Executar jobs do Hadoop, Spark, Pig e Hive no Google Cloud Platform

  • Acessar opções de armazenamento em nuvem a partir de clusters de computação

  • Integrar recursos de machine learning do Google aos programas de análise

  • Utilizar notebooks iPython integrados ao BigQuery e ao armazenamento em nuvem

  • Otimizar clusters usando tipos de máquina personalizados e VMs preemptivas

Skills you'll gain

dataproc
hadoop
spark
big data
machine learning
cloud computing
data analysis
python

This course includes:

162 Minutes PreRecorded video

4 quizzes

Access on Mobile, Tablet, Desktop

FullTime access

Shareable certificate

Closed caption

Get a Completion Certificate

Share your certificate with prospective employers and your professional network on LinkedIn.

Created by

Provided by

Certificate

Top companies offer this course to their employees

Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.

icon-0icon-1icon-2icon-3icon-4

There are 4 modules in this course

This course focuses on leveraging unstructured data with Cloud Dataproc on Google Cloud Platform. Students will learn to create and manage compute clusters for running Hadoop, Spark, Pig, and Hive jobs in the cloud. The curriculum covers accessing various cloud storage options from compute clusters and integrating Google's machine learning capabilities into analysis programs. Through hands-on labs, participants will gain practical experience in creating and managing Dataproc clusters, running Spark and Pig jobs, and using iPython notebooks integrated with BigQuery and cloud storage. The course emphasizes the separation of storage and computation in cloud-based big data processing and introduces concepts like preemptive VMs and custom machine types for optimizing cluster performance.

Módulo 1: introdução ao Cloud Dataproc

Module 1 · 2 Hours to complete

Módulo 2: como executar jobs do Dataproc

Module 2 · 3 Hours to complete

Módulo 3: como usar o GCP

Module 3 · 3 Hours to complete

Módulo 4: como analisar dados não estruturados

Module 4 · 1 Hours to complete

Fee Structure

Instructor

Google Cloud Training
Google Cloud Training

4.7 rating

86 Reviews

26,85,892 Students

1,729 Courses

Empowering Businesses with Expert Training from Google Cloud

The Google Cloud Training team is tasked with developing, delivering, and evaluating training programs that enable our enterprise customers and partners to effectively utilize our products and solutions. Google Cloud empowers millions of organizations to enhance employee capabilities, improve customer service, and innovate for the future using cutting-edge technology built specifically for the cloud. Our products are designed with a focus on security, reliability, and scalability, covering everything from infrastructure to applications, devices, and hardware. Our dedicated teams are committed to helping customers successfully leverage our technologies to drive their success.

Leveraging Unstructured Data with Dataproc on GCP

This course includes

10 Hours

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

2,435

Audit For Free

Testimonials

Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.

4.6 course rating

13 ratings

Frequently asked questions

Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.