Master Dataproc for big data analysis on the Google Cloud Platform. Learn Hadoop, Spark & ML integration.
Master Dataproc for big data analysis on the Google Cloud Platform. Learn Hadoop, Spark & ML integration.
This intensive course builds on the Data Engineering on Google Cloud Platform specialization, teaching you to create and manage compute clusters for Hadoop, Spark, Pig, and Hive jobs on Google Cloud. Through video lectures, demos, and hands-on labs, you'll learn to access cloud storage options from compute clusters and integrate Google's machine learning capabilities into your analysis programs. Practical labs cover creating and managing Dataproc clusters, running Spark and Pig jobs, and using iPython notebooks with BigQuery and storage integration.
4.6
(13 ratings)
4 already enrolled
Instructors:
English
Deutsch, Português (Brasil), 日本語, 2 more
What you'll learn
Criar e gerenciar clusters do Dataproc usando o console da Web e a CLI
Executar jobs do Hadoop, Spark, Pig e Hive no Google Cloud Platform
Acessar opções de armazenamento em nuvem a partir de clusters de computação
Integrar recursos de machine learning do Google aos programas de análise
Utilizar notebooks iPython integrados ao BigQuery e ao armazenamento em nuvem
Otimizar clusters usando tipos de máquina personalizados e VMs preemptivas
Skills you'll gain
This course includes:
162 Minutes PreRecorded video
4 quizzes
Access on Mobile, Tablet, Desktop
FullTime access
Shareable certificate
Closed caption
Get a Completion Certificate
Share your certificate with prospective employers and your professional network on LinkedIn.
Created by
Provided by

Top companies offer this course to their employees
Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.





There are 4 modules in this course
This course focuses on leveraging unstructured data with Cloud Dataproc on Google Cloud Platform. Students will learn to create and manage compute clusters for running Hadoop, Spark, Pig, and Hive jobs in the cloud. The curriculum covers accessing various cloud storage options from compute clusters and integrating Google's machine learning capabilities into analysis programs. Through hands-on labs, participants will gain practical experience in creating and managing Dataproc clusters, running Spark and Pig jobs, and using iPython notebooks integrated with BigQuery and cloud storage. The course emphasizes the separation of storage and computation in cloud-based big data processing and introduces concepts like preemptive VMs and custom machine types for optimizing cluster performance.
Módulo 1: introdução ao Cloud Dataproc
Module 1 · 2 Hours to complete
Módulo 2: como executar jobs do Dataproc
Module 2 · 3 Hours to complete
Módulo 3: como usar o GCP
Module 3 · 3 Hours to complete
Módulo 4: como analisar dados não estruturados
Module 4 · 1 Hours to complete
Fee Structure
Instructor
Empowering Businesses with Expert Training from Google Cloud
The Google Cloud Training team is tasked with developing, delivering, and evaluating training programs that enable our enterprise customers and partners to effectively utilize our products and solutions. Google Cloud empowers millions of organizations to enhance employee capabilities, improve customer service, and innovate for the future using cutting-edge technology built specifically for the cloud. Our products are designed with a focus on security, reliability, and scalability, covering everything from infrastructure to applications, devices, and hardware. Our dedicated teams are committed to helping customers successfully leverage our technologies to drive their success.
Testimonials
Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.
4.6 course rating
13 ratings
Frequently asked questions
Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.