Dataproc Cookbook

Running Spark and Hadoop Workloads in Google Cloud

(Author) Narasimha Sadineni
Format: Paperback
£63.99 Price: £62.07 (3% off)
In Stock
Generally dispatched in 1 to 2 days

Get up to speed with Dataproc, the fully managed and highly scalable service for running open source big data tools and frameworks, including Hadoop, Spark, Flink, and Presto. This cookbook shows data engineers, data scientists, data analysts, and cloud architects how to use Dataproc, integrated with Google Cloud, for data lake modernization, ETL, and secure data science at a fraction of the cost. Narasimha Sadineni from Google and former Googler Anu Venkataraman show you how to set up and run Hadoop and Spark jobs on Dataproc. You'll learn how to create Dataproc clusters and run data engineering and data science workloads in long-running, ephemeral, and serverless ways. In the process, you'll gain an understanding of Dataproc, orchestration, logging and monitoring, Spark History Server, and migration patterns. This cookbook includes hands-on examples for configuring, logging, securing clusters, and migrating from on-prem to Dataproc. You'll learn how to: Create Dataproc clusters on Compute Engine and Kubernetes Engine Run data science workloads on Dataproc Execute Spark jobs on Dataproc Serverless Optimize Dataproc clusters to be cost effective and performant Monitor Spark jobs in various ways Orchestrate various workloads and activities Use different methods for migrating data and workloads from existing Hadoop clusters to Dataproc

Information
Publisher:
O'Reilly Media
Format:
Paperback
Number of pages:
None
Language:
en
ISBN:
9781098157708
Publish year:
2025
Publish date:
June 17, 2025

Narasimha Sadineni

Reviews

Leave a review

Please login to leave a review.

Be the first to review this product

Other related

The New Age of Sexism

The New Age of Sexism

How the AI Revolution is Reinventing Misogyny

Laura Bates
Paperback
Published: 2026
Where the Axe is Buried

Where the Axe is Buried

Ray Nayler
Paperback
Published: 2026
Love Machines

Love Machines

How Artificial Intelligence is Transforming Our Relationships

James Muldoon
Paperback
Published: 2026
Dark AI - Shadows of Tomorrow

Dark AI - Shadows of Tomorrow

Clara Rodriquez
Paperback
Published: 2026
The AI Paradox

The AI Paradox

How to Make Sense of a Complex Future

Virginia Dignum
Hardcover
Published: 2026
AI Ink.

AI Ink.

Writing, Publishing, and Misinformation at the Dawn of the AI Age

Jason Van Tatenhove
Hardcover
Published: 2026