Scaling Machine Learning with Spark

Distributed ML with MLlib, TensorFlow, and PyTorch

Author: Adi Polak

Save 10%
MRP: 1,500.00
You Pay: 1,350.00
You save: 150.00
Shipping: Usually ships in 2 days
In stock

Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals, allowing data and ML practitioners to collaborate and understand each other better.

Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.

You will:

  • Explore machine learning, including distributed computing concepts and terminology
  • Manage the ML lifecycle with MLflow
  • Ingest data and perform basic preprocessing with Spark
  • Explore feature engineering, and use Spark to extract features
  • Train a model with MLlib and build a pipeline to reproduce it
  • Build a data system to combine the power of Spark with deep learning
  • Get a step-by-step example of working with distributed TensorFlow
  • Use PyTorch to scale machine learning, and learn about its internal architecture

About the Author

Adi Polak is an open source technologist who believes in communities and education, and their ability to positively impact the world around us. She is passionate about building a better world through open collaboration and technological innovation. As a seasoned engineer and Vice President of Developer Experience at Treeverse, Adi shapes the future of data and ML technologies for hands-on builders. She serves on multiple program committees and acts as an advisor for conferences like Data & AI Summit by Databricks, Current by Confluent, and Scale by the Bay, among others. Adi previously served as a senior manager for Azure at Microsoft, where she helped build advanced analytics systems and modern data architectures. Adi gained experience in machine learning by conducting research for IBM, Deutsche Telekom, and other Fortune 500 companies.

TOC (9789355429766_toc.pdf, 70 Kb) [Download]
