Data Algorithms with Spark
Specification: Data Algorithms with Spark
Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support make practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark. In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms.
Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script.
With this book, you will:
- Learn how to select Spark transformations for optimized solutions
- Explore powerful transformations and reductions including reduceByKey(),