Online Library TheLib.net » Data algorithms recipes for scaling up with Hadoop and Spark
cover of the book Data algorithms recipes for scaling up with Hadoop and Spark

Ebook: Data algorithms recipes for scaling up with Hadoop and Spark

Author: Parsian Mahmoud

00
07.02.2024
0
0

If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You'll learn how to implement the appropriate MapReduce solution with code that you can use in your projects.

Dr. Mahmoud Parsian covers basic design patterns, optimization techniques, and data mining and machine learning solutions for problems in bioinformatics, genomics, statistics, and social network analysis. This book also includes an overview of MapReduce, Hadoop, and Spark.

Topics include:

  • Market basket analysis for a large set of transactions
  • Data mining algorithms (K-means, KNN, and Naive Bayes)
  • Using huge genomic data to sequence DNA and RNA
  • Naive Bayes theorem and...
  • Download the book Data algorithms recipes for scaling up with Hadoop and Spark for free or read online
    Read Download

    Continue reading on any device:
    QR code
    Last viewed books
    Related books
    Comments (0)
    reload, if the code cannot be seen