Scalable Machine Learning and Related Technologies

2016, Machine Learning Using R

https://doi.org/10.1007/978-1-4842-2334-5_9

Abstract

Scalable machine learning is increasingly relevant due to advancements in infrastructure, data accessibility, and software development. This chapter discusses the importance of distributed computing in handling large datasets, emphasizes the transition from traditional algorithms to scalable solutions, and introduces key big data technologies like Apache Hadoop and Spark. It highlights how these technologies facilitate efficient data processing and computation, enabling organizations to harness the potential of big data in real-world applications.

References

  1. Download a Spark release pre-built for Hadoop 2.7 and later from http://spark.apache.org/downloads.html.
  2. Extract the files into the C:-2.0.0-bin-hadoop2.7 folder (you can choose your own location).