#64 Apache Spark (part 1): Fundamentals of Spark Architecture & Spark Core (RDD)

Please refer to series Get started with Spark (part 1–2–3) to get to understand briefly Spark.

Apache Spark architecture

Spark is a Hadoop enhancement to MapReduce. The primary difference between Spark and MapReduce is that Spark processes and retains data in memory for subsequent steps, whereas MapReduce processes data on disk. As a result, for smaller workloads, Spark’s data processing speeds are up to 100x faster than MapReduce.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store