Spark stores data in memory, thus running MapReduce
operations much faster than Hadoop, which stores that on disk. Also it has
command line interfaces in Scala, Python, and R. And it includes a machine
learning library, Spark ML, that is developed by the Spark project and not
separately, like Mahout.
No comments:
Post a Comment