Shekhar Sahu

Migrate From Hadoop To Apache Spark

Link: https://blog.joshsoftware.com/2020/04/29/migrate-from-hadoop-to-apache-spark/

Data is generated continuously, in many forms and from many different sources, and the volume keeps growing. Data at this scale is called Big Data, and storing it is a real challenge.

Hadoop became one of the most popular tools for solving this problem: it uses a distributed system to store and process the data. Now there is a newer tool, Apache Spark, which can run on top of the Hadoop Distributed File System (HDFS) and is more efficient for many workloads.
