Compare Apache Hadoop and Apache Spark?



Compare Apache Hadoop and Apache Spark?..

Answer / Sumit Kumar Sharma

Apache Hadoop and Apache Spark are both open-source big data processing frameworks, but they have some key differences. Hadoop is a MapReduce-based framework used primarily for batch processing, while Spark offers faster in-memory data processing capabilities for both batch and real-time data processing tasks. Spark also has built-in support for stream processing (Spark Streaming) and machine learning (MLlib), whereas Hadoop requires additional libraries to perform these functions.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Hadoop General Interview Questions

What are configuration files in Hadoop?

1 Answers  


Can you define a checkpoint?

1 Answers  


What mode(s) can hadoop code be run in?

1 Answers  


Can we deploye job tracker other than name node?

1 Answers  


Explain the common input formats in hadoop?

1 Answers  


What do you know about yarn?

1 Answers  


Is Namenode machine same as DataNode machine as in terms of hardware in Hadoop?

1 Answers  


Why do the nodes are removed and added frequently in a hadoop cluster?

1 Answers  


How analysis of Big Data is useful for organizations?

1 Answers  


How can one check whether NameNode is working or not?

1 Answers  


Whats the default port that jobtrackers listens ?

1 Answers  


What is the command to change the replication factor ?

1 Answers  


Categories