How is hadoop different from spark?
Answer / Kamlesh Bhumarkar
Hadoop is a distributed processing framework that uses MapReduce for batch processing, whereas Spark is a general-purpose cluster computing system providing high-level APIs for DataStreaming and batch processing as well as machine learning and graph processing. Spark runs faster than Hadoop due to its in-memory data processing.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is Immutable?
What do you understand by receivers in Spark Streaming ?
What is the difference between spark and apache spark?
On which all platform can Apache Spark run?
What are the major features/characteristics of rdd (resilient distributed datasets)?
What is RDD?
Who uses apache spark?
How can we create RDD in Apache Spark?
How does executor work in spark?
Explain Spark countByKey() operation?
Define parquet file format? How to convert data to parquet format?
Why is apache spark so fast?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)