Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Does spark run mapreduce?
What is RDD?
What is Apache Spark and what are the benefits of Spark over MapReduce?
How does gossip protocol work?
Is reduce-only job possible in Hadoop MapReduce?
How a task is scheduled by a jobtracker?
What are the major features/characteristics of rdd (resilient distributed datasets)?
What is SparkSession in Apache Spark?
What if rack 2 and datanode fails?
how can we change Replication Factor?
How to load data in pig?
What are the port numbers of job tracker?
What is bloom filter?
What is apache mahout?
Where is the Mapper Output stored?