Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is a rack awareness algorithm and why is it used in hadoop?
Does Hadoop requires RAID?
Write a short note on the disadvantages of mapreduce
Is spark part of hadoop ecosystem?
How to exit the vi editor?
Define "PageRank".
Explain the concept of resilient distributed dataset (rdd).
how indexing in HDFS is done?
What is rdd partition?
How can Spark be connected to Apache Mesos?
How much is flume worth?
List of some best tools that can be useful for data-analysis?
What are the various configuration parameters required to run a mapreduce job?
What do spark executors manage?
Why do we need a password-less ssh in fully distributed environment?