Why does HDFS perform replication even though it results in data redundancy?
State some disadvantages of Impala.
Explain the shuffle.
Name any two features of Flume.
Define the fold() operation in Apache Spark.
Explain the basic difference between Cassandra and HBase.
What are the basics of Hadoop?
Explain what Sqoop is in Hadoop.
Explain what the master class and the output class do.
Is Hadoop mandatory for Spark?
What must we know to work well with ZooKeeper?
What is the Spark tool?
Can we run Spark on Windows?
What do we mean by partitions or slices?
What is shuffling in MapReduce?