How is Hadoop cost-effective?
What is a DataNode?
What is the NameNode in Hadoop?
Why do we need compression, and what are the different compression formats supported?
Please explain the sparse vector in Spark.
What is the difference between a Reducer and a Combiner in Hadoop MapReduce?
What is a bookie in BookKeeper?
What happens if there is an error in Impala?
Can you explain how ‘map’ and ‘reduce’ work?
How do you parse data in XML? Which kind of class do you use with Java to parse data?
What is the concept of SuperColumn in Cassandra?
What is the difference between a Dataset and a DataFrame?
What is a partitioned table in Hive?
What is the difference between an RDD and a DataFrame?
What are some typical functions of the JobTracker?