Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is shuffling in mapreduce?
How to start a kafka server?
What combiners are and when you should utilize a combiner in a map reduce job?
Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?
What is zookeeper in hadoop?
What are the differences between relational databases and impala?
What are the two main parts of the hadoop framework?
Explain the process that overwrites the replication factors in HDFS?
Define catalog tables in HBase?
Is Mapreduce Required For Impala? Will Impala Continue To Work As Expected If Mapreduce Is Stopped?
Explain how can you minimize data transfers when working with spark?
What is winutils hadoop?
What are the areas where ambari helps the system administrators to do?
Mention the difference between hbase and relational database?
What is decorating filters?