Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the best way to copy files between HDFS clusters?
What is anti-entropy and how is it associated with merkel tree?
How can you control the mapping between SQL data types and Java types?
What is difference between Column and Super Column?
What is Sqoop?
Clarify what is shuffling in map reduce?
What is pig latin statements?
What are some of the apache pig use cases you can think of?
What is a dstream in apache spark?
what are the steps involved in decommissioning removing
Elaborate kafka architecture?
Explain about trformations and actions in the context of rdds?
What is apache flume used for?
When should you use sequencefileinputformat?
What is hive metastore?