Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the Physical plan in pig architecture?
What will you do when NameNode is down?
What do you understand about yarn?
What is the difference between python and spark?
What are the different tasks we can perform managing host using ambari host tab?
On which hosts does impala run?
Explain how do “reducers” communicate with each other?
What is the biggest shortcoming of Spark?
Compare hadoop & spark?
Explain different transformations in DStream in Apache Spark Streaming?
Describe Network Topology Strategy?
Is Apache Kafka is a distributed streaming platform? if yes, what you can do with it?
What are the different Primitive Data Types available in Hive?
what are the main configuration parameters that user need to specify to run Mapreduce Job ?
What is Immutable?