Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How is the distance between two nodes defined in Hadoop?
What are the fundamental configurations parameters specified in map reduce?
Explain the flatMap operation on Apache Spark RDD?
Differentiate between FileSink and FileRollSink?
How to create custom key and custom value in MapReduce Job?
What kind of datawarehouse application is suitable for Hive?
What is map/reduce job in hadoop?
What all tasks you can perform for managing host using Ambari host tab?
Can any impala query also be executed in hive?
What are the different collection type in Hive?
Explain Any 3 Features of HBase?
Explain what is the role of the zookeeper?
Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.
What are the advantages of DataFrame?
How many partitions are created by default in Apache Spark RDD?