Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Define "Action" in Spark
Do we need to install scala for spark?
Where can the metastore database be hosted?
What do you understand by mem-table in cassandra?
Define primary key in Apache Cassandra?
What is the relationship between hdfs, hbase, pig, hive and azkaban?
What are the most common InputFormats in Hadoop?
What is sink processors?
What are the main classes of Data Transfer API?
What is the default replication factor in Hadoop and how will you change it?
What is rdd partition?
How can you prevent a large job from running for a long time? What do u think is more popular among the developers - Pig or Hive?
What is the role of the ZooKeeper in Kafka?
What are the views in Hive?
What is the biggest shortcoming of Spark?