Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the components of a Hive query processor?
How will you calculate the number of executors required to do real-time processing using Apache Spark? What factors need to be considered for deciding on the number of nodes for real-time processing?
How the HDFS Blocks are replicated?
How hadoop mapreduce works?
Explain how RDDs work with Scala in Spark
What are the uses of explode hive?
What are the usage of different consistency levels for write operations ?
What are the exact differences between reduce and fold operation in Spark?
How can you set an arbitrary number of Reducers to be created for a job in Hadoop?
What is non-dfs used in hdfs web console
What is the difference between hbase and hadoop/hdfs?
Compare Hadoop and Spark?
Name the components of spark ecosystem.
Explain about ACID transactions in Hive?
What do you mean by ss table and explain how it is different from the other original tables?