Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Name the two types of shared variable available in Apache Spark?
When we send a data to a node, do we allow settling in time, before sending another data to that node?
What is the role of “ambari-qa” user?
What are the advantages of kafka?
Mention some use cases of apache mahout?
Define the term ‘sparse vector.’
What are the different clustering in mahout?
On what basis name node distribute blocks across the data nodes?
What are the different zkclientbindings?
What is a mapreduce algorithm?
What is the method to create a data frame?
Explain the key features of hdfs?
What are apache tajo sql functions?
Explain about the different types of transformations on DStreams?
Explain when to use explode in Hive?