How does Hive serialize and deserialize data?
What are the three steps involved in big data?
What are the site-specific configuration files in Hadoop?
What is MapReduce?
What is the difference between the COUNT_STAR and COUNT functions in Apache Pig?
What role does a worker node play in an Apache Spark cluster, and why does a worker node need to be registered with the driver program?
How can we see all the hosts that are available in Ambari?
What happens when the node running the map task fails before the map output has been sent to the reducer?
What are some uses of HBase?
Which HBase filter accepts the page size as a parameter?
What is Kafka?
How do I download and install Spark?
What is the difference between client mode and cluster mode in Spark?
What is tunable consistency in Cassandra?
What is fault tolerance?