Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
is it necessary to install Spark on all nodes while running Spark application on Yarn?
Define the term thrift
What kind of hardware is best for hadoop?
How does hadoop achieve fault tolerance?
Explain about ACID transactions in Hive?
List down the segments of a hive question processor?
What is accumulators and broadcast variables in spark?
What is document store db?
How can you control the mapping between SQL data types and Java types?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
Name the two types of shared variable available in Apache Spark?
Does hdfs enable a customer to peruse a record, which is already opened for writing?
What jobtracker is in hadoop? What are the activities followed by hadoop?
What are the types of Apache Spark transformation?
What do you know about collaborative filtering?