Can you explain the benefits of big data?
State the difference between the persist() and cache() functions.
How to set the number of mappers for a MapReduce job?
Does HDFS allow a client to read a file that is already open for writing in Hadoop?
Clarify the difference between NAS and HDFS.
How many Reducers run for a MapReduce job?
What does the text input format do?
What is an SSTable?
What are barriers?
Why should we use the ‘order by’ keyword in Pig scripts?
Explain the PostgreSQL storage handler.
What is the role of ZooKeeper in a Kafka cluster?
What does in-memory processing mean in Spark?
What is rack awareness?
Explain the memtable in Cassandra.