Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How does Facebook use Hadoop, Hive, and HBase?
What are the different life cycle commands in Ambari?
How does the JobTracker schedule a task?
What is a Flume agent?
Why is Cloudera used?
Is it possible to write Hadoop job output to multiple directories? If yes, how?
Explain the top() and takeOrdered() operations.
Which bit version does Ambari require, and which operating systems are compatible with it?
How can we check whether the NameNode is working?
I have a relation r. How can I get the top 10 tuples from the relation r?
How will you implement SQL in Spark?
What is a Bloom filter?
What is the optimal size of a file for the distributed cache?
In MapReduce, is sorting done on the mapper node or the reducer node?
What is a slot in Hadoop MapReduce v1?