Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Is it possible to split 100 lines of input as a single split in MapReduce?
What is HBaseFsck class?
What is a pipelinedrdd?
How to start a kafka server?
What do you know about yarn?
Name some independent extensions that contribute to the Ambari codebase?
What is a bag in pig?
What happen if one of the datanodes has much slower cpu?
Whenever we run hive query, new metastore_db is created. Why?
explain the use of blinkdb?
What are the other components of Cassandra?
Are spark dataframes distributed?
How can you add a new partition for the month December in the above partitioned table?
How to use hdfs put command for data transfer from flume to hdfs?
What's rdd?