Why do we use HDFS for applications with large data sets and not when there are a lot of small files?
How will you merge the contents of two or more relations and divide a single relation into two or more relations?
What are combiners? When should I use a combiner in my MapReduce Job?
State the usage of the 'filter', 'group', 'order by', and 'distinct' keywords in Pig scripts.
What is the stable version of Hive?
What does conf.setMapperClass do?
What is Hive query processor?
What happens when a datanode fails?
What is the process to change the files at arbitrary locations in HDFS?
What is your favourite tool in the Hadoop ecosystem?
What is a block and block scanner in HDFS?
What is the HDFS block size, and what did you choose in your project?
What is the utility of using a custom WritableComparable class in MapReduce code?
What are the different operational commands in HBase at record level and table level?
What is the communication channel between client and namenode/datanode?
How can you override the replication factor in HDFS?