Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
If reducers do not start before all mappers finish then why does the progress on mapreduce job shows something like map(50%) reduce(10%)? Why reducers progress percentage is displayed when mapper is not finished yet?
Elaborate on Identifiers?
How to add/delete a Node to the existing cluster?
What is the procedure for namenode recovery?
What is speculative execution in Hadoop?
What is the difference between hadoop and other data processing tools?
List out the some common problems faced by data analyst?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
How to keep files in HDFS?
Explain sortbykey() operation?
What are input format, input split & record reader and what they do?
What are the functionalities of jobtracer?
What is sqoop and flume?
What are the main configuration parameters in a MapReduce program?