What type of data should we put in the distributed cache? When should we put the data in the distributed cache? How much data should we put in?
Hadoop General Questions
What are the input format, input split, and record reader, and what do they do?
What are the side effects of not running a secondary name node?
What is meant by NameNode High Availability in Hadoop?
What is rack awareness in Hadoop?
What is configured in /etc/hosts and what is its role in setting up a Hadoop cluster?
What does the jps command do in Hadoop?
What is Sqoop in Hadoop?
List some common problems faced by data analysts.
Can Hadoop handle streaming data?
What is the best practice for deploying the secondary name node?
Give some examples of unstructured data.
How do ‘map’ and ‘reduce’ work?
What is the difference between the NameNode and a DataNode in Hadoop?
What happens if the number of reducers is 0?
Is it possible to search for files using wildcards?