What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
754What mechanism does hadoop framework provides to synchronize changes made in distribution cache during runtime of the application?
709What is the functionality of jobtracker in hadoop? How many instances of a jobtracker run on hadoop cluster?
826Post New Apache Hadoop Questions
explain Metadata in Namenode?
Define a daemon?
What is nlineoutputformat?
What do the master class and the output class do?
Is map like a pointer?
What are different hdfs dfs shell commands to perform copy operation?
What is unstructured data?
What is a checkpoint?
Why do we use Hadoop?
what are the steps involved in decommissioning removing
What are the characteristics of hadoop framework?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
What is a “Distributed Cache” in Apache Hadoop?
How blocks are distributed among all data nodes for a particular chunk of data?
What do you know about keyvaluetextinputformat?