After the Map phase finishes, the Hadoop framework performs 'Partitioning, Shuffle and Sort'. Explain what happens in this phase.
Hadoop achieves parallelism by dividing tasks across many nodes, so a few slow nodes can rate-limit the rest of the program and slow it down. What mechanism does Hadoop provide to combat this?
Why do we use IntWritable instead of int? Why do we use LongWritable instead of long?
What are the different commands used to start up and shut down Hadoop daemons?
What does secondary name-node mean?
How do you handle bad records during parsing?
What is throughput in Hadoop?
Explain what storage and compute nodes are.
The web UI shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
Can you explain edge nodes in Hadoop?
What does the high availability of a name-node mean? How is it accomplished?
What are the port numbers of the JobTracker?
What does block mean?
Have you ever used counters in Hadoop?
Is the NameNode machine the same as the DataNode machine in terms of hardware in Hadoop?
What is a Combiner in Hadoop?
What is the default port that the JobTracker listens on?