How does the NameNode handle DataNode failures in Hadoop?
Answer / Neeraj Rastogi
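When a DataNode fails, the NameNode stops receiving its heartbeats, which each DataNode sends every 3 seconds by default. Once no heartbeat has arrived for the timeout period (10 minutes 30 seconds with default settings), the NameNode marks the DataNode as dead. Blocks that had a replica on that node are now under-replicated; a block counts as missing only if no live replica remains. The NameNode then schedules re-replication of the under-replicated blocks from the surviving replicas to other live DataNodes until each block is back at its target replication factor. The NameNode also stops returning the dead DataNode in block locations, so clients read from the remaining replicas, and new writes are placed on live DataNodes.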
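The detection and re-replication logic can be sketched in a few lines of Python. This is only an illustrative model, not Hadoop code: the function names (dead_node_timeout_s, find_dead_nodes, under_replicated_blocks) and the sample node and block IDs are hypothetical. The one thing taken from HDFS is the timeout formula, 2 x dfs.namenode.heartbeat.recheck-interval + 10 x dfs.heartbeat.interval, with the default values of 300000 ms and 3 s.

```python
# Illustrative model of NameNode failure detection; not the Hadoop implementation.

HEARTBEAT_INTERVAL_S = 3        # default of dfs.heartbeat.interval (seconds)
RECHECK_INTERVAL_MS = 300_000   # default of dfs.namenode.heartbeat.recheck-interval (ms)


def dead_node_timeout_s(heartbeat_interval_s=HEARTBEAT_INTERVAL_S,
                        recheck_interval_ms=RECHECK_INTERVAL_MS):
    """Seconds without a heartbeat before a DataNode is considered dead."""
    return 2 * recheck_interval_ms / 1000 + 10 * heartbeat_interval_s


def find_dead_nodes(last_heartbeat, now, timeout_s):
    """Return the set of nodes whose last heartbeat is older than timeout_s."""
    return {node for node, ts in last_heartbeat.items() if now - ts > timeout_s}


def under_replicated_blocks(block_locations, dead_nodes, replication=3):
    """Map each block below its target replication to its surviving replicas.

    An empty list means no live replica remains, i.e. the block is missing.
    """
    result = {}
    for block_id, nodes in block_locations.items():
        live = [n for n in nodes if n not in dead_nodes]
        if len(live) < replication:
            result[block_id] = live
    return result


# Hypothetical cluster state: dn2 last reported 800 seconds ago.
last_heartbeat = {"dn1": 1000.0, "dn2": 200.0, "dn3": 990.0}
dead = find_dead_nodes(last_heartbeat, now=1000.0, timeout_s=dead_node_timeout_s())

blocks = {
    "blk_1": ["dn1", "dn2", "dn3"],   # loses one replica, needs re-replication
    "blk_2": ["dn1", "dn3", "dn4"],   # unaffected
    "blk_3": ["dn2"],                 # only replica was on dn2, now missing
}
todo = under_replicated_blocks(blocks, dead)
```

With these inputs, dn2 is the only dead node, blk_1 is under-replicated with two surviving replicas, and blk_3 has no live replica left. In a real cluster the NameNode would copy blk_1 from dn1 or dn3 to another live DataNode, while blk_3 could not be recovered until dn2 returned.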
The web UI shows that half of the DataNodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
What are the four basic parameters of a mapper?
Why would NoSQL be better than a SQL database? And how much better is it?
Why does one remove or add nodes in a Hadoop cluster frequently?
After the Map phase finishes, the Hadoop framework performs partitioning, shuffle, and sort. What happens in this phase?
Does Hadoop follow the UNIX pattern?
Which object can be used to get the progress of a particular job?
How do you add a node to, or remove a node from, an existing cluster?
Who are ‘Data Scientists’?
What is the difference between an InputSplit and a block?
What is Mapper in Hadoop?
When and how do you create a Hadoop archive?