After increasing the replication level, I still see that data is under-replicated. What could be wrong?
The web UI shows that half of the DataNodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
Hadoop General Questions
Can we deploy the JobTracker on a node other than the NameNode?
Why do we need Hadoop for big data analytics?
What are the restrictions on the key and value classes?
What is JPS? Why is it used in Hadoop?
What are some examples of columnar databases?
What are the most common InputFormats in Hadoop?
How will you write a custom partitioner for a Hadoop job?
If the number of DataNodes increases, do we need to upgrade the NameNode?
How do Hadoop 2 and Hadoop 3 compare?
What is the procedure for NameNode recovery?
What are 'slaves' and 'masters' in Hadoop?
Hadoop achieves parallelism by dividing tasks across many nodes, so a few slow nodes can rate-limit the rest of the program and slow it down. What mechanism does Hadoop provide to combat this?
If we want to copy 10 blocks from one machine to another, but the other machine can copy only 8.5 blocks, can a block be split during replication?
Can you explain the RecordReader?
What does the file hadoop-metrics.properties do?