How blocks are distributed among all data nodes for a particular chunk of data?
Answer / Yogendra Kumar
In Apache Hadoop, when a block of data is added to the system, it is broken down into smaller chunks (64 MB by default) and distributed across multiple DataNodes. The choice of which DataNode receives each chunk is based on the replica placement policy.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is the default block size in hdfs?
What is HDFS Federation?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
What is Apache Hadoop?
How to resolve ioexception: cannot create directory, while formatting namenode in hadoop?
What Mapper does?
How can you connect an application
Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?
How is security achieved in Apache Hadoop?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
What is the jobtracker and what it performs in a hadoop cluster?
Explain how input and output data format of the hadoop framework?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)