Explain the HDFS architecture and list the various daemons in an HDFS cluster.
Answer / Mayank Kumar
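HDFS (Hadoop Distributed File System) is designed to store large datasets reliably on commodity hardware. It follows a master/worker architecture with two main types of nodes: the NameNode and the DataNodes. The NameNode manages the file system namespace and is the central metadata repository: it tracks which blocks make up each file and which DataNodes hold each block, and it persists this state as an fsimage snapshot plus an edit log of subsequent changes. DataNodes store the actual data blocks, serve read and write requests from clients, and report to the NameNode through periodic heartbeats and block reports. Other important daemons include the SecondaryNameNode, which periodically merges the edit log into a new fsimage (checkpointing) so that the edit log does not grow without bound and NameNode restarts stay fast; despite its name, it is not a hot standby and does not take over if the NameNode fails. In a high-availability (HA) deployment, a Standby NameNode takes over the checkpointing role, JournalNodes store the shared edit log that the Active NameNode writes and the Standby NameNode reads, and a ZKFailoverController (ZKFC) on each NameNode host coordinates automatic failover through ZooKeeper.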
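The split between metadata and data described above can be illustrated with a small toy model. This is only a sketch under simplifying assumptions, not Hadoop code: the names ToyNameNode, ToyDataNode, write_file, read_file and checkpoint are hypothetical, block placement is plain round-robin rather than HDFS's rack-aware policy, and the edit log is reduced to create/delete operations.

```python
from dataclasses import dataclass, field


@dataclass
class ToyDataNode:
    """Hypothetical stand-in for a DataNode: holds block contents only."""
    name: str
    blocks: dict = field(default_factory=dict)  # block id -> bytes


@dataclass
class ToyNameNode:
    """Hypothetical stand-in for the NameNode: holds metadata only."""
    namespace: dict = field(default_factory=dict)        # path -> [block ids]
    block_locations: dict = field(default_factory=dict)  # block id -> [DataNode names]


def write_file(namenode, datanodes, path, data, block_size=4, replication=2):
    """Split data into blocks, store replicas on DataNodes, record metadata."""
    block_ids = []
    for offset in range(0, len(data), block_size):
        index = offset // block_size
        block_id = f"{path}#blk{index}"
        # Assumption: simple round-robin placement (real HDFS is rack-aware).
        targets = [datanodes[(index + r) % len(datanodes)] for r in range(replication)]
        for dn in targets:
            dn.blocks[block_id] = data[offset:offset + block_size]
        namenode.block_locations[block_id] = [dn.name for dn in targets]
        block_ids.append(block_id)
    namenode.namespace[path] = block_ids


def read_file(namenode, datanodes, path):
    """Ask the NameNode where the blocks are, then fetch them from DataNodes."""
    by_name = {dn.name: dn for dn in datanodes}
    parts = []
    for block_id in namenode.namespace[path]:
        first_replica = namenode.block_locations[block_id][0]
        parts.append(by_name[first_replica].blocks[block_id])
    return b"".join(parts)


def checkpoint(fsimage, edit_log):
    """Replay the edit log onto a copy of the snapshot (conceptual checkpoint)."""
    merged = dict(fsimage)
    for op, path in edit_log:
        if op == "create":
            merged[path] = []
        elif op == "delete":
            merged.pop(path, None)
    return merged


datanodes = [ToyDataNode("dn1"), ToyDataNode("dn2"), ToyDataNode("dn3")]
namenode = ToyNameNode()
write_file(namenode, datanodes, "/demo.txt", b"hello hdfs")
restored = read_file(namenode, datanodes, "/demo.txt")
new_image = checkpoint({"/a": []}, [("create", "/b"), ("delete", "/a")])
```

In this toy run the NameNode object never holds file bytes, only block ids and locations, while each block exists on two DataNodes. The checkpoint function shows the idea behind the SecondaryNameNode's job: fold the accumulated edits into a fresh snapshot so the log can be truncated.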