What is the difference between an inputsplit and a block?
Answer / Akash Kumar Srivastava
An InputSplit in Hadoop represents a part of the data that a mapper or reducer will process. It doesn't necessarily correspond to a physical file block. A Block, on the other hand, is a contiguous set of data stored within a DataNode and has a fixed size (128 MB by default). Multiple InputSplits can be assigned to a single Block during data processing.
| Is This Answer Correct ? | 0 Yes | 0 No |
what happens when namenode goes down during file read operation in hadoop?
What does hadoop-metrics.properties file do?
Explain is it possible to search for files using wildcards?
How often DataNode send heartbeat to NameNode in Hadoop?
What is a rack?
What a task tracker is in hadoop?
How analysis of Big Data is useful for organizations?
Clarify how job tracker schedules an assignment?
What is CAP Theorem? What aspects does Hadoop support from this theorem?
Can you explain combiner?
Explain the Single point of Failure in Hadoop?
Mention what daemons run on a master node and slave nodes?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)