What is Distributed Cache in Hadoop?
Answer / Akhil Kumar Bhalla
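Distributed Cache is a MapReduce facility that copies user-specified, read-only files, archives, or JARs to every node that runs map or reduce tasks for a given job, before those tasks start on that node. Each task can then read the files from local disk instead of fetching them repeatedly over the network. It is useful for distributing libraries or small lookup data sets that the job needs.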
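A common use is a map-side lookup: a small reference file is shipped to each node, loaded into memory once per task, and consulted for every input record. The sketch below illustrates only that task-side pattern in plain Java, so it runs without a cluster. The class name, the file layout (tab-separated code and name), and the record format are assumptions made for illustration. In a real job, the driver would register the file with `Job.addCacheFile(URI)` and the task would locate it through `context.getCacheFiles()`, as noted in the comments.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-alone sketch of the map-side lookup pattern.
// It does not use Hadoop classes; the comments mark where they would appear.
public class CachedLookupSketch {

    // Stand-in for Mapper.setup(): read the local copy of the cached file once per task.
    // In a real job the path would come from context.getCacheFiles(), after the
    // driver registered the file with job.addCacheFile(uri).
    static Map<String, String> loadLookup(Path localCopy) throws IOException {
        Map<String, String> lookup = new HashMap<>();
        try (BufferedReader reader = Files.newBufferedReader(localCopy, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                // Assumed layout: code<TAB>name; lines without a tab are skipped.
                String[] parts = line.split("\t", 2);
                if (parts.length == 2) {
                    lookup.put(parts[0], parts[1]);
                }
            }
        }
        return lookup;
    }

    // Stand-in for Mapper.map(): replace the leading code of a "code,value"
    // record with the name found in the in-memory lookup table.
    static String enrich(Map<String, String> lookup, String record) {
        String[] fields = record.split(",", 2);
        String name = lookup.getOrDefault(fields[0], "UNKNOWN");
        String rest = fields.length > 1 ? fields[1] : "";
        return name + "\t" + rest;
    }

    public static void main(String[] args) throws IOException {
        // A temporary file plays the role of the node-local cached copy.
        Path cached = Files.createTempFile("countries", ".tsv");
        Files.write(cached, Arrays.asList("IN\tIndia", "DE\tGermany"), StandardCharsets.UTF_8);

        Map<String, String> lookup = loadLookup(cached);
        for (String record : Arrays.asList("IN,42", "DE,7", "XX,1")) {
            System.out.println(enrich(lookup, record));
        }
        Files.delete(cached);
    }
}
```

Loading the table once in the setup step, rather than on every record, is the reason the cache pays off: the file is copied to a node once per job and each task reads it only from local disk.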