What is partitioning?
No Answer is Posted For this Question
Be the First to Post Answer
What is replication factor?
Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?
Which object can be used to get the progress of a particular job
Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?
Why use hadoop?
What is Reducer in Hadoop?
What happen if a datanode loses network connection for a few minutes?
What happens if the number of reducers is 0 in Hadoop?
Is it possible to provide multiple input to Hadoop? If yes then how?
Doesn’t google have its very own version of dfs?
Mention what are the three modes in which hadoop can be run?
How does hadoop achieve fault tolerance?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)