Ideally, what should the replication factor be in Hadoop?
Why can we not create the directory /user/dataflair/inpdata001 when the NameNode is in safe mode?
Can Hadoop handle streaming data?
How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
Is it possible to have Hadoop job output in multiple directories?
What is the role of the JobTracker in Hadoop?
What is a TaskTracker in Hadoop?
In hadoop_pid_dir, what does pid stand for?
What is logistic regression?
What is the difference between an InputSplit and a block?
What does high availability of a NameNode mean?
What is Disk Balancer in Hadoop?
What are the port numbers of the JobTracker?