Why do we use HDFS for applications having large data sets and not when there are lot of small files?
What exactly is hadoop?
How does NameNode tackle DataNode failures?
Virtual Box & Ubuntu Installation?
how is a file of the size 1 GB uncompressed
What is crontab? Explain with suitable example?
What is Apache Hadoop YARN?
What is hadoop framework?
Explain how input and output data format of the hadoop framework?
What are active and passive "NameNodes"?
What should be the ideal replication factor in Hadoop Cluster?
What are the basic available commands in Hadoop sqoop ?
Which are the three main hdfs-site.xml properties?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)