Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
If the number of DataNodes increases, do we need to upgrade the NameNode in Hadoop?
What is the difference between Hive and Impala?
What are the drawbacks of Apache Spark?
How does Cassandra perform write operations?
What are the major differences between Hadoop 2 and Hadoop 3?
Is there another way to check whether the NameNode is working?
What are the core components of Flume?
When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
What is crontab? Explain with a suitable example.
What is throughput? How does HDFS provide good throughput?
What languages does Spark support?
Explain the HCatalog architecture in brief.
What are the complex data types in Pig?
What is the relationship between HDFS, HBase, Pig, Hive, and Azkaban?
What is the difference between the MapReduce engine and the HDFS cluster?