Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) When executing Hive queries in different directories, why is metastore_db created in all places from where Hive is launched?
882What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
771How can you prevent a large job from running for a long time? What do u think is more popular among the developers - Pig or Hive?
714Wherever (Different Directory) I run hive query, it creates new metastore_db, please explain the reason for it?
706
What does heartbeat in hdfs means?
Define partitions in apache spark.
What is Fault Tolerance?
Where is spark used?
What is executor in spark?
Which command is used to show the current hbase user?
State the differences between a node, a cluster and datacenter in Cassandra?
What is Apache Spark?
Where is rdd stored?
Explain first() operation in Apache Spark RDD?
Give the differences between the different types of primary keys in cassandra?
Which language is better for spark?
What is Apache Spark and what are the benefits of Spark over MapReduce?
Why is Transformation lazy in Spark?
Mention how many inputsplits is made by a hadoop framework?