Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Spark.executor.memory in a Spark Application?
What do you mean by meta information in hdfs?
What role does worker node play in Apache Spark Cluster? And what is the need to register a worker node with the driver program?
Is kafka a message queue?
Name the examples of some companies that are using hadoop structure?
Explain about the data model operations in HBase?
How can we create a hadoop cluster from scratch?
How Facebook Uses Hadoop, Hive and Hbase ?
What is parallelize in spark?
What is spark yarn executor memoryoverhead?
What is spark catalyst?
How to enable/configure the compression of map output data in hadoop?
What is heartbeat in hdfs?
Hadoop sqoop word came from?
What is HDFS Federation?