Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
When running Spark applications, is it necessary to install Spark on all the nodes of YARN cluster?
What is the use of apache mahout?
What is spark architecture?
What is job tracker in Hadoop?
What is data pipeline in spark?
What do you mean by a bag in Pig?
How to write a Custom Key Class?
When should you use a reducer?
How does impala compare to hive and pig?
Is it possible to create multiple table in hive for same data?
Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.
Explain what is webdav in hadoop?
Name some best features of Ambari?
What does block mean?
What is difference between a MapReduce InputSplit and HDFS block