Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the different tools used for the ambari monitoring purpose?
How do I check my spark status?
What is difference between Column and Super Column?
What are the features of Standalone (local) mode?
What are the different database elements of cassandra?
What is JPS? Why is it used in Hadoop?
What is output format in hadoop?
Why HDFS?
List of some best tools that can be useful for data-analysis?
Explain plucktuple?
Explain a scenario where you will be using spark streaming.
What is map in spark?
Does Pig differ from MapReduce? If yes, how?
Bag in pig ?
What are the key segments of hive architecture?