Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What do you understand about yarn?
What is a metastore in hive?
List of the some best tools that can be useful for data-analysis?
What is a ledger in bookkeeper?
Why do fires spark?
Why slaves limited to 4000 in hadoop version 1?
Explain the Differences between Hive and Spark SQL?
What is accumulator in spark?
How analysis of Big Data is useful for organizations?
What is lambda in spark?
Discuss the precautions that are needed to take care while adding a column?
What is the difference between apache mahout and apache spark’s mllib?
Are spark dataframes distributed?
What does ‘jps’ command do?
What is the difference between a MapReduce InputSplit and HDFS block?