Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain first() operation in Spark?
What is the utility of using Writable Comparable Custom Class in Map Reduce code?
what Hive query processor does?
Apache Spark is a good fit for which type of machine learning techniques?
Elucidate the concept of cap theorem?
What is NoSQL?
Define a namenode?
what happens when Hadoop spawned 50 tasks for a job and one of the task failed?
Can free form SQL queries be used with Sqoop import command?
What is the stable version of Hive ?
How Big is ‘Big Data’?
How to add the partition in existing table without the partition table?
What is Bucket in Hive?
Explain the composite key?
How is rdd distributed?