What are executor cores in Spark?
What do you understand by the term snitch in Cassandra?
Differentiate between DROP and TRUNCATE in cqlsh.
How does the JobTracker schedule a task?
In how many ways can you create an RDD in Spark?
What is the significance of using the --compression-codec parameter?
Explain the HCatLoader APIs.
What is the difference between the Secondary NameNode, the Checkpoint Node, and the Backup Node?
List the components of Kafka.
How does Pig differ from MapReduce?
Where is sorting done in a Hadoop MapReduce job?
Have you ever used counters in Hadoop?
Explain Spark Streaming with a socket source.
What are shared variables?
How is JMX useful in Cassandra?