Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Where do you specify the Mapper Implementation?
What is the difference between Hive CLI and Beeline?
Can you define inputsplit in hadoop?
What are the other components of Cassandra?
What do you mean by Speculative execution in Apache Spark?
What is parallelize in spark?
What is tasktracker in hadoop?
Explain the commit log?
Which one is better hadoop or spark?
What is high availability in hadoop?
What are the various InputFormats in Hadoop?
What is flume and sqoop?
Is the keyword ‘DEFINE’ as a function name?
Does spark use zookeeper?
Which is better hadoop or spark?