Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the difference between a MapReduce InputSplit and HDFS block?
Define the roles of the file system in any framework?
Difference between groupByKey vs reduceByKey in Apache Spark?
Hadoop Libraries and Utilities and Miscellaneous Hadoop Applications?
Whenever we run hive query, new metastore_db is created. Why?
What is a flume agent?
What are the uses of explode hive?
Where is the Mapper Output intermediate kay-value data stored ?
Why Should we use Apache Kafka Cluster?
What is spark vs hadoop?
What is Cassandra Query Language?
What is the advantage of using –password-file rather than -P option while preventing the display of password in the sqoop import statement?
How does cassandra perform read operation?
Explain tajo configuration files?
What is the InputSplit in map reduce ?