Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Describe SPM?
What is Secondary NameNode in Hadoop HDFS?
Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?
How to copy a file into HDFS with a different block size to that of existing block size configuration?
What is the history of apache mahout? Once did it start?
What are the important features of hadoop?
Is impala production ready?
How can apache spark be used alongside hadoop?
What do you mean by shuffling and sorting in MapReduce?
Name job control options specified by mapreduce.
Explain JobConf in MapReduce.
Can you explain apache kafka?
What are apache tajo sql functions?
Define a namenode?
Why can aggregation not be done in Mapper in MapReduce?