Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is identity mapper and reducer? In which cases can we use them?
What is an Agent?
Explain what is a difference between an input split and hdfs block?
What is hadoop sqoop?
Define "PageRank".
What are the different UDF’s in Pig?
What are the primary phases of the reducer?
What are the identity mapper and reducer in MapReduce?
List out the commands that are used to start, check the progress and stop the ambari server?
Is apache spark in demand?
What are the basic commands in Apache Sqoop and its uses?
How does spark program work?
What do you understand by Filters in HBase?
Who is a 'user' in HDFS?
what is the default replication factor in HDFS?