Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) In which scenario Pig is better fit than MapReduce?
What ensures load balancing of the server in Kafka?
What is the maximum recommended cell size?
How do you list all databases whose name starts with p?
What is Sqoop Validation?
Can you explain spark core?
Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?
Define the management tools in Cassandra?
What is cloudera and why it is used?
What are the three types of tombstone markers in hbase?
What file systems does spark support?
Explain reduceByKey() Spark operation?
What is illustrate used for in apache pig?
Which scala library is used for functional programming?
Explain the hdfs architecture?