Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is key-value store db? Explain with an example.
When to use Hive?
What is hadoop spark?
What is salting in spark?
Can you briefly explain the apache mahout?
What is spark and what is its purpose?
Does Apache Flume provide support for third party plug-ins?
How do you parse data in xml? Which kind of class do you use with java to parse data?
Can you explain textinformat?
How is spark fault tolerance?
What is in memory in spark?
Differentiate between Hadoop MapReduce and Pig?
How to format the HDFS? How frequently it will be done?
Define the management tools in Cassandra?
How to change a number of mappers running on a slave in MapReduce?