Define Writable data types in MapReduce?
Why is Spark faster than Hive?
What are shared variables?
What is Kundera in Cassandra?
What are Spark executor cores?
If the number of DataNodes increases, do we need to upgrade the NameNode in Hadoop?
What is the relation between MapReduce and Hive?
How many daemon processes run on a Hadoop system?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
Explain the process for starting a Kafka server?
What is the procedure for NameNode recovery?
Explain how to tune Kafka for optimal performance?
How to skip header rows from a table in Hive?
How are joins performed in Impala?
What is cqlsh in Cassandra?