Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to set a property in Apache Tajo?
In MapReduce, ideally how many mappers should be configured on a slave?
How to optimize a MapReduce job?
What is throughput? How does HDFS provide high throughput?
Name the Spark Library which allows reliable file sharing at memory speed across different cluster frameworks.
What happens if an RDD partition is lost due to a worker node failure?
What is a secondary NameNode?
How much space will the split occupy in MapReduce?
What is the stable version of Hive?
What is Spark dynamic allocation?
What are the features of Presto?
What is a Hadoop custom partitioner?
When can you use ALTER KEYSPACE?
What is a common use case for Flume?
How does speculative execution work in Hadoop?