Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the network requirements for hadoop?
What is the difference between scala and spark?
Explain the uses of PIG?
Explain values() operation in apache spark?
What are the primary phases of a Reducer?
What ensures load balancing of the server in Kafka?
Explain how to Tune Kafka for Optimal Performance?
What is a rack awareness algorithm and why is it used in hadoop?
What are the two ways to create rdd in spark?
What is hive installation path?
is HQL case sensitive?
How Hadoop is cost-effective?
How does Mappers run method works?
What is the problem with the small file in Hadoop?
What is the InputFormat ?